NEW: Scale AI Case Study — ~1,900 data requests per week across 4 business units Read now →
Contents
Select Star
Select Star is a modern, automated data catalog founded in 2020 and headquartered in San Francisco. It competes against Atlan and DataHub with a focus on automated column-level lineage and fast time-to-value for mid-market data teams.
Select Star is one of the scrappier modern data catalogs, founded in 2020 in San Francisco by Shinji Kim, a second-time founder whose previous company — Concord Systems, a distributed stream-processing framework — was acquired by Akamai in 2016. Select Star arrived a year after Atlan and shares much of the same worldview: catalogs should be automated, beautiful, cloud-native, and friendly to data engineers and analysts rather than to governance committees.
The differentiator Select Star has consistently leaned on is speed to value through automation. The pitch is that you point Select Star at your warehouse and your BI tool, wait a few hours, and have a fully populated catalog with column-level lineage, without any manual tagging or glossary curation upfront. For mid-market data teams that want most of what Atlan offers at a lower price point and with less implementation effort, Select Star is a compelling alternative.
Select Star covers the expected modern-catalog feature set:
Automated column-level lineage. Select Star's core engine parses query history from connected warehouses (Snowflake, BigQuery, Redshift, Databricks) and transformation tools (dbt) to reconstruct table-level and column-level lineage automatically. The lineage extends into BI tools (Looker, Tableau, Power BI, Mode, Sigma), so you can trace a specific chart back through its views, joins, and filters to the source columns. This end-to-end BI-to-source lineage is one of Select Star's stronger demo moments.
Automated documentation and popularity. Select Star auto-populates descriptions where possible (from dbt, from schema comments, and more recently from AI-generated summaries), ranks tables by usage, and surfaces the most common queries written against each one. The "popular tables" and "top users" views turn query logs into a usability feature the same way Alation's behavioral metadata did a decade earlier.
Impact analysis. Before you drop or rename a column, Select Star shows you every downstream asset — dbt models, dashboards, queries, notebooks — that would break. This is the practical payoff of column-level lineage and is the feature most data engineers remember fondly from their first demo.
dbt-native integration. Like Atlan, Select Star treats dbt as first-class: dbt models, tests, docs, and exposures flow directly into the catalog, and the lineage graph merges dbt's declared dependencies with query-log-derived lineage for cross-validation.
Data discovery chatbot and AI assistant. Select Star has shipped AI features for natural-language search over the catalog and for auto-generating descriptions, tagging PII, and suggesting owners. These are table-stakes in 2026 but Select Star was relatively early to them.
Governance basics. Select Star includes glossary, tags, certifications, PII classification, and ownership workflows — lighter than Collibra's operating model but sufficient for most non-regulated companies.
Select Star is delivered as a multi-tenant SaaS, with metadata ingestion handled by native connectors that pull query history, information schema data, and BI metadata on a schedule. The heavy lifting — SQL parsing for lineage, usage ranking, AI-generated descriptions — happens server-side. A single-tenant or VPC-deployed option exists for customers with stricter data-residency requirements.
Select Star is a legitimate modern alternative that wins where Atlan feels too expensive or too heavy. It is not the category leader and is unlikely to become one, but it has a clear lane: mid-market cloud-native data teams who want automated column-level lineage without a six-figure contract. In that segment it wins real deals against Atlan, DataHub, Secoda, and Castor.
The honest assessment of the competitive position goes something like this:
Where Select Star struggles. Column-level lineage has become a commodity feature across the modern catalog wave, which makes the "automation" story harder to sustain as a differentiator over time. The company is also considerably smaller than Atlan and DataHub by funding, headcount, and logo count, which makes large-enterprise procurement departments nervous. And the AI-catalog convergence — where Snowflake Horizon, Databricks Unity Catalog, and warehouse-native metadata keep getting better — hits mid-market independent catalogs hardest.
Where Select Star wins cleanly. It wins at companies where the data team lead has personally run a catalog POC, hates the idea of a months-long implementation, and wants to be up and running by the end of the week with something that immediately shows column-level impact analysis for dbt changes. That is a real and durable customer segment, and Select Star serves it well.
TextQL integrates with Select Star to read table descriptions, column metadata, tags, lineage, and usage data. Select Star's auto-populated lineage and popularity rankings are particularly useful for TextQL: when a business user asks an ambiguous question, Ana can lean on Select Star's usage signals to pick the table an actual human analyst would have picked. For mid-market Snowflake and BigQuery customers running dbt, the combination of Select Star and TextQL gives you a complete discovery-and-query stack at a price point well below Atlan plus any enterprise AI-analytics vendor.
See TextQL in action
Related topics