Fivetran is excited to announce our upcoming integration with the Polaris data catalog. Soon, Fivetran users will be able to send contextual metadata from Fivetran directly to Polaris, reducing complexity across systems, destinations and vendors.
Polaris catalog is a vendor-neutral, open source catalog for Apache Iceberg, providing users with the flexibility to streamline their data management by writing metadata across engines, eliminating the need to copy and maintain multiple datasets.
Technical data catalogs (like Unity Catalog, Glue, and Polaris) serve as centralized repositories where organizations can manage, organize and discover their technical metadata from across all their data lake environments. Technical data catalogs are essential for engineering workflows, helping users find, understand and govern their data assets across various systems, improving data accessibility and facilitating better decision making.
Fivetran is excited to leverage the open source Polaris catalog to develop a hosted catalog solution for customers using the Fivetran Managed Data Lake Service. With it, customers will no longer have to create and maintain their own technical metadata catalog. Our upcoming integration adds further value to the Fivetran Managed Data Lake Service by directing metadata to Polaris, which works in conjunction with the data lake.
A vendor-agnostic data catalog benefits businesses because it can integrate with a wide range of data sources and tools, avoiding vendor lock-in and providing the flexibility to adapt to changing technologies and business needs. A vendor-neutral approach allows enterprises to scale and evolve their data strategies without being constrained by a single provider’s ecosystem.
With Fivetran’s integration, customers don’t have to worry about how their data and metadata are maintained. Instead, they can leverage their external storage location and avoid copying and managing data across multiple locations. Fivetran will set up their data lake and maintain technical metadata in Polaris, allowing customers to query the data from downstream warehouses or tools without duplicating it into those warehouses, which would create additional complexity and generate unneeded storage costs.
Fivetran’s Managed Data Lake Service enables customers to automatically and securely land clean, organized and standardized data from over 500 sources in their data lake using query-ready open table formats. Fivetran handles the compaction and deduplication of data in flight, landing clean, analytics-ready data in the data lake in the Apache Iceberg open table format.
Not a Fivetran user? Start a free trial!