# data-pipelines

Here are 262 public repositories matching this topic...

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking, and embedding.

  • Updated May 19, 2025
  • HTML
fluvio
preswald

Preswald is a WASM packager for Python-based interactive data apps: bundle complete, complex data workflows, particularly visualizations, into single files that run entirely in the browser, using Pyodide, DuckDB, Pandas, Plotly, Matplotlib, and more. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.

  • Updated May 19, 2025
  • Python
elementary

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

  • Updated May 19, 2025
  • HTML
odd-platform

First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.

  • Updated Feb 19, 2025
  • Java
