What is a Data Pipeline?
An automated sequence of steps that extracts data from source systems, transforms it, and loads it into destinations for ML or analytics.
Data pipelines move and transform data through ingestion, cleaning, enrichment, and storage stages. In AI systems, pipelines prepare training data, compute features, generate embeddings, and keep vector stores synchronized with source systems.
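The four stages named above can be sketched as a chain of small functions. This is a minimal illustration under stated assumptions, not a reference implementation: the `Record` type, the stage function names, the `id`/`text` row fields, and the in-memory dictionary standing in for a destination store are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator


@dataclass
class Record:
    """Hypothetical cleaned record; the field names are assumptions."""
    doc_id: str
    text: str
    word_count: int = 0


def ingest(raw_rows: Iterable[dict]) -> Iterator[dict]:
    """Ingestion: read raw rows from a source (here, any iterable of dicts)."""
    yield from raw_rows


def clean(rows: Iterable[dict]) -> Iterator[Record]:
    """Cleaning: normalize whitespace and drop rows with no id or no text."""
    for row in rows:
        text = " ".join(str(row.get("text") or "").split())
        if row.get("id") is not None and text:
            yield Record(doc_id=str(row["id"]), text=text)


def enrich(records: Iterable[Record]) -> Iterator[Record]:
    """Enrichment: attach a derived field (a word count as a stand-in feature)."""
    for record in records:
        record.word_count = len(record.text.split())
        yield record


def store(records: Iterable[Record], destination: Dict[str, Record]) -> int:
    """Storage: upsert by id so re-running the pipeline does not duplicate rows."""
    written = 0
    for record in records:
        destination[record.doc_id] = record
        written += 1
    return written


def run_pipeline(raw_rows: Iterable[dict], destination: Dict[str, Record]) -> int:
    """Compose the stages; generators keep rows streaming one at a time."""
    return store(enrich(clean(ingest(raw_rows))), destination)


if __name__ == "__main__":
    source_rows = [
        {"id": 1, "text": "  hello   world "},
        {"id": 2, "text": ""},           # dropped: empty text
        {"id": None, "text": "orphan"},  # dropped: missing id
    ]
    destination: Dict[str, Record] = {}
    print(run_pipeline(source_rows, destination), destination)
```

Writing to the destination keyed by `doc_id` is one simple way to keep a store in step with its source: running the pipeline again over the same rows overwrites existing entries instead of appending duplicates. In a real AI pipeline the enrichment step would typically compute features or embeddings, and the destination would be a feature store or vector store.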