Home | Connectors | Prodigy | Prodigy - OpenText Content Storage Service Integration and Automation
Prodigy and OpenText Content Storage Service complement each other well in enterprise AI and content operations. Prodigy handles the active labeling and annotation of training data, while OpenText Content Storage Service provides secure, scalable, compliant object storage for large volumes of source content, intermediate datasets, and annotation outputs. Together, they support efficient machine learning workflows with stronger governance, lower storage overhead, and better collaboration across data science, operations, and compliance teams.
Data flow: OpenText Content Storage Service to Prodigy
Enterprises can store large source datasets such as images, documents, audio files, or video clips in OpenText Content Storage Service and expose them to Prodigy for annotation jobs. This is especially useful for computer vision, document classification, and NLP projects where data volumes are high and need to remain centrally managed.
Data flow: Prodigy to OpenText Content Storage Service
After annotation is completed in Prodigy, labeled datasets can be exported and stored in OpenText Content Storage Service as the authoritative version for downstream model training. This creates a durable record of the exact dataset used for a given model version, which is valuable for reproducibility, audit readiness, and regulated environments.
Data flow: Bi-directional
Prodigy?s active learning workflow can be paired with OpenText Content Storage Service to continuously refresh the pool of unlabeled content. New documents or media files can be ingested into OpenText, selected for annotation in Prodigy based on model uncertainty, and then written back as labeled outputs for the next training cycle.
Data flow: Prodigy to OpenText Content Storage Service
Organizations often need subject matter experts to review labeled samples, edge cases, or disagreement sets. Prodigy can export these review packages to OpenText Content Storage Service so business users can access them through controlled enterprise content processes without needing direct access to the annotation environment.
Data flow: Prodigy to OpenText Content Storage Service
For regulated industries such as healthcare, financial services, and public sector organizations, it is important to retain evidence of how AI models were trained. Prodigy annotation logs, label exports, and dataset snapshots can be archived in OpenText Content Storage Service to support internal audits, model risk management, and regulatory inquiries.
Data flow: OpenText Content Storage Service to Prodigy and Prodigy to OpenText Content Storage Service
Enterprises running large-scale labeling programs for image libraries, scanned documents, or video archives can use OpenText Content Storage Service as the primary repository for all content assets. Prodigy then accesses only the subsets needed for each labeling sprint, and completed annotations are stored back in OpenText for long-term management.
Data flow: Legacy content systems to OpenText Content Storage Service to Prodigy
Organizations modernizing legacy storage can migrate unstructured content into OpenText Content Storage Service and then use Prodigy to label selected content for AI use cases. This is a practical approach for enterprises that want to unlock value from archived documents, historical images, or legacy case files.
Overall, integrating Prodigy with OpenText Content Storage Service helps enterprises build a controlled, scalable AI data pipeline. OpenText provides the secure content foundation, while Prodigy adds the human-in-the-loop labeling capability needed to create high-quality training data efficiently.