Home | Connectors | Prodigy | Prodigy - OpenText Magellan Text Mining Engine Integration and Automation
Prodigy and OpenText Magellan Text Mining Engine complement each other well in enterprise AI and text analytics workflows. Magellan Text Mining Engine excels at extracting entities, topics, and relationships from large volumes of unstructured text, while Prodigy is designed for efficient human-in-the-loop annotation to create high-quality training data. Together, they support faster model development, better labeling quality, and more scalable text intelligence programs.
Data flow: OpenText Magellan Text Mining Engine ? Prodigy
Magellan can process large document sets such as contracts, case notes, emails, or investigation files and pre-extract candidate entities like names, organizations, dates, locations, and regulatory terms. Those extracted spans are then sent into Prodigy for expert review and correction. This reduces manual labeling effort and improves consistency across annotators.
Data flow: OpenText Magellan Text Mining Engine ? Prodigy
Magellan can identify likely topics or themes across large document repositories, such as fraud, privacy, litigation, supplier risk, or customer complaint categories. Prodigy can then be used to confirm or refine those topic labels and build a supervised training set for downstream classification models. This is especially useful when organizations need to automate document routing or prioritization.
Data flow: OpenText Magellan Text Mining Engine ? Prodigy
Magellan can detect relationships between entities, such as supplier to subsidiary, person to organization, or event to date. Prodigy can be used to validate these relationships and label edge cases that require domain expertise. The resulting dataset can support knowledge graph construction, investigative analytics, or entity resolution initiatives.
Data flow: OpenText Magellan Text Mining Engine ? Prodigy
Magellan can scan communications, policy documents, and case files to surface potentially relevant passages related to insider risk, harassment, conflicts of interest, or regulatory breaches. Prodigy can then be used by compliance reviewers to label true positives, false positives, and nuanced exceptions. This creates a stronger training set for automated monitoring and alerting models.
Data flow: Bi-directional
Magellan can generate initial predictions on unlabeled text, and Prodigy can prioritize the most uncertain or informative samples for human annotation. After review, the corrected labels can be fed back into the model training pipeline and used by Magellan for another extraction pass. This closed loop is effective for building specialized models in domains with limited labeled data, such as healthcare claims, financial investigations, or technical support cases.
Data flow: OpenText Magellan Text Mining Engine ? Prodigy
Magellan can analyze large discovery collections to identify likely relevant documents, extract named entities, and highlight key topics. Prodigy can then be used to label relevance, privilege, issue tags, or confidentiality categories on a curated subset of documents. This supports the development of custom review models that reduce manual attorney review time.
Data flow: OpenText Magellan Text Mining Engine ? Prodigy
Magellan can process news feeds, reports, filings, and internal notes to identify entities and emerging themes. Prodigy can then be used to label domain-specific categories such as sanctions exposure, geopolitical risk, adverse media, or supply chain disruption. The labeled data can train custom classifiers that improve monitoring accuracy across risk and intelligence workflows.
In summary, OpenText Magellan Text Mining Engine is strongest as a large-scale text extraction and insight engine, while Prodigy is strongest as a fast, flexible annotation layer for training and refining models. Integrated together, they create a practical workflow for turning unstructured text into reliable, business-ready AI outputs.