Home | Connectors | Prodigy | Prodigy - OpenText Magellan Text Mining Engine Integration and Automation

Prodigy - OpenText Magellan Text Mining Engine Integration and Automation

Integrate Prodigy Artificial intelligence (AI) and OpenText Magellan Text Mining Engine Artificial intelligence (AI) apps with any of the apps from the library with just a few clicks. Create automated workflows by integrating your apps.

Common Integration Use Cases Between Prodigy and OpenText Magellan Text Mining Engine

Prodigy and OpenText Magellan Text Mining Engine complement each other well in enterprise AI and text analytics workflows. Magellan Text Mining Engine excels at extracting entities, topics, and relationships from large volumes of unstructured text, while Prodigy is designed for efficient human-in-the-loop annotation to create high-quality training data. Together, they support faster model development, better labeling quality, and more scalable text intelligence programs.

1. Human-in-the-loop entity labeling for NLP model training

Data flow: OpenText Magellan Text Mining Engine ? Prodigy

Magellan can process large document sets such as contracts, case notes, emails, or investigation files and pre-extract candidate entities like names, organizations, dates, locations, and regulatory terms. Those extracted spans are then sent into Prodigy for expert review and correction. This reduces manual labeling effort and improves consistency across annotators.

  • Business value: Faster creation of high-quality training data for custom NLP models
  • Operational benefit: Analysts focus on validation instead of starting from scratch
  • Typical users: Legal operations, compliance teams, data science teams

2. Topic classification dataset creation for document triage

Data flow: OpenText Magellan Text Mining Engine ? Prodigy

Magellan can identify likely topics or themes across large document repositories, such as fraud, privacy, litigation, supplier risk, or customer complaint categories. Prodigy can then be used to confirm or refine those topic labels and build a supervised training set for downstream classification models. This is especially useful when organizations need to automate document routing or prioritization.

  • Business value: Better document triage and faster case routing
  • Operational benefit: Reduces dependence on manual review queues
  • Typical users: Records teams, legal review teams, risk operations

3. Relationship extraction validation for knowledge graph development

Data flow: OpenText Magellan Text Mining Engine ? Prodigy

Magellan can detect relationships between entities, such as supplier to subsidiary, person to organization, or event to date. Prodigy can be used to validate these relationships and label edge cases that require domain expertise. The resulting dataset can support knowledge graph construction, investigative analytics, or entity resolution initiatives.

  • Business value: More accurate relationship models and better enterprise knowledge structures
  • Operational benefit: Improves precision before scaling automated extraction
  • Typical users: Intelligence teams, fraud analysts, master data teams

4. Compliance and policy violation detection model improvement

Data flow: OpenText Magellan Text Mining Engine ? Prodigy

Magellan can scan communications, policy documents, and case files to surface potentially relevant passages related to insider risk, harassment, conflicts of interest, or regulatory breaches. Prodigy can then be used by compliance reviewers to label true positives, false positives, and nuanced exceptions. This creates a stronger training set for automated monitoring and alerting models.

  • Business value: More reliable compliance detection with fewer false alerts
  • Operational benefit: Speeds up review of high-volume text sources
  • Typical users: Compliance, internal audit, legal investigations

5. Active learning loop for specialized text analytics models

Data flow: Bi-directional

Magellan can generate initial predictions on unlabeled text, and Prodigy can prioritize the most uncertain or informative samples for human annotation. After review, the corrected labels can be fed back into the model training pipeline and used by Magellan for another extraction pass. This closed loop is effective for building specialized models in domains with limited labeled data, such as healthcare claims, financial investigations, or technical support cases.

  • Business value: Faster model improvement with less annotation volume
  • Operational benefit: Focuses expert time on the most valuable samples
  • Typical users: ML teams, domain experts, analytics teams

6. Large-scale document review acceleration for legal discovery

Data flow: OpenText Magellan Text Mining Engine ? Prodigy

Magellan can analyze large discovery collections to identify likely relevant documents, extract named entities, and highlight key topics. Prodigy can then be used to label relevance, privilege, issue tags, or confidentiality categories on a curated subset of documents. This supports the development of custom review models that reduce manual attorney review time.

  • Business value: Lower review costs and faster legal response times
  • Operational benefit: Improves prioritization of high-value documents
  • Typical users: Litigation support, eDiscovery teams, outside counsel operations

7. Domain-specific text model training for intelligence and risk monitoring

Data flow: OpenText Magellan Text Mining Engine ? Prodigy

Magellan can process news feeds, reports, filings, and internal notes to identify entities and emerging themes. Prodigy can then be used to label domain-specific categories such as sanctions exposure, geopolitical risk, adverse media, or supply chain disruption. The labeled data can train custom classifiers that improve monitoring accuracy across risk and intelligence workflows.

  • Business value: Better early warning signals and more targeted risk detection
  • Operational benefit: Reduces noise in large-scale text monitoring programs
  • Typical users: Risk management, corporate security, intelligence analysts

In summary, OpenText Magellan Text Mining Engine is strongest as a large-scale text extraction and insight engine, while Prodigy is strongest as a fast, flexible annotation layer for training and refining models. Integrated together, they create a practical workflow for turning unstructured text into reliable, business-ready AI outputs.

How to integrate and automate Prodigy with OpenText Magellan Text Mining Engine using OneTeg?