textractor
Service icon

TEXTractor

Stable version 2.5.0 (Compatible with OutSystems 11)
Uploaded
 on 21 Apr (8 days ago)
 by 
5.0
 (1 rating)
textractor

TEXTractor

Compatible with:
Created on OutSystems 11

Version 2.5.0

Stable
Current
See documentation
Uploaded on 21 Apr (8 days ago) by 
Compatible with:
Version 11
Database:
All
Release notes:

Added support for binary extraction of attachments from both MSG and EML file types.

Application Objects:
TEXTractor has 0 AOs.

Version 2.4.1

Stable
See documentation
Uploaded on 19 Apr (10 days ago) by 
Compatible with:
Version 11
Database:
All
Release notes:

Updated NuGet package dependencies to latest versions.

Application Objects:
TEXTractor has 0 AOs.

Version 2.4.0

Stable
See documentation
Uploaded on 5 Apr (3 weeks ago) by 
Compatible with:
Version 11
Database:
All
Release notes:

Document Structured Extraction Enhancements:

  • New Table Support: Introduced a dedicated Table element type alongside the existing Paragraph type. This allows for structured tabular data retrieval from DOC and DOCX files.
  • Human-Readable Styles: Improved DOC extraction to return actual Paragraph Style Names (e.g., "Heading 1") instead of internal style indexes.
  • Page Referencing: Added a PageNumber attribute to all document elements extracted from PDFs, enabling easier navigation and source tracking.
  • Enhanced PDF Segmentation: Upgraded the PDF page segmentation algorithm from "Recursive XY Cut" to "Docstrum". This change significantly improves the reliability of element detection, especially in complex layouts with tight margins or overlapping content.
Application Objects:
TEXTractor has 0 AOs.