Overview
PDF Metadata Extractor is a lightweight OutSystems Forge component that allows applications to extract metadata from user-uploaded PDF files completely on the client side, using PDF.js. No server calls. No storage. Everything happens in the browser.
Key Features
• Upload PDF files directly in the browser
• Extract key metadata instantly: Title, Author, Subject, Creator, Producer, Keywords, PDF Version, Creation Date, Modification Date, Number of Pages, File Name, File Size
• Returns both extracted metadata as JSON string and original PDF file as Binary Data
• Minimal UI
• Fast and dependency-free (other than PDF.js)
Use Cases
• KYC document upload - Validate metadata before upload
• Document management - Auto-fill fields from PDF metadata
• File verification - Detect corrupted or edited PDFs
• Compliance - Check creation/modification dates instantly
• Content Processing - Extract metadata for indexing