Hyland Document Filters Blog

Document Filters 24.3 Release

August 21, 2024 · 4 min read

Nabih Metri

Product Manager

We're excited to announce the latest release of Document Filters, packed with powerful new features designed to enhance your document processing capabilities. This update introduces a JSON Output Type for structured data handling, a Markdown Output Type for streamlined document conversion, advanced PDF Table Extraction for improved data accuracy, and MSI Installer Sub-File Extraction for comprehensive file analysis. Additionally, we've added community-inspired support for Hancom Hangul HWPX text extraction and HD rendering. Read on to discover how these new features can elevate your workflows and drive better results.

Exploring the Document Comparison APIs

May 29, 2024 · 8 min read

Ben Truscott

Document Filters Principal Engineer

The release of Hyland Document Filters 24.2 marks a significant milestone with the introduction of powerful Document Comparison APIs. These new features are designed to enhance the ability of developers to implement robust document comparison capabilities within their applications, facilitating the identification and management of changes across various document types.

Document Filters 24.2 Release

May 15, 2024 · 3 min read

Nabih Metri

Product Manager

The new 24.2 release of Hyland Document Filters introduces a range of features designed to streamline document comparison, improve accessibility, and integrate advanced OCR technology, with features being directly influenced by the Document Filters community. With these updates, Document Filters continues to evolve as a versatile tool that adapts to the diverse needs of its users.

Document Filters for AI Solutions

May 14, 2024 · 3 min read

Nabih Metri

Product Manager

In the dynamic realm of AI development, Hyland's Document Filters is a game-changer, offering developers a versatile toolkit for file identification, content extraction, document transformation, and document conversion across over 600 file formats. Our presentation underscores its pivotal role in AI solutions, from enhancing data security with robust redaction features to streamlining operations, and to reducing costs. As AI companies integrate Document Filters into their enterprise offerings, it exemplifies the toolkit's potential to revolutionize AI applications, ensuring efficiency, security, and compliance in our data-driven future.

Converting Documents with Comments

May 1, 2024 · 3 min read

Nabih Metri

Product Manager

In software development, where every line of code counts, the Hyland Document Filters SDK is a beacon of efficiency for the document conversion process. This SDK is crafted to streamline the document conversion solution, ensuring that embedded comments—the lifeblood of project collaboration, packed with critical feedback and key insights—are not just preserved, but seamlessly integrated into the converted documents. It’s a solution that resonates with the developer community, offering a way to enhance digital workflows with precision and ease. By incorporating this SDK, developers can confidently tackle the document conversion process, armed with the assurance that the collaborative essence of the documents remains intact, bolstering the robustness of their applications.

Document Filters 24.1 Release

February 21, 2024 · 3 min read

Nabih Metri

Product Manager

Document Filters 24.1.0 is now available for download!

Extracting text from any file is harder than it looks. Extracting formatting is even harder.

October 4, 2021 · 13 min read

Ben Truscott

Document Filters Principal Engineer

Corey Kidd

(Frm) Product Owner

Backdrop

This post was originally hosted on the Stack Overflow Blog.

We take for granted document processing on an individual scale: double-click the file (or use a simple command-line phrase) and the contents of the file display. But it gets more complicated at scale. Imagine you’re a recruiter searching resumes for keywords or a paralegal looking for names in thousands of pages of discovery documents. The formats, versions, and platforms that generated them could be wildly different. The challenge is even greater when it’s time sensitive, for example if you have to scan all outgoing emails for personally identifiable information (PII) leakages, or you have to give patients a single file that contains all of their disclosure agreements, scanned documents, and MRI/X-ray/test reports, regardless of the original file format.