Extract and Parse Data Efficiently with a Smart .NET Text Parser
ComponentOne TextParser enables you to efficiently integrate data from semi-structured sources, such as emails and invoices, into your workflows.
- Extract data from plain text or HTML files
- Store extracted data in a table of records or transfer it to another system
- Extract repeated data using a variety of techniques, including templates
- Use the .NET text parser across your desktop applications or web services
TextParser is available in Data Services Edition.
Download Free Trial (2024 v1)Why Choose ComponentOne TextParser?
Efficiently Parse Text
Smart text extractors help work around unique file types by ignoring extra characters.
Extract Information for Enhanced BI
Reading and storing information from semi-structured files lets you analyze or use it in different ways.
Improve Information Workflows
Automate the process for extracting important information repeatedly and improve efficiency.
.NET Text Parser Key Features
Extractors for Semi-Structured Files
The ComponentOne TextParser library supports three different extractors for different scenarios including plain text and a specialized HTML extractor.
Expression-based Matching
Extraction can occur along matched regular expressions, after a matched word or phrase, or using a defined script.
JSON Output
.NET Text Parser Extractor Types
Starts-After-Continues-Until Extractor
The Starts-After-Continues-Until Extractor is the simplest and the easiest to use as it's designed with the purpose of extracting relevant text from a plain text source. Define two parameters to use: where the text starts and where it ends. TextParser extracts the text contained between the occurrences of two regular expressions.
Template-based Extractor
The template-based Extractor is the most generic, as it allows users to parse data structures following a declarative XML template. Since the template can be provided as a separate file, it allows users to provide both the template and source to parse.
HTML Extractor
See All You Can Do With .NET Text Parser
Extract Product Information from Websites
The .NET text parser is designed to extract data from e-commerce websites like Amazon.com. The HTML Extractor is specialized for complex HTML documents by allowing unexpected characters within the markup.
Extract Important Information from Emails
Emails tend to follow a predictable HTML structure which makes it possible for predictable and automatic parsing. Through the .NET text parser individual pieces of information, such as item details, vendor info, and costs, can be extracted and recorded as fields which can then be recorded or analyzed.
Paste Rich Formatted Hyperlinks in Text Editors
Improve Productivity with Smart Tags
Process Resumes for Digital Analysis
Resumes are often formatted in a predictable manner that allows them to be easily read by a machine that parses out important information. When a company receives hundreds of resumes, manual processing can be tedious and time-consuming. The .NET text parser can analyze and narrow down candidate applications by parsing out key requirements.
Extract Invoice Totals
.NET Text Parser Demos
TextParser Explorer
See how you can extract data from plain text or HTML files, such as emails and invoices.