Pamai Tech Blog

More at Youtube.com/VbaA2z

Extract data from PDF using .NET

Sid Chewang

Here is sample C# code that uses the iTextSharp library on the .NET Framework to extract text from a PDF:

using System;
using System.IO;
using iTextSharp.text;
using iTextSharp.text.pdf;
using iTextSharp.text.pdf.parser; // PdfTextExtractor is defined in this namespace

namespace PDFExtractorExample
{
    class Program
    {
        static void Main(string[] args)
        {
            // Define the PDF file path and name
            string filePath = @"C:\Users\UserName\Documents\example.pdf";

            // Create a new instance of the PDF reader
            using (PdfReader reader = new PdfReader(filePath))
            {
                // Extract text from each page of the PDF document
                for (int i = 1; i <= reader.NumberOfPages; i++)
                {
                    string text = PdfTextExtractor.GetTextFromPage(reader, i);

                    Console.WriteLine("Page {0}:\n{1}", i, text);
                }
            }

            Console.WriteLine("Data has been extracted from the PDF.");
            Console.ReadLine();
        }
    }
}

In this example, the iTextSharp library is used to extract text from the PDF. A PdfReader is created from the path of the PDF file, and the loop runs from 1 to reader.NumberOfPages because iTextSharp numbers pages starting at 1. For each page, the GetTextFromPage method of the PdfTextExtractor class (in the iTextSharp.text.pdf.parser namespace) returns the page's text as a string, which is then printed to the console. Because the PdfReader is created in a using statement, it is disposed of when the block exits, even if an exception is thrown.
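The string returned by GetTextFromPage is unstructured, so getting from "text" to "data" usually needs a parsing step. The following is a minimal sketch of one way to do that, not part of the original sample. It assumes the page text contains lines of the form "Label: value"; the FieldParser class, the ParseFields method, and the invoice-style sample text are all hypothetical names and data chosen for illustration. It uses only the .NET base class library, so it can be tried without a PDF.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

namespace PDFExtractorExample
{
    // Hypothetical helper: turns "Label: value" lines into a dictionary.
    public static class FieldParser
    {
        // Assumption: each field sits on its own line as "Label: value".
        // The label is everything before the first colon on the line.
        private static readonly Regex LabelValue = new Regex(
            @"^[ \t]*(?<label>[^:\r\n]+?)[ \t]*:[ \t]*(?<value>.+?)[ \t\r]*$",
            RegexOptions.Multiline);

        public static Dictionary<string, string> ParseFields(string pageText)
        {
            // Case-insensitive keys so "total" and "Total" resolve to the same field.
            var fields = new Dictionary<string, string>(StringComparer.OrdinalIgnoreCase);

            foreach (Match match in LabelValue.Matches(pageText))
            {
                // If a label repeats, the last occurrence wins.
                fields[match.Groups["label"].Value] = match.Groups["value"].Value;
            }

            return fields;
        }
    }

    class ParserDemo
    {
        static void Main()
        {
            // Stand-in for the string returned by PdfTextExtractor.GetTextFromPage.
            string pageText = "Invoice No: 1042\r\nCustomer: Contoso Ltd\r\nTotal: 250.00";

            Dictionary<string, string> fields = FieldParser.ParseFields(pageText);

            foreach (KeyValuePair<string, string> field in fields)
            {
                Console.WriteLine("{0} = {1}", field.Key, field.Value);
            }
        }
    }
}
```

In the loop from the sample above, the text variable could be passed to ParseFields in place of the hard-coded string. Real PDFs rarely have a layout this regular, so the pattern would need adjusting to the document at hand.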
