Services

Services and Technologies

Front-end

Angular
React
VBA
.NET
C#
Office Scripts

Data Transformation

Microsoft Consulting Services
Power BI / Tableau
Power Apps
.NET
VBA
Python
C#

Office Add-in

VSTO Add-in
Excel Add-in
Access Add-in
Word Add-in
PowerPoint Add-in
Office Add-in

Back-end

.NET
Python
Java
Node
SQL Server
Snowflake

Enterprise Solutions

MS Access, Excel, SharePoint
Google Workspace
IBM Planning Analytics
UnQork
Informatica
API Solutions

Cloud Consulting

Azure
Google Cloud Services
Amazon Web Services
Devops

UI UX Design

Web Design
UI-UX Design
Graphics
Product | Source Code
Blog
Courses
Contact

Services

Services and Technologies

Front-end

Angular
React
VBA
.NET
C#
Office Scripts

Data Transformation

Microsoft Consulting Services
Power BI / Tableau
Power Apps
.NET
VBA
Python
C#

Office Add-in

VSTO Add-in
Excel Add-in
Access Add-in
Word Add-in
PowerPoint Add-in
Office Add-in

Back-end

.NET
Python
Java
Node
SQL Server
Snowflake

Enterprise Solutions

MS Access, Excel, SharePoint
Google Workspace
IBM Planning Analytics
UnQork
Informatica
API Solutions

Cloud Consulting

Azure
Google Cloud Services
Amazon Web Services
Devops

UI UX Design

Web Design
UI-UX Design
Graphics
Product | Source Code
Blog
Courses
Contact

Pamai Tech Blog

More at Youtube.com/VbaA2z

.NET code to extract data from PDF file

Sid Chewang

Here’s an example code in C# to extract data from a PDF file using the iTextSharp library

using iTextSharp.text.pdf;
using iTextSharp.text.pdf.parser;
using System.IO;

// Define the path of the PDF file
string filePath = "path/to/pdf/file.pdf";

// Create an instance of the PdfReader class to read the PDF file
PdfReader pdfReader = new PdfReader(filePath);

// Define a string to store the extracted text
string extractedText = "";

// Loop through each page of the PDF file
for (int i = 1; i <= pdfReader.NumberOfPages; i++)
{
    // Extract text from the current page
    extractedText += PdfTextExtractor.GetTextFromPage(pdfReader, i);
}

// Close the PdfReader object
pdfReader.Close();

// Write the extracted text to a text file
File.WriteAllText("path/to/output/file.txt", extractedText);

// Display a message indicating that the extraction is complete
Console.WriteLine("Extraction complete.");

This code uses the PdfReader class to read the PDF file, loops through each page of the file, and uses the PdfTextExtractor class to extract the text from each page. The extracted text is stored in a string variable and then written to a text file using the File.WriteAllText() method. Finally, a message is displayed to indicate that the extraction is complete.

Most Recent Posts

All Post
.NET
Apps Script
Java
OfficeScripts
Others
Python
SQL
VBA

Services and Technologies

Services and Technologies

Pamai Tech Blog

.NET code to extract data from PDF file

Most Recent Posts

Products

Office Add-in

Enterprise Solutions

Cloud Consulting

UI UX Design

Data Transformation

Services

FAQ's

Privacy Policy

Terms & Condition

Team

Contact Us

Company

About Us

Services

Features

Our Pricing

Latest News