here’s a sample code in C# using the .NET framework to extract data from a PDF
using System; using System.IO; using iTextSharp.text; using iTextSharp.text.pdf; namespace PDFExtractorExample { class Program { static void Main(string[] args) { // Define the PDF file path and name string filePath = @"C:\Users\UserName\Documents\example.pdf"; // Create a new instance of the PDF reader using (PdfReader reader = new PdfReader(filePath)) { // Extract text from each page of the PDF document for (int i = 1; i <= reader.NumberOfPages; i++) { string text = PdfTextExtractor.GetTextFromPage(reader, i); Console.WriteLine("Page {0}:\n{1}", i, text); } } Console.WriteLine("Data has been extracted from the PDF."); Console.ReadLine(); } } }
In this example, the iTextSharp library is used to extract text from the PDF. The PdfReader
class is used to create a new instance of the PDF reader with the PDF file path and name. The GetTextFromPage
method of the PdfTextExtractor
class is used to extract text from each page of the PDF document. The text is then printed to the console. Finally, the PdfReader
object is disposed of properly using the using
statement.