Category "pdf"

How to get coordinate of text from line paragraph pdf itext5

How to get coordinate of left of line text paragraph pdf. I mean like pic (x,y). Coordinate per line. I use itext5 and java

R markdown to PDF - Printing console output

I am working in RStudio for a course, and I want to write up my reports with R markdown. I would like to display certain console output in the pdf that will be

How can i add an image into an existing and specific pdf page (Itext)?

I'm developing a "program" that create barcode and insert them on the original pdf. This is my code: package com.mycompany.barcodeinserter; import com.itextpdf

html-pdf: Failed to load PhantomJS module. Error: Cannot find module 'phantomjs-prebuilt'

I am using html-pdf NPM module to convert my html (bill.html) code to pdf file (bill.pdf), I have simple html with heading tag and exporting that html by fs mo

How to set fit width when embedding pdf in Chrome?

When I embed a pdf to view on Chrome/ Firefox , no matter I set width / set the parameter , it does not change the width of the pdf content, the only thing I ca

Tabula read pdf - CalledProcessError

I am using tabula to read tables from a pdf. The documents I'm extracting data from are really large, so I'm using a for-loop to run through the different pages

Rename PDF based on its content

How do I specifically extract the name from a PDF (screenshot as shown below) and rename the PDF as the name? So in this case, I want to extract Daniel Thomas f

Tabula py not reading all rows for PDFs with alternating colors for each row when Lattice is set to True

I am trying to extract all rows from the PDF attached here. Here is the code I used: def parse_latticepdf_pages(pdf): pages = read_pdf( pdf,

Get/Set Annotation from/in existsing PDF using itextSharp

I'm working on a project where i have to read an existing pdf using iTextSharp.PdfReader, then getAnnotations in order to insert them back in a new generated pd

Select only first page of PDF with PyPDF2

I am trying to strip out only the first page of multiple PDF files and combine into one file. (I receive 150 PDF files a day, the first page is the invoice whic

Image to pdf convertion in flutter

I am currently working on a photo scanner app in flutter and searching for a way to convert images to pdf. It would be great if anyone helps me

How to extract text from a Specific Area in a PDF using Python?

I'm trying to extract Text from a PDF using Python, and I have successfully done so using PyPDF2 like this: from PyPDF2 import PdfFileReader reader = PdfFileRea

Concatenating PDF files in memory with PyPDF2

I wish to concatenate (append) a bunch of small pdfs together effectively in memory in pure python. Specifically, an usual case is 500 single page pdfs, each wi

How to convert PDF to Excel on a Mac?

I have a lot of PDFs to convert to excel. I found the code below which converts PDF to excel by opening the pdf file in word but it fails at Dim fso As New File

EOF marker not found while use PyPDF2 merge pdf file in python

When I use the following code from PyPDF2 import PdfFileMerger merge = PdfFileMerger() for newFile in nlst: merge.append(newFile) merge.write("newFile.pdf

Split each PDF page in two?

I have a large number of PDF files which have two slides to a page (for printing). The format is A4 pages each with two slides setup like so: ----------- | sl

jsPDF addHTML exporting low quality image to PDF

Simple question searching from last 2 days but didnt find solution i am converting html to pdf using this addHTML api of jsPDF $('#loadPdf').on('click', functi

Using pdf.js on a node server

I want to convert a pdf to an image server-side, using node.js. My input for this task is pdf's url, and the desired output is a base64 string, representing an

Use PyPDF2 to detect non-embedded fonts in PDF file generated by Google Docs

I was hoping someone could help me write a Python function to detect any fonts in the file which are not embedded in the file. I've attempted to use the script

Save as pdf using ClosedXML

Does ClosedXML have any function that can be used to save an excel file as PDF? Currently i am using ClosedXML to create and populate the excel file and then u