pgpdf

PDF type with meta admin & Full-Text Search

Overview

Package	Version	Category	License	Language
`pgpdf`	`0.1.0`	TYPE	GPL-3.0	C

ID	Extension	Bin	Lib	Load	Create	Trust	Reloc	Schema
3530	`pgpdf`	No	Yes	Yes	Yes	Yes	Yes	-

Related	`pgjq` `pgjwt` `prefix` `semver` `unit` `pglite_fusion` `md5hash` `asn1oid`

Version

Type	Repo	Version	PG Ver	Package	Deps
EXT	PIGSTY	`0.1.0`	1817161514	`pgpdf`	-
RPM	PIGSTY	`0.1.0`	1817161514	`pgpdf_$v`	-
DEB	PIGSTY	`0.1.0`	1817161514	`postgresql-$v-pgpdf`	-

OS / PG	PG18	PG17	PG16	PG15	PG14
el8.x86_64	PIGSTY 0.1.0 el8.x86_64.pg18 : pgpdf_18 pgpdf_18-0.1.0-1PIGSTY.el8.x86_64.rpm PIGSTY · 0.1.0 · 18.4KiB pgpdf_18-0.1.0-1PGDG.rhel8.x86_64.rpm PGDG · 0.1.0 · 15.4KiB	PIGSTY 0.1.0 el8.x86_64.pg17 : pgpdf_17 pgpdf_17-0.1.0-1PIGSTY.el8.x86_64.rpm PIGSTY · 0.1.0 · 18.4KiB pgpdf_17-0.1.0-1PGDG.rhel8.x86_64.rpm PGDG · 0.1.0 · 15.4KiB	PIGSTY 0.1.0 el8.x86_64.pg16 : pgpdf_16 pgpdf_16-0.1.0-1PIGSTY.el8.x86_64.rpm PIGSTY · 0.1.0 · 18.4KiB pgpdf_16-0.1.0-1PGDG.rhel8.x86_64.rpm PGDG · 0.1.0 · 15.4KiB	PIGSTY 0.1.0 el8.x86_64.pg15 : pgpdf_15 pgpdf_15-0.1.0-1PIGSTY.el8.x86_64.rpm PIGSTY · 0.1.0 · 18.4KiB pgpdf_15-0.1.0-1PGDG.rhel8.x86_64.rpm PGDG · 0.1.0 · 15.4KiB	PIGSTY 0.1.0 el8.x86_64.pg14 : pgpdf_14 pgpdf_14-0.1.0-1PIGSTY.el8.x86_64.rpm PIGSTY · 0.1.0 · 18.4KiB pgpdf_14-0.1.0-1PGDG.rhel8.x86_64.rpm PGDG · 0.1.0 · 15.4KiB
el8.aarch64	PIGSTY 0.1.0 el8.aarch64.pg18 : pgpdf_18 pgpdf_18-0.1.0-1PIGSTY.el8.aarch64.rpm PIGSTY · 0.1.0 · 18.0KiB pgpdf_18-0.1.0-1PGDG.rhel8.aarch64.rpm PGDG · 0.1.0 · 14.8KiB	PIGSTY 0.1.0 el8.aarch64.pg17 : pgpdf_17 pgpdf_17-0.1.0-1PIGSTY.el8.aarch64.rpm PIGSTY · 0.1.0 · 18.0KiB pgpdf_17-0.1.0-1PGDG.rhel8.aarch64.rpm PGDG · 0.1.0 · 14.8KiB	PIGSTY 0.1.0 el8.aarch64.pg16 : pgpdf_16 pgpdf_16-0.1.0-1PIGSTY.el8.aarch64.rpm PIGSTY · 0.1.0 · 18.0KiB pgpdf_16-0.1.0-1PGDG.rhel8.aarch64.rpm PGDG · 0.1.0 · 14.8KiB	PIGSTY 0.1.0 el8.aarch64.pg15 : pgpdf_15 pgpdf_15-0.1.0-1PIGSTY.el8.aarch64.rpm PIGSTY · 0.1.0 · 17.9KiB pgpdf_15-0.1.0-1PGDG.rhel8.aarch64.rpm PGDG · 0.1.0 · 14.8KiB	PIGSTY 0.1.0 el8.aarch64.pg14 : pgpdf_14 pgpdf_14-0.1.0-1PIGSTY.el8.aarch64.rpm PIGSTY · 0.1.0 · 17.9KiB pgpdf_14-0.1.0-1PGDG.rhel8.aarch64.rpm PGDG · 0.1.0 · 14.8KiB
el9.x86_64	PIGSTY 0.1.0 el9.x86_64.pg18 : pgpdf_18 pgpdf_18-0.1.0-1PIGSTY.el9.x86_64.rpm PIGSTY · 0.1.0 · 18.2KiB pgpdf_18-0.1.0-1PGDG.rhel9.x86_64.rpm PGDG · 0.1.0 · 15.6KiB	PIGSTY 0.1.0 el9.x86_64.pg17 : pgpdf_17 pgpdf_17-0.1.0-1PIGSTY.el9.x86_64.rpm PIGSTY · 0.1.0 · 18.2KiB pgpdf_17-0.1.0-1PGDG.rhel9.x86_64.rpm PGDG · 0.1.0 · 15.6KiB	PIGSTY 0.1.0 el9.x86_64.pg16 : pgpdf_16 pgpdf_16-0.1.0-1PIGSTY.el9.x86_64.rpm PIGSTY · 0.1.0 · 18.2KiB pgpdf_16-0.1.0-1PGDG.rhel9.x86_64.rpm PGDG · 0.1.0 · 15.6KiB	PIGSTY 0.1.0 el9.x86_64.pg15 : pgpdf_15 pgpdf_15-0.1.0-1PIGSTY.el9.x86_64.rpm PIGSTY · 0.1.0 · 18.2KiB pgpdf_15-0.1.0-1PGDG.rhel9.x86_64.rpm PGDG · 0.1.0 · 15.6KiB	PIGSTY 0.1.0 el9.x86_64.pg14 : pgpdf_14 pgpdf_14-0.1.0-1PIGSTY.el9.x86_64.rpm PIGSTY · 0.1.0 · 18.2KiB pgpdf_14-0.1.0-1PGDG.rhel9.x86_64.rpm PGDG · 0.1.0 · 15.6KiB
el9.aarch64	PIGSTY 0.1.0 el9.aarch64.pg18 : pgpdf_18 pgpdf_18-0.1.0-1PIGSTY.el9.aarch64.rpm PIGSTY · 0.1.0 · 17.7KiB pgpdf_18-0.1.0-1PGDG.rhel9.aarch64.rpm PGDG · 0.1.0 · 14.9KiB	PIGSTY 0.1.0 el9.aarch64.pg17 : pgpdf_17 pgpdf_17-0.1.0-1PIGSTY.el9.aarch64.rpm PIGSTY · 0.1.0 · 17.8KiB pgpdf_17-0.1.0-1PGDG.rhel9.aarch64.rpm PGDG · 0.1.0 · 15.0KiB	PIGSTY 0.1.0 el9.aarch64.pg16 : pgpdf_16 pgpdf_16-0.1.0-1PIGSTY.el9.aarch64.rpm PIGSTY · 0.1.0 · 17.8KiB pgpdf_16-0.1.0-1PGDG.rhel9.aarch64.rpm PGDG · 0.1.0 · 15.0KiB	PIGSTY 0.1.0 el9.aarch64.pg15 : pgpdf_15 pgpdf_15-0.1.0-1PIGSTY.el9.aarch64.rpm PIGSTY · 0.1.0 · 17.7KiB pgpdf_15-0.1.0-1PGDG.rhel9.aarch64.rpm PGDG · 0.1.0 · 15.0KiB	PIGSTY 0.1.0 el9.aarch64.pg14 : pgpdf_14 pgpdf_14-0.1.0-1PIGSTY.el9.aarch64.rpm PIGSTY · 0.1.0 · 17.7KiB pgpdf_14-0.1.0-1PGDG.rhel9.aarch64.rpm PGDG · 0.1.0 · 15.0KiB
el10.x86_64	PIGSTY 0.1.0 el10.x86_64.pg18 : pgpdf_18 pgpdf_18-0.1.0-1PIGSTY.el10.x86_64.rpm PIGSTY · 0.1.0 · 18.2KiB pgpdf_18-0.1.0-1PGDG.rhel10.x86_64.rpm PGDG · 0.1.0 · 16.0KiB	PIGSTY 0.1.0 el10.x86_64.pg17 : pgpdf_17 pgpdf_17-0.1.0-1PIGSTY.el10.x86_64.rpm PIGSTY · 0.1.0 · 18.2KiB pgpdf_17-0.1.0-1PGDG.rhel10.x86_64.rpm PGDG · 0.1.0 · 16.0KiB	PIGSTY 0.1.0 el10.x86_64.pg16 : pgpdf_16 pgpdf_16-0.1.0-1PIGSTY.el10.x86_64.rpm PIGSTY · 0.1.0 · 18.2KiB pgpdf_16-0.1.0-1PGDG.rhel10.x86_64.rpm PGDG · 0.1.0 · 16.0KiB	PIGSTY 0.1.0 el10.x86_64.pg15 : pgpdf_15 pgpdf_15-0.1.0-1PIGSTY.el10.x86_64.rpm PIGSTY · 0.1.0 · 18.2KiB pgpdf_15-0.1.0-1PGDG.rhel10.x86_64.rpm PGDG · 0.1.0 · 16.0KiB	PIGSTY 0.1.0 el10.x86_64.pg14 : pgpdf_14 pgpdf_14-0.1.0-1PIGSTY.el10.x86_64.rpm PIGSTY · 0.1.0 · 18.2KiB pgpdf_14-0.1.0-1PGDG.rhel10.x86_64.rpm PGDG · 0.1.0 · 16.0KiB
el10.aarch64	PIGSTY 0.1.0 el10.aarch64.pg18 : pgpdf_18 pgpdf_18-0.1.0-1PIGSTY.el10.aarch64.rpm PIGSTY · 0.1.0 · 17.9KiB pgpdf_18-0.1.0-1PGDG.rhel10.aarch64.rpm PGDG · 0.1.0 · 15.5KiB	PIGSTY 0.1.0 el10.aarch64.pg17 : pgpdf_17 pgpdf_17-0.1.0-1PIGSTY.el10.aarch64.rpm PIGSTY · 0.1.0 · 17.9KiB pgpdf_17-0.1.0-1PGDG.rhel10.aarch64.rpm PGDG · 0.1.0 · 15.5KiB	PIGSTY 0.1.0 el10.aarch64.pg16 : pgpdf_16 pgpdf_16-0.1.0-1PIGSTY.el10.aarch64.rpm PIGSTY · 0.1.0 · 17.9KiB pgpdf_16-0.1.0-1PGDG.rhel10.aarch64.rpm PGDG · 0.1.0 · 15.5KiB	PIGSTY 0.1.0 el10.aarch64.pg15 : pgpdf_15 pgpdf_15-0.1.0-1PIGSTY.el10.aarch64.rpm PIGSTY · 0.1.0 · 17.9KiB pgpdf_15-0.1.0-1PGDG.rhel10.aarch64.rpm PGDG · 0.1.0 · 15.5KiB	PIGSTY 0.1.0 el10.aarch64.pg14 : pgpdf_14 pgpdf_14-0.1.0-1PIGSTY.el10.aarch64.rpm PIGSTY · 0.1.0 · 17.9KiB pgpdf_14-0.1.0-1PGDG.rhel10.aarch64.rpm PGDG · 0.1.0 · 15.5KiB
d12.x86_64	PIGSTY 0.1.0 d12.x86_64.pg18 : postgresql-18-pgpdf postgresql-18-pgpdf_0.1.0-1PIGSTY~bookworm_amd64.deb PIGSTY · 0.1.0 · 25.3KiB	PIGSTY 0.1.0 d12.x86_64.pg17 : postgresql-17-pgpdf postgresql-17-pgpdf_0.1.0-1PIGSTY~bookworm_amd64.deb PIGSTY · 0.1.0 · 25.3KiB	PIGSTY 0.1.0 d12.x86_64.pg16 : postgresql-16-pgpdf postgresql-16-pgpdf_0.1.0-1PIGSTY~bookworm_amd64.deb PIGSTY · 0.1.0 · 25.3KiB	PIGSTY 0.1.0 d12.x86_64.pg15 : postgresql-15-pgpdf postgresql-15-pgpdf_0.1.0-1PIGSTY~bookworm_amd64.deb PIGSTY · 0.1.0 · 25.3KiB	PIGSTY 0.1.0 d12.x86_64.pg14 : postgresql-14-pgpdf postgresql-14-pgpdf_0.1.0-1PIGSTY~bookworm_amd64.deb PIGSTY · 0.1.0 · 25.2KiB
d12.aarch64	PIGSTY 0.1.0 d12.aarch64.pg18 : postgresql-18-pgpdf postgresql-18-pgpdf_0.1.0-1PIGSTY~bookworm_arm64.deb PIGSTY · 0.1.0 · 24.7KiB	PIGSTY 0.1.0 d12.aarch64.pg17 : postgresql-17-pgpdf postgresql-17-pgpdf_0.1.0-1PIGSTY~bookworm_arm64.deb PIGSTY · 0.1.0 · 24.6KiB	PIGSTY 0.1.0 d12.aarch64.pg16 : postgresql-16-pgpdf postgresql-16-pgpdf_0.1.0-1PIGSTY~bookworm_arm64.deb PIGSTY · 0.1.0 · 24.6KiB	PIGSTY 0.1.0 d12.aarch64.pg15 : postgresql-15-pgpdf postgresql-15-pgpdf_0.1.0-1PIGSTY~bookworm_arm64.deb PIGSTY · 0.1.0 · 24.6KiB	PIGSTY 0.1.0 d12.aarch64.pg14 : postgresql-14-pgpdf postgresql-14-pgpdf_0.1.0-1PIGSTY~bookworm_arm64.deb PIGSTY · 0.1.0 · 24.6KiB
d13.x86_64	PIGSTY 0.1.0 d13.x86_64.pg18 : postgresql-18-pgpdf postgresql-18-pgpdf_0.1.0-1PIGSTY~trixie_amd64.deb PIGSTY · 0.1.0 · 25.3KiB	PIGSTY 0.1.0 d13.x86_64.pg17 : postgresql-17-pgpdf postgresql-17-pgpdf_0.1.0-1PIGSTY~trixie_amd64.deb PIGSTY · 0.1.0 · 25.2KiB	PIGSTY 0.1.0 d13.x86_64.pg16 : postgresql-16-pgpdf postgresql-16-pgpdf_0.1.0-1PIGSTY~trixie_amd64.deb PIGSTY · 0.1.0 · 25.2KiB	PIGSTY 0.1.0 d13.x86_64.pg15 : postgresql-15-pgpdf postgresql-15-pgpdf_0.1.0-1PIGSTY~trixie_amd64.deb PIGSTY · 0.1.0 · 25.3KiB	PIGSTY 0.1.0 d13.x86_64.pg14 : postgresql-14-pgpdf postgresql-14-pgpdf_0.1.0-1PIGSTY~trixie_amd64.deb PIGSTY · 0.1.0 · 25.2KiB
d13.aarch64	PIGSTY 0.1.0 d13.aarch64.pg18 : postgresql-18-pgpdf postgresql-18-pgpdf_0.1.0-1PIGSTY~trixie_arm64.deb PIGSTY · 0.1.0 · 24.7KiB	PIGSTY 0.1.0 d13.aarch64.pg17 : postgresql-17-pgpdf postgresql-17-pgpdf_0.1.0-1PIGSTY~trixie_arm64.deb PIGSTY · 0.1.0 · 24.6KiB	PIGSTY 0.1.0 d13.aarch64.pg16 : postgresql-16-pgpdf postgresql-16-pgpdf_0.1.0-1PIGSTY~trixie_arm64.deb PIGSTY · 0.1.0 · 24.7KiB	PIGSTY 0.1.0 d13.aarch64.pg15 : postgresql-15-pgpdf postgresql-15-pgpdf_0.1.0-1PIGSTY~trixie_arm64.deb PIGSTY · 0.1.0 · 24.7KiB	PIGSTY 0.1.0 d13.aarch64.pg14 : postgresql-14-pgpdf postgresql-14-pgpdf_0.1.0-1PIGSTY~trixie_arm64.deb PIGSTY · 0.1.0 · 24.6KiB
u22.x86_64	PIGSTY 0.1.0 u22.x86_64.pg18 : postgresql-18-pgpdf postgresql-18-pgpdf_0.1.0-1PIGSTY~jammy_amd64.deb PIGSTY · 0.1.0 · 26.7KiB	PIGSTY 0.1.0 u22.x86_64.pg17 : postgresql-17-pgpdf postgresql-17-pgpdf_0.1.0-1PIGSTY~jammy_amd64.deb PIGSTY · 0.1.0 · 27.4KiB	PIGSTY 0.1.0 u22.x86_64.pg16 : postgresql-16-pgpdf postgresql-16-pgpdf_0.1.0-1PIGSTY~jammy_amd64.deb PIGSTY · 0.1.0 · 27.4KiB	PIGSTY 0.1.0 u22.x86_64.pg15 : postgresql-15-pgpdf postgresql-15-pgpdf_0.1.0-1PIGSTY~jammy_amd64.deb PIGSTY · 0.1.0 · 27.5KiB	PIGSTY 0.1.0 u22.x86_64.pg14 : postgresql-14-pgpdf postgresql-14-pgpdf_0.1.0-1PIGSTY~jammy_amd64.deb PIGSTY · 0.1.0 · 27.4KiB
u22.aarch64	PIGSTY 0.1.0 u22.aarch64.pg18 : postgresql-18-pgpdf postgresql-18-pgpdf_0.1.0-1PIGSTY~jammy_arm64.deb PIGSTY · 0.1.0 · 26.2KiB	PIGSTY 0.1.0 u22.aarch64.pg17 : postgresql-17-pgpdf postgresql-17-pgpdf_0.1.0-1PIGSTY~jammy_arm64.deb PIGSTY · 0.1.0 · 27.0KiB	PIGSTY 0.1.0 u22.aarch64.pg16 : postgresql-16-pgpdf postgresql-16-pgpdf_0.1.0-1PIGSTY~jammy_arm64.deb PIGSTY · 0.1.0 · 27.0KiB	PIGSTY 0.1.0 u22.aarch64.pg15 : postgresql-15-pgpdf postgresql-15-pgpdf_0.1.0-1PIGSTY~jammy_arm64.deb PIGSTY · 0.1.0 · 27.0KiB	PIGSTY 0.1.0 u22.aarch64.pg14 : postgresql-14-pgpdf postgresql-14-pgpdf_0.1.0-1PIGSTY~jammy_arm64.deb PIGSTY · 0.1.0 · 26.9KiB
u24.x86_64	PIGSTY 0.1.0 u24.x86_64.pg18 : postgresql-18-pgpdf postgresql-18-pgpdf_0.1.0-1PIGSTY~noble_amd64.deb PIGSTY · 0.1.0 · 26.5KiB	PIGSTY 0.1.0 u24.x86_64.pg17 : postgresql-17-pgpdf postgresql-17-pgpdf_0.1.0-1PIGSTY~noble_amd64.deb PIGSTY · 0.1.0 · 26.4KiB	PIGSTY 0.1.0 u24.x86_64.pg16 : postgresql-16-pgpdf postgresql-16-pgpdf_0.1.0-1PIGSTY~noble_amd64.deb PIGSTY · 0.1.0 · 26.4KiB	PIGSTY 0.1.0 u24.x86_64.pg15 : postgresql-15-pgpdf postgresql-15-pgpdf_0.1.0-1PIGSTY~noble_amd64.deb PIGSTY · 0.1.0 · 26.5KiB	PIGSTY 0.1.0 u24.x86_64.pg14 : postgresql-14-pgpdf postgresql-14-pgpdf_0.1.0-1PIGSTY~noble_amd64.deb PIGSTY · 0.1.0 · 26.4KiB
u24.aarch64	PIGSTY 0.1.0 u24.aarch64.pg18 : postgresql-18-pgpdf postgresql-18-pgpdf_0.1.0-1PIGSTY~noble_arm64.deb PIGSTY · 0.1.0 · 25.7KiB	PIGSTY 0.1.0 u24.aarch64.pg17 : postgresql-17-pgpdf postgresql-17-pgpdf_0.1.0-1PIGSTY~noble_arm64.deb PIGSTY · 0.1.0 · 25.7KiB	PIGSTY 0.1.0 u24.aarch64.pg16 : postgresql-16-pgpdf postgresql-16-pgpdf_0.1.0-1PIGSTY~noble_arm64.deb PIGSTY · 0.1.0 · 25.6KiB	PIGSTY 0.1.0 u24.aarch64.pg15 : postgresql-15-pgpdf postgresql-15-pgpdf_0.1.0-1PIGSTY~noble_arm64.deb PIGSTY · 0.1.0 · 25.7KiB	PIGSTY 0.1.0 u24.aarch64.pg14 : postgresql-14-pgpdf postgresql-14-pgpdf_0.1.0-1PIGSTY~noble_arm64.deb PIGSTY · 0.1.0 · 25.7KiB

Build

You can build the RPM / DEB packages for pgpdf using pig build:

pig build pkg pgpdf         # build RPM / DEB packages

Install

You can install pgpdf directly. First, make sure the PGDG and PIGSTY repositories are added and enabled:

pig repo add pgsql -u          # Add repo and update cache

Install the extension using pig or apt/yum/dnf:

pig install pgpdf;          # Install for current active PG version

pig ext install -y pgpdf -v 18  # PG 18
pig ext install -y pgpdf -v 17  # PG 17
pig ext install -y pgpdf -v 16  # PG 16
pig ext install -y pgpdf -v 15  # PG 15
pig ext install -y pgpdf -v 14  # PG 14

dnf install -y pgpdf_18       # PG 18
dnf install -y pgpdf_17       # PG 17
dnf install -y pgpdf_16       # PG 16
dnf install -y pgpdf_15       # PG 15
dnf install -y pgpdf_14       # PG 14

apt install -y postgresql-18-pgpdf   # PG 18
apt install -y postgresql-17-pgpdf   # PG 17
apt install -y postgresql-16-pgpdf   # PG 16
apt install -y postgresql-15-pgpdf   # PG 15
apt install -y postgresql-14-pgpdf   # PG 14

Preload:

shared_preload_libraries = 'pgpdf';

Create Extension:

CREATE EXTENSION pgpdf;

Usage

The actual PDF parsing is done by poppler.

This allows you to work with PDFs in an ACID-compliant way. The usual alternative relies on external scripts or services which can easily make your data ingestion pipeline brittle and leave your raw data out-of-sync.

Download some PDFs.

wget https://wiki.postgresql.org/images/e/ea/PostgreSQL_Introduction.pdf -O /tmp/pgintro.pdf
wget https://pdfobject.com/pdf/sample.pdf -O /tmp/sample.pdf

You can create a pdf type, by casting either a text filepath or bytea column.

CREATE EXTENSION pgpdf;
SELECT '/tmp/pgintro.pdf'::pdf;

                                       pdf                                        
----------------------------------------------------------------------------------
 PostgreSQL Introduction                                                         +
 Digoal.Zhou                                                                     +
 7/20/2011Catalog                                                                +
  PostgreSQL Origin

If you don’t have the PDF file in your filesystem, but have already stored its content in a bytea column, you can just cast it to pdf.

SELECT pg_read_binary_file('/tmp/pgintro.pdf')::bytea::pdf;

Examples

Create a table with a pdf column:

CREATE TABLE pdfs(name text primary key, doc pdf);

INSERT INTO pdfs VALUES ('pgintro', '/tmp/pgintro.pdf');
INSERT INTO pdfs VALUES ('pgintro', '/tmp/sample.pdf');

Parsing and validation should happen automatically. The files will be read from the disk only once!

Note

The filepath should be accessible by the postgres process / user! That’s different than the user running psql. If you don’t understand what this means, as your DBA!

String Functions and Operators

Standard Postgres String Functions and Operators should work as usual:

SELECT 'Below is the PDF we received ' || '/tmp/pgintro.pdf'::pdf;

SELECT upper('/tmp/pgintro.pdf'::pdf::text);

SELECT name
FROM pdfs
WHERE doc::text LIKE '%Postgres%';

Full-Text Search (FTS)

You can also perform full-text search (FTS), since you can work on a pdf file like normal text.

SELECT '/tmp/pgintro.pdf'::pdf::text @@ to_tsquery('postgres');

 ?column? 
----------
 t
(1 row)

SELECT '/tmp/pgintro.pdf'::pdf::text @@ to_tsquery('oracle');

 ?column? 
----------
 f
(1 row)

Document similarity with `pg_trgm`

You can use pg_trgm to get the similarity between two documents:

CREATE EXTENSION pg_trgm;

SELECT similarity('/tmp/pgintro.pdf'::pdf::text, '/tmp/sample.pdf'::pdf::text);

Metadata

The following functions are available:

pdf_title(pdf) → text
pdf_author(pdf) → text
pdf_num_pages(pdf) → integer
Total number of pages in the document
pdf_page(pdf, integer) → text
Get the i-th page as text
pdf_creator(pdf) → text
pdf_keywords(pdf) → text
pdf_metadata(pdf) → text
pdf_version(pdf) → text
pdf_subject(pdf) → text
pdf_creation(pdf) → timestamp
pdf_modification(pdf) → timestamp

SELECT pdf_title('/tmp/pgintro.pdf');

        pdf_title        
-------------------------
 PostgreSQL Introduction
(1 row)

SELECT pdf_author('/tmp/pgintro.pdf');

 pdf_author 
------------
 周正中
(1 row)

Getting a subset of pages

SELECT pdf_num_pages('/tmp/pgintro.pdf');

 pdf_num_pages 
---------------
            24
(1 row)

SELECT pdf_page('/tmp/pgintro.pdf', 1);

           pdf_page           
------------------------------
 Catalog                     +
  PostgreSQL Origin         +
  Layout                    +
  Features                  +
  Enterprise Class Attribute+
  Case
(1 row)

SELECT pdf_subject('/tmp/pgintro.pdf');

 pdf_subject 
-------------
 
(1 row)

SELECT pdf_creation('/tmp/pgintro.pdf');

       pdf_creation       
--------------------------
 Wed Jul 20 11:13:37 2011
(1 row)

SELECT pdf_modification('/tmp/pgintro.pdf');

     pdf_modification     
--------------------------
 Wed Jul 20 11:13:37 2011
(1 row)

SELECT pdf_creator('/tmp/pgintro.pdf');

            pdf_creator             
------------------------------------
 Microsoft® Office PowerPoint® 2007
(1 row)

SELECT pdf_metadata('/tmp/pgintro.pdf');

 pdf_metadata 
--------------
 
(1 row)

SELECT pdf_version('/tmp/pgintro.pdf');

 pdf_version 
-------------
 PDF-1.5
(1 row)

Installation

Install poppler dependencies

Linux

sudo apt install -y libpoppler-glib-dev pkg-config

Homebrew/MacOS

brew install poppler pkgconf

cd /tmp
git clone https://github.com/Florents-Tselai/pgpdf.git
cd pgpdf
make
make install # may need sudo

After the installation, in a session:

CREATE EXTENSION pgpdf;

Docker

Get the Docker image with:

docker pull florents/pgpdf:pg17

This adds pgpdf to the Postgres image (replace 17 with your Postgres server version, and run it the same way).

Run the image in a container.

docker run --name pgpdf -p 5432:5432 -e POSTGRES_PASSWORD=pass florents/pgpdf:pg17

Through another terminal, connect to the running server (container).

PGPASSWORD=pass psql -h localhost -p 5432 -U postgres

Warning

Reading arbitrary binary data (PDF) into your database can pose security risks. Only use this for files you trust.

Feedback

Was this page helpful?

Thanks for the feedback! Please let us know how we can improve.

Sorry to hear that. Please let us know how we can improve.

Last Modified 2026-03-12: add pg extension catalog (95749bf)