PDF Tools Complete Guide: Merge, Split, Compress & Protect Your PDFs

Master every PDF operation — merging, splitting, compressing, password-protecting, watermarking, and converting. Learn why browser-based tools are the safest option for sensitive documents.

Published February 17, 2026 · Updated February 17, 2026

PDFs are the cockroaches of the digital world — and I mean that as a compliment. They survive everything. Send a PDF from a Mac to a Windows machine to a Linux server to someone's phone, and it looks identical everywhere. That resilience is exactly why the format has dominated document exchange for over 30 years. Contracts, invoices, research papers, tax forms, resumes, instruction manuals — if a document matters, it is probably a PDF.

But here is the paradox: the very thing that makes PDFs great for sharing makes them terrible for working with. The format was engineered from the ground up to be a faithful digital representation of a printed page. It was never designed to be edited, rearranged, or manipulated after creation. And yet, that is exactly what we need to do with them constantly — merge five documents into one application packet, extract three pages from a 200-page report, shrink a 50MB scan to something that fits in an email, or slap a password on a contract before sending it to a client.

This guide covers every major PDF operation you are likely to need, explains how each one actually works under the hood, and shows you how to do it without uploading sensitive documents to someone else's server.

Why PDFs Are Hard to Edit

Before diving into specific tools, it helps to understand why PDFs resist manipulation in the first place. This is not just academic — it explains why some operations are trivial and others are genuinely difficult.

A PDF is not a flow document like a Word file. In a .docx, text is stored as a stream of characters with formatting instructions: "this paragraph is 12pt Times New Roman, left-aligned, with 1.5 line spacing." The word processor dynamically reflows the text to fit whatever page size and margins you define. Move a paragraph, and everything below it shifts automatically.

A PDF works completely differently. Each page is essentially a canvas with absolute coordinates. The letter "A" in a PDF is not part of a text stream — it is an instruction that says "draw glyph 65 from font subset XYZ at position (72.5, 684.2) on page 3." Every character, every line, every image is positioned with exact coordinates. There is no concept of "flow." Move one element, and nothing else adjusts.

This coordinate-based architecture explains several things:

Why PDFs look identical everywhere — there is no reflow to go wrong. The rendering engine just draws exactly what the file specifies.
Why editing text in a PDF is awkward — adding a word does not push subsequent text forward. You have to manually reposition everything that follows.
Why merging and splitting are easy — you are just copying complete page objects from one file to another. No content modification is needed.
Why compression focuses on images — the text instructions are already tiny. It is the embedded images that bloat PDF files.

Understanding this architecture makes the rest of the guide much more intuitive. Some operations work with the PDF structure (merge, split, reorder) and are clean, lossless, and fast. Others work against it (text editing, reformatting) and are inherently messy.

Merging PDFs: Combining Multiple Documents Into One

Merging is probably the most common PDF operation, and fortunately it is one of the simplest because it works with the format's architecture rather than against it.

When you need to merge

The use cases are everywhere:

Job applications — combining your resume, cover letter, portfolio samples, and references into a single file that a hiring manager can review without juggling attachments
Contract packages — assembling the main agreement, exhibits, signatures pages, and supporting documents into one complete package
Report compilation — combining monthly reports from different departments into a quarterly summary
School assignments — merging your essay, bibliography, and appendix scans into one submission
Tax preparation — combining W-2s, 1099s, receipts, and other supporting documents into a single file for your accountant
Real estate — assembling purchase agreements, inspection reports, disclosures, and addenda

How merging works technically

When you merge PDFs, the tool is not "opening" the documents in the way a PDF viewer does. Instead, it reads the internal file structure — the page tree, font resources, image objects, and cross-reference tables — and creates a new PDF that incorporates the page objects from each source file.

Modern libraries like pdf-lib (which Fileza uses) handle this by:

Parsing each source PDF's cross-reference table to locate all objects
Copying page objects and their dependencies (fonts, images, color profiles) into a new document
Rebuilding the page tree to include all pages in the specified order
Generating a new cross-reference table and writing the output file

Because the page content is copied verbatim — the exact same drawing instructions, font subsets, and image data — merging is lossless. The output pages are bit-for-bit identical to the originals. No quality degradation, no font substitution, no layout shifts.

Tips for better merges

Check page sizes before merging. If one document is Letter (8.5 x 11") and another is A4 (210 x 297mm), the merged PDF will have mixed page sizes. This is perfectly valid but may look odd when printed. Some tools let you normalize page sizes during the merge.
Watch the file size. Merging five 10MB PDFs produces a 50MB file. If each source PDF has its own embedded copy of common fonts (like Arial or Times New Roman), the merged file will have duplicate font data. Some advanced merge tools can deduplicate shared resources, but most do not.
Order matters. Arrange your source files in the correct sequence before merging. Reordering pages after merging is possible but adds an extra step.
Bookmarks and table of contents from individual source files may not transfer to the merged document, depending on the tool. If you need a navigable merged PDF, you might need to add bookmarks after merging.

Splitting PDFs: Extracting the Pages You Need

Splitting is the inverse of merging — extracting specific pages from a larger document into a new, smaller PDF. Like merging, it works with the PDF structure and is inherently lossless.

Common splitting scenarios

Extracting a signature page from a multi-page contract to send for signing
Pulling specific chapters from a textbook or manual
Separating individual invoices from a batch export
Isolating a single page that contains the information you actually need from a 50-page report
Breaking a large scan into individual documents (for example, splitting a stack of scanned receipts)

Splitting approaches

There are several ways to split a PDF, and different tools support different approaches:

Method	Description	Best For
Extract specific pages	Select individual pages by number (e.g., pages 3, 7, 12)	Grabbing a few specific pages
Extract page range	Specify a continuous range (e.g., pages 5-15)	Pulling a chapter or section
Split at every N pages	Break into chunks of N pages each	Dividing a batch document
Split into individual pages	Every page becomes its own PDF	Processing scanned stacks
Remove pages	Keep everything except specified pages	Deleting irrelevant pages

What splitting preserves

Since splitting copies complete page objects, the extracted pages retain:

All text with original fonts and positioning
All images at their original resolution and quality
Vector graphics and drawings
Annotations, form fields, and comments on the extracted pages
Links and bookmarks (though cross-page links to removed pages will break)

What may not transfer: document-level metadata, document-level JavaScript, and cross-page references like a table of contents that points to pages not included in the split output.

Compressing PDFs: Getting File Sizes Under Control

PDF compression is where things get more nuanced, because unlike merging and splitting, compression actually modifies the content inside the file. The degree of modification — and the quality trade-off — depends entirely on what is making the PDF large in the first place.

Why PDFs get bloated

Not all large PDFs are large for the same reason:

Cause	Typical Size Impact	Compressible?
High-resolution embedded images	Often 80-95% of file size	Very compressible
Uncompressed or poorly compressed images	Major contributor	Highly compressible
Embedded full font files (instead of subsets)	1-10MB per font	Moderately compressible
Duplicate resources across pages	Varies	Deduplicate for savings
Metadata and hidden layers	Usually minor	Easy to strip
Vector graphics (charts, diagrams)	Usually small	Limited compression potential
Text content	Negligible	Already compact

The key insight: PDF compression is almost entirely about image optimization. A 30MB PDF with three high-resolution photos embedded at 300 DPI will compress beautifully because you can downsample those images to 150 DPI and recompress them, cutting the file size by 60-80%. A 30MB PDF that is a 500-page text-only legal document is already about as small as it can get — the text data is inherently compact.

Compression techniques

Effective PDF compression uses a combination of strategies:

Image downsampling. If a PDF contains a 4000x3000 pixel photo embedded at 300 DPI, but the image only occupies a 4-inch-wide area on the page, it only needs to be about 600 pixels wide for screen viewing (150 DPI) or 1200 pixels for print (300 DPI). Reducing the image resolution from 12 megapixels to 0.7 megapixels dramatically reduces size with no visible difference at the intended display size.

Image recompression. Some PDFs contain images stored as uncompressed bitmaps or with lossless PNG compression. Recompressing these as JPEG at quality 75-85 can cut image data by 80% or more with minimal visible quality loss.

Font subsetting. If a PDF embeds the complete Arial font file (800+ glyphs, several megabytes), but only uses 60 characters from it, font subsetting replaces the full font with a subset containing only the glyphs actually used. This can reduce font data from megabytes to kilobytes.

Object stream compression. PDF internal structures (cross-reference tables, object dictionaries) can be stored using more efficient compression algorithms. This typically saves a small percentage but adds up in large documents.

Metadata removal. Stripping author names, creation dates, editing history, and embedded thumbnails. Minor savings but useful for privacy.

Realistic compression expectations

PDF Type	Original Size	Compressed Size	Reduction
Scanned document (300 DPI, color)	25MB	5-8MB	65-80%
Photo-heavy report	40MB	10-15MB	60-75%
Presentation export with images	15MB	4-7MB	50-70%
Text-heavy legal document	5MB	4MB	10-20%
CAD/engineering drawings (vector)	10MB	8-9MB	10-15%
Form with embedded fonts	3MB	1-2MB	30-50%

The quality trade-off

Unlike merging and splitting, compression involves trade-offs. Downsampling images means you cannot recover the original resolution later. Recompressing JPEG images introduces additional generation loss. For most business documents, these trade-offs are invisible and irrelevant. For archival or print-production workflows, you might want to keep the originals and compress only the copies you distribute.

A good rule of thumb: compress for the intended use case. An email attachment that will be viewed on a laptop screen does not need 300 DPI images. A document going to a commercial printer does.

Password Protection and Encryption

PDF supports two distinct types of password protection, and understanding the difference is critical because one provides genuine security and the other is essentially theater.

User password vs. owner password

User password (open password): The document cannot be opened at all without entering this password. The content is encrypted — without the password, the file is unreadable gibberish. This provides real security.

Owner password (permissions password): The document opens normally, but certain operations are restricted — printing, copying text, editing, or extracting pages. The content is not encrypted in a meaningful way when only an owner password is set. Any competent PDF tool can strip the owner password and remove all restrictions in seconds.

This is not a flaw in any particular tool — it is a fundamental limitation of the PDF specification. The owner password is a "please don't" flag, not a security mechanism. If you need to actually prevent someone from reading a document, use a user password with strong encryption.

Encryption standards

Standard	Key Length	Security Level	Compatibility
RC4 40-bit	40 bits	Broken — crackable in seconds	PDF 1.1+ (Acrobat 3+)
RC4 128-bit	128 bits	Weak — crackable with modest hardware	PDF 1.4+ (Acrobat 5+)
AES 128-bit	128 bits	Strong for most purposes	PDF 1.6+ (Acrobat 7+)
AES 256-bit	256 bits	Military-grade — not practically crackable	PDF 2.0+ (Acrobat X+)

Always use AES-256 encryption. There is no performance or compatibility reason to use anything weaker in 2026. Every modern PDF reader supports it, and the encryption/decryption overhead is negligible on modern hardware.

Password strength matters more than encryption strength

Even AES-256 encryption is useless with a weak password. The encryption algorithm is not the weak link — the password is. A four-digit numeric password gives an attacker only 10,000 combinations to try, which takes about one second on modern hardware. A well-chosen 12-character password with mixed case, numbers, and symbols has approximately 475 trillion trillion combinations, which is not practically crackable.

Guidelines for PDF passwords:

Minimum 12 characters for sensitive documents
Mix character types — uppercase, lowercase, numbers, symbols
Avoid dictionary words and predictable patterns
Use a unique password — do not reuse passwords from your email or other accounts
Share the password through a different channel than the PDF itself. If you email the PDF, send the password via text message or a separate email. Never put the password in the same email as the document.

When to use password protection

Legal documents — contracts, NDAs, settlement agreements
Financial records — tax returns, bank statements, financial reports
Medical documents — test results, insurance claims, prescriptions
HR documents — offer letters, salary information, performance reviews
Intellectual property — patent drafts, trade secrets, unpublished research
Personal documents — passport scans, ID copies, Social Security documents

PDF to Images: Turning Pages Into Pictures

Sometimes you need individual page images rather than a PDF. This is more common than you might expect.

Why convert PDF pages to images

Social media sharing — you cannot post a PDF on Instagram or LinkedIn, but you can post a JPG or PNG of each page
Presentations — embedding a PDF page into a PowerPoint or Google Slides deck as a high-quality image
Website content — displaying document pages on a webpage without requiring a PDF viewer
Thumbnails and previews — generating preview images for a document management system
Annotation and markup — some markup tools work better with images than with PDFs
Archival in image formats — some workflows require TIFF or PNG archives of each page

Choosing the right image format

Format	Best For	Resolution Recommendation
PNG	Text-heavy pages, diagrams, screenshots	150-300 DPI
JPEG	Photo-heavy pages, scanned documents	150-200 DPI, quality 85-92
WebP	Web display (smaller files than PNG/JPEG)	150 DPI for web
TIFF	Archival, print production	300 DPI

For most purposes, PNG at 150-200 DPI strikes the right balance between file size and readability. Text stays sharp (PNG is lossless), and the resolution is sufficient for on-screen viewing. If you need to print the page images, bump up to 300 DPI.

JPEG works well for scanned document pages (which are essentially photographs already), but avoid JPEG for pages with crisp text on a white background — the compression introduces visible artifacts around sharp edges.

Resolution and DPI explained

A PDF page does not have a native resolution — it is a vector format with resolution-independent content. When you convert a page to an image, you choose the rendering resolution:

72 DPI — screen resolution, small files, text may appear fuzzy
150 DPI — good for on-screen viewing, readable text, moderate file size
300 DPI — print quality, crisp text, larger files
600 DPI — high-quality print, very sharp text, large files (usually overkill)

Each doubling of DPI quadruples the pixel count and roughly quadruples the file size. A standard Letter-size page at 150 DPI produces an image of about 1275 x 1650 pixels. At 300 DPI, that same page becomes 2550 x 3300 pixels — four times as many pixels.

Images to PDF: Building Documents From Scans and Photos

The reverse operation — combining images into a PDF — is essential for anyone who works with physical documents.

Common use cases

Digitizing paper documents — scanning receipts, letters, contracts, or notes and combining them into a single searchable document
Creating photo portfolios — assembling a sequence of images into a presentable, paginated format
Submitting documentation — many forms and applications require documents in PDF format
Archiving photographs — PDFs provide a standardized container with metadata support
Creating simple slideshows — each image becomes a page that can be viewed in any PDF reader

Getting good results

The quality of your image-to-PDF conversion depends primarily on the quality of your source images:

For scanned documents:

Scan at 300 DPI minimum for text documents
Use color scanning only if the color matters (forms, letterhead). Grayscale produces much smaller files.
Straighten pages before scanning — most phone scanning apps do this automatically
Crop to the page boundaries to avoid wasted space

For photos:

Use the original high-resolution images, not thumbnails or compressed previews
Consider the page size — a 4000x3000 photo will be scaled to fit the page, and portrait vs. landscape orientation matters
If combining many photos, consider whether you want one photo per page or a multi-photo layout

Page sizing: Most tools default to standard page sizes (Letter or A4) and scale each image to fit within the page margins. Some tools let you set custom page sizes that match each image's aspect ratio exactly, which eliminates white borders and gives a cleaner result for photo portfolios.

Supported image formats

Virtually any common image format can be embedded into a PDF:

JPEG — the most common source format for photos and scans, embeds directly without re-encoding
PNG — lossless, great for screenshots and diagrams with text
WebP — may need conversion to JPEG or PNG before embedding, depending on the tool
TIFF — standard for professional scanning workflows
HEIC/HEIF — iPhone photos, typically need conversion before embedding

Watermarks: Branding and Draft Protection

Watermarks add a text or image overlay to PDF pages, serving two main purposes: branding and document status marking.

Types of watermarks

Text watermarks are the most common. They typically display words like "CONFIDENTIAL," "DRAFT," "SAMPLE," or a company name across each page. They are rendered semi-transparent so the underlying content remains readable.

Image watermarks overlay a logo or graphic. These are common for branded documents, proposal templates, and professional reports.

Diagonal vs. horizontal placement. Diagonal watermarks (rotated 45 degrees across the page) are the classic style for status indicators like "DRAFT." Horizontal watermarks are more common for logos and branding elements, usually placed in a header, footer, or corner.

When to use watermarks

Draft documents — clearly marking documents that are not final to prevent premature distribution or reliance on incomplete information
Confidential materials — adding a visual reminder that the content is restricted, though note that a watermark does not prevent copying or sharing
Branded deliverables — adding your company logo to client-facing reports, proposals, and presentations
Sample or preview copies — marking documents that are provided for review but are not the final version
Legal documents — some jurisdictions and organizations require specific markings on certain document types

Watermark limitations

It is important to understand what watermarks do and do not provide:

Watermarks do not prevent copying. Anyone can screenshot the content or retype it. A watermark is a visual deterrent, not a security mechanism.
Watermarks can be removed. A watermark added as a PDF overlay can often be removed with PDF editing tools. If you need tamper-resistant watermarks, you need a more sophisticated approach like baking the watermark into flattened page images.
Watermarks affect readability. A watermark that is too opaque or too large will make the document hard to read. Keep opacity at 10-20% for text watermarks and use a light gray color for best results.

Privacy Advantages of Local Processing

Here is where the conversation gets serious. PDFs are not just any file type — they are the format people use for their most sensitive documents. Tax returns, medical records, legal contracts, financial statements, identification documents, employment agreements. When you upload a PDF to an online tool, you are handing someone else a copy of that document.

What happens when you use a cloud-based PDF tool

Most online PDF tools — including some very popular ones — follow this process:

You upload your file to their server
Their server processes it (merge, split, compress, whatever operation you requested)
The result is stored temporarily on their server and you download it
The file is (supposedly) deleted after some time — 1 hour, 24 hours, 7 days, depending on the service

The problems with this approach are substantial:

Your file traverses the internet. During upload and download, the file passes through multiple network nodes. Unless the connection uses end-to-end encryption (not just HTTPS), intermediaries could potentially intercept the data.

Your file exists on someone else's server. Even temporarily, your document is on hardware you do not control. You are trusting the service operator's security practices, their access controls, their employee vetting, and their data deletion procedures.

You cannot verify deletion. When the service says your file is deleted after 24 hours, you have no way to confirm that. Are backups made? Are logs kept? Do they replicate data across multiple data centers? You simply do not know.

Terms of service vary wildly. Some free PDF tools include clauses in their terms of service that grant them rights to use uploaded content for training machine learning models, improving their services, or other purposes. Most people do not read the terms before uploading their tax return.

Data breach risk. Any server that processes and stores millions of documents is a high-value target for attackers. Data breaches at file processing services have exposed millions of users' documents.

How local processing eliminates these risks

Browser-based tools like Fileza process files entirely within your browser using client-side JavaScript. The architecture is fundamentally different:

You select your file — the browser reads it into local memory
JavaScript processes it — libraries like pdf-lib manipulate the PDF structure directly in your browser's memory
The result is generated locally — the output file is created in your browser
You download the result — from your own browser to your own disk

At no point does the file leave your device. There is no upload, no server processing, no temporary storage, no network transmission of your document content. The tool's server delivers the JavaScript code to your browser, but your actual files never touch that server.

This is not a minor distinction — it is a completely different security model. You do not have to trust anyone's data handling practices, because your data never reaches anyone else's infrastructure.

When local processing matters most

Tax documents — W-2s, 1099s, and tax returns contain Social Security numbers, income details, and employer information
Medical records — HIPAA-regulated documents should never be uploaded to non-compliant services
Legal contracts — confidentiality clauses may prohibit uploading agreements to third-party services
Financial statements — bank statements and financial reports contain account numbers and transaction histories
Identification documents — passport scans, driver's license copies, and similar ID documents are identity theft goldmines
Business confidential — internal reports, strategic plans, and proprietary information

For these document types, local processing is not just a convenience — it is the only approach that does not introduce unnecessary risk.

Quick Reference: PDF Operations at a Glance

Here is a side-by-side comparison of every major PDF operation, what it does, whether it modifies content, and when to use it:

Operation	What It Does	Lossless?	Typical Use Case	Processing Speed
Merge	Combines multiple PDFs into one	Yes	Application packets, report compilation	Fast (seconds)
Split	Extracts specific pages	Yes	Pulling chapters, isolating pages	Fast (seconds)
Compress	Reduces file size	No (images recompressed)	Email attachments, upload limits	Medium (seconds to minutes)
Password protect	Encrypts with password	Yes (content unchanged)	Securing sensitive documents	Fast (seconds)
PDF to images	Renders pages as PNG/JPG	Depends on format and DPI	Social media, presentations	Medium
Images to PDF	Embeds images as pages	Depends on settings	Digitizing scans, photo portfolios	Fast
Watermark	Adds text/image overlay	No (overlay added)	Branding, draft marking	Fast (seconds)
Reorder pages	Rearranges page sequence	Yes	Fixing page order	Fast (seconds)
Rotate pages	Changes page orientation	Yes (metadata change)	Fixing scanned orientation	Fast (seconds)
Remove pages	Deletes specified pages	Yes (remaining pages unchanged)	Removing irrelevant content	Fast (seconds)

The "lossless" column is important. Operations that work with the PDF structure (merge, split, reorder, rotate, remove) do not modify any content and produce bit-perfect copies of the affected pages. Operations that modify content (compress, watermark) alter the file in ways that cannot be perfectly reversed.

Putting It All Together: Real-World Workflows

Understanding individual operations is useful, but most real tasks require combining multiple operations. Here are some common workflows:

Preparing a job application

Convert your resume from DOCX to PDF (preserves formatting across systems)
Convert reference letters and certifications from images to PDF
Merge resume + cover letter + portfolio samples + references into one file
Compress the merged file if it exceeds the upload limit (many job portals cap at 5-10MB)
Name the file clearly: "LastName_FirstName_Application.pdf"

Sending a contract for signature

Split the signature page from the full contract
Send the full contract for review as one file
Send the signature page separately for signing
Merge the signed page back into the full contract
Password protect the final executed version

Archiving financial records

Scan receipts and statements (use your phone's scanning app for automatic edge detection and straightening)
Combine scanned images into category-based PDFs (one per month, one per expense type, etc.)
Compress to reduce storage requirements
Password protect with a strong password using AES-256 encryption
Store with descriptive filenames and organized folder structure

Creating a client deliverable

Merge your report, appendices, and supporting documents
Add a watermark with your company logo or "CONFIDENTIAL" marking
Compress to a reasonable size for email delivery
Password protect if the content is sensitive
Send the PDF with the password communicated through a separate channel

The Bottom Line

PDFs are the universal format for documents that matter, and knowing how to manipulate them efficiently saves real time and prevents real headaches. The operations themselves are straightforward — merge to combine, split to extract, compress to shrink, encrypt to protect.

The more important decision is where you process your PDFs. For casual documents — a recipe collection, a public form, an event flyer — any tool will do. But for anything containing personal, financial, medical, or business-sensitive information, local processing is the only approach that does not require you to trust a third party with your data.

Every PDF operation covered in this guide — merging, splitting, compressing, password protection, image conversion, and watermarking — can be done entirely within your browser. No uploads, no accounts, no trusting someone else's server with your tax returns. Your files stay on your device from start to finish.

That is how it should work.