PDF Tools Complete Guide: Merge, Split, Compress & Protect Your PDFs

Master every PDF operation — merging, splitting, compressing, password-protecting, watermarking, and converting. Learn why browser-based tools are the safest option for sensitive documents.

Published February 18, 2026 · Updated February 18, 2026

PDFs are the cockroaches of the digital world — and I mean that as a compliment. They survive everything. Send a PDF from a Mac to a Windows machine to a Linux server to someone's phone, and it looks identical everywhere. That resilience is exactly why the format has dominated document exchange for over 30 years. Contracts, invoices, research papers, tax forms, resumes, instruction manuals — if a document matters, it is probably a PDF.

But here is the paradox: the very thing that makes PDFs great for sharing makes them terrible for working with. The format was engineered from the ground up to be a faithful digital representation of a printed page. It was never designed to be edited, rearranged, or manipulated after creation. And yet, that is exactly what we need to do with them constantly — merge five documents into one application packet, extract three pages from a 200-page report, shrink a 50MB scan to something that fits in an email, or slap a password on a contract before sending it to a client.

This guide covers every major PDF operation you are likely to need, explains how each one actually works under the hood, and shows you how to do it without uploading sensitive documents to someone else's server.

Why PDFs Are Hard to Edit

Before diving into specific tools, it helps to understand why PDFs resist manipulation in the first place. This is not just academic — it explains why some operations are trivial and others are genuinely difficult.

A PDF is not a flow document like a Word file. In a .docx, text is stored as a stream of characters with formatting instructions: "this paragraph is 12pt Times New Roman, left-aligned, with 1.5 line spacing." The word processor dynamically reflows the text to fit whatever page size and margins you define. Move a paragraph, and everything below it shifts automatically.

A PDF works completely differently. Each page is essentially a canvas with absolute coordinates. The letter "A" in a PDF is not part of a text stream — it is an instruction that says "draw glyph 65 from font subset XYZ at position (72.5, 684.2) on page 3." Every character, every line, every image is positioned with exact coordinates. There is no concept of "flow." Move one element, and nothing else adjusts.

This coordinate-based architecture explains several things:

  • Why PDFs look identical everywhere — there is no reflow to go wrong. The rendering engine just draws exactly what the file specifies.
  • Why editing text in a PDF is awkward — adding a word does not push subsequent text forward. You have to manually reposition everything that follows.
  • Why merging and splitting are easy — you are just copying complete page objects from one file to another. No content modification is needed.
  • Why compression focuses on images — the text instructions are already tiny. It is the embedded images that bloat PDF files.

Understanding this architecture makes the rest of the guide much more intuitive. Some operations work with the PDF structure (merge, split, reorder) and are clean, lossless, and fast. Others work against it (text editing, reformatting) and are inherently messy.

Merging PDFs: Combining Multiple Documents Into One

Merging is probably the most common PDF operation, and fortunately it is one of the simplest because it works with the format's architecture rather than against it.

When you need to merge

The use cases are everywhere:

  • Job applications — combining your resume, cover letter, portfolio samples, and references into a single file that a hiring manager can review without juggling attachments
  • Contract packages — assembling the main agreement, exhibits, signatures pages, and supporting documents into one complete package
  • Report compilation — combining monthly reports from different departments into a quarterly summary
  • School assignments — merging your essay, bibliography, and appendix scans into one submission
  • Tax preparation — combining W-2s, 1099s, receipts, and other supporting documents into a single file for your accountant
  • Real estate — assembling purchase agreements, inspection reports, disclosures, and addenda

How merging works technically

When you merge PDFs, the tool is not "opening" the documents in the way a PDF viewer does. Instead, it reads the internal file structure — the page tree, font resources, image objects, and cross-reference tables — and creates a new PDF that incorporates the page objects from each source file.

Modern libraries like pdf-lib (which Fileza uses) handle this by:

  1. Parsing each source PDF's cross-reference table to locate all objects
  2. Copying page objects and their dependencies (fonts, images, color profiles) into a new document
  3. Rebuilding the page tree to include all pages in the specified order
  4. Generating a new cross-reference table and writing the output file

Because the page content is copied verbatim — the exact same drawing instructions, font subsets, and image data — merging is lossless. The output pages are bit-for-bit identical to the originals. No quality degradation, no font substitution, no layout shifts.

Tips for better merges

  • Check page sizes before merging. If one document is Letter (8.5 x 11") and another is A4 (210 x 297mm), the merged PDF will have mixed page sizes. This is perfectly valid but may look odd when printed. Some tools let you normalize page sizes during the merge.
  • Watch the file size. Merging five 10MB PDFs produces a 50MB file. If each source PDF has its own embedded copy of common fonts (like Arial or Times New Roman), the merged file will have duplicate font data. Some advanced merge tools can deduplicate shared resources, but most do not.
  • Order matters. Arrange your source files in the correct sequence before merging. Reordering pages after merging is possible but adds an extra step.
  • Bookmarks and table of contents from individual source files may not transfer to the merged document, depending on the tool. If you need a navigable merged PDF, you might need to add bookmarks after merging.

Splitting PDFs: Extracting the Pages You Need

Splitting is the inverse of merging — extracting specific pages from a larger document into a new, smaller PDF. Like merging, it works with the PDF structure and is inherently lossless.

Common splitting scenarios

  • Extracting a signature page from a multi-page contract to send for signing
  • Pulling specific chapters from a textbook or manual
  • Separating individual invoices from a batch export
  • Isolating a single page that contains the information you actually need from a 50-page report
  • Breaking a large scan into individual documents (for example, splitting a stack of scanned receipts)

Splitting approaches

There are several ways to split a PDF, and different tools support different approaches:

Method Description Best For
Extract specific pages Select individual pages by number (e.g., pages 3, 7, 12) Grabbing a few specific pages
Extract page range Specify a continuous range (e.g., pages 5-15) Pulling a chapter or section
Split at every N pages Break into chunks of N pages each Dividing a batch document
Split into individual pages Every page becomes its own PDF Processing scanned stacks
Remove pages Keep everything except specified pages Deleting irrelevant pages

What splitting preserves

Since splitting copies complete page objects, the extracted pages retain:

  • All text with original fonts and positioning
  • All images at their original resolution and quality
  • Vector graphics and drawings
  • Annotations, form fields, and comments on the extracted pages
  • Links and bookmarks (though cross-page links to removed pages will break)

What may not transfer: document-level metadata, document-level JavaScript, and cross-page references like a table of contents that points to pages not included in the split output.

Compressing PDFs: Getting File Sizes Under Control

PDF compression is where things get more nuanced, because unlike merging and splitting, compression actually modifies the content inside the file. The degree of modification — and the quality trade-off — depends entirely on what is making the PDF large in the first place.

Why PDFs get bloated

Not all large PDFs are large for the same reason:

Cause Typical Size Impact Compressible?
High-resolution embedded images Often 80-95% of file size Very compressible
Uncompressed or poorly compressed images Major contributor Highly compressible
Embedded full font files (instead of subsets) 1-10MB per font Moderately compressible
Duplicate resources across pages Varies Deduplicate for savings
Metadata and hidden layers Usually minor Easy to strip
Vector graphics (charts, diagrams) Usually small Limited compression potential
Text content Negligible Already compact

The key insight: PDF compression is almost entirely about image optimization. A 30MB PDF with three high-resolution photos embedded at 300 DPI will compress beautifully because you can downsample those images to 150 DPI and recompress them, cutting the file size by 60-80%. A 30MB PDF that is a 500-page text-only legal document is already about as small as it can get — the text data is inherently compact.

Compression techniques

Effective PDF compression uses a combination of strategies:

Image downsampling. If a PDF contains a 4000x3000 pixel photo embedded at 300 DPI, but the image only occupies a 4-inch-wide area on the page, it only needs to be about 600 pixels wide for screen viewing (150 DPI) or 1200 pixels for print (300 DPI). Reducing the image resolution from 12 megapixels to 0.7 megapixels dramatically reduces size with no visible difference at the intended display size.

Image recompression. Some PDFs contain images stored as uncompressed bitmaps or with lossless PNG compression. Recompressing these as JPEG at quality 75-85 can cut image data by 80% or more with minimal visible quality loss.

Font subsetting. If a PDF embeds the complete Arial font file (800+ glyphs, several megabytes), but only uses 60 characters from it, font subsetting replaces the full font with a subset containing only the glyphs actually used. This can reduce font data from megabytes to kilobytes.

Object stream compression. PDF internal structures (cross-reference tables, object dictionaries) can be stored using more efficient compression algorithms. This typically saves a small percentage but adds up in large documents.

Metadata removal. Stripping author names, creation dates, editing history, and embedded thumbnails. Minor savings but useful for privacy.

Realistic compression expectations

PDF Type Original Size Compressed Size Reduction
Scanned document (300 DPI, color) 25MB 5-8MB 65-80%
Photo-heavy report 40MB 10-15MB 60-75%
Presentation export with images 15MB 4-7MB 50-70%
Text-heavy legal document 5MB 4MB 10-20%
CAD/engineering drawings (vector) 10MB 8-9MB 10-15%
Form with embedded fonts 3MB 1-2MB 30-50%

The quality trade-off

Unlike merging and splitting, compression involves trade-offs. Downsampling images means you cannot recover the original resolution later. Recompressing JPEG images introduces additional generation loss. For most business documents, these trade-offs are invisible and irrelevant. For archival or print-production workflows, you might want to keep the originals and compress only the copies you distribute.

A good rule of thumb: compress for the intended use case. An email attachment that will be viewed on a laptop screen does not need 300 DPI images. A document going to a commercial printer does.

Password Protection and Encryption

PDF supports two distinct types of password protection, and understanding the difference is critical because one provides genuine security and the other is essentially theater.

User password vs. owner password

User password (open password): The document cannot be opened at all without entering this password. The content is encrypted — without the password, the file is unreadable gibberish. This provides real security.

Owner password (permissions password): The document opens normally, but certain operations are restricted — printing, copying text, editing, or extracting pages. The content is not encrypted in a meaningful way when only an owner password is set. Any competent PDF tool can strip the owner password and remove all restrictions in seconds.

This is not a flaw in any particular tool — it is a fundamental limitation of the PDF specification. The owner password is a "please don't" flag, not a security mechanism. If you need to actually prevent someone from reading a document, use a user password with strong encryption.

Encryption standards

Standard Key Length Security Level Compatibility
RC4 40-bit 40 bits Broken — crackable in seconds PDF 1.1+ (Acrobat 3+)
RC4 128-bit 128 bits Weak — crackable with modest hardware PDF 1.4+ (Acrobat 5+)
AES 128-bit 128 bits Strong for most purposes PDF 1.6+ (Acrobat 7+)
AES 256-bit 256 bits Military-grade — not practically crackable PDF 2.0+ (Acrobat X+)

Always use AES-256 encryption. There is no performance or compatibility reason to use anything weaker in 2026. Every modern PDF reader supports it, and the encryption/decryption overhead is negligible on modern hardware.

Password strength matters more than encryption strength

Even AES-256 encryption is useless with a weak password. The encryption algorithm is not the weak link — the password is. A four-digit numeric password gives an attacker only 10,000 combinations to try, which takes about one second on modern hardware. A well-chosen 12-character password with mixed case, numbers, and symbols has approximately 475 trillion trillion combinations, which is not practically crackable.

Guidelines for PDF passwords:

  • Minimum 12 characters for sensitive documents
  • Mix character types — uppercase, lowercase, numbers, symbols
  • Avoid dictionary words and predictable patterns
  • Use a unique password — do not reuse passwords from your email or other accounts
  • Share the password through a different channel than the PDF itself. If you email the PDF, send the password via text message or a separate email. Never put the password in the same email as the document.

When to use password protection

  • Legal documents — contracts, NDAs, settlement agreements
  • Financial records — tax returns, bank statements, financial reports
  • Medical documents — test results, insurance claims, prescriptions
  • HR documents — offer letters, salary information, performance reviews
  • Intellectual property — patent drafts, trade secrets, unpublished research
  • Personal documents — passport scans, ID copies, Social Security documents

PDF to Images: Turning Pages Into Pictures

Sometimes you need individual page images rather than a PDF. This is more common than you might expect.

Why convert PDF pages to images

  • Social media sharing — you cannot post a PDF on Instagram or LinkedIn, but you can post a JPG or PNG of each page
  • Presentations — embedding a PDF page into a PowerPoint or Google Slides deck as a high-quality image
  • Website content — displaying document pages on a webpage without requiring a PDF viewer
  • Thumbnails and previews — generating preview images for a document management system
  • Annotation and markup — some markup tools work better with images than with PDFs
  • Archival in image formats — some workflows require TIFF or PNG archives of each page

Choosing the right image format

Format Best For Resolution Recommendation
PNG Text-heavy pages, diagrams, screenshots 150-300 DPI
JPEG Photo-heavy pages, scanned documents 150-200 DPI, quality 85-92
WebP Web display (smaller files than PNG/JPEG) 150 DPI for web
TIFF Archival, print production 300 DPI

For most purposes, PNG at 150-200 DPI strikes the right balance between file size and readability. Text stays sharp (PNG is lossless), and the resolution is sufficient for on-screen viewing. If you need to print the page images, bump up to 300 DPI.

JPEG works well for scanned document pages (which are essentially photographs already), but avoid JPEG for pages with crisp text on a white background — the compression introduces visible artifacts around sharp edges.

Resolution and DPI explained

A PDF page does not have a native resolution — it is a vector format with resolution-independent content. When you convert a page to an image, you choose the rendering resolution:

  • 72 DPI — screen resolution, small files, text may appear fuzzy
  • 150 DPI — good for on-screen viewing, readable text, moderate file size
  • 300 DPI — print quality, crisp text, larger files
  • 600 DPI — high-quality print, very sharp text, large files (usually overkill)

Each doubling of DPI quadruples the pixel count and roughly quadruples the file size. A standard Letter-size page at 150 DPI produces an image of about 1275 x 1650 pixels. At 300 DPI, that same page becomes 2550 x 3300 pixels — four times as many pixels.

Images to PDF: Building Documents From Scans and Photos

The reverse operation — combining images into a PDF — is essential for anyone who works with physical documents.

Common use cases

  • Digitizing paper documents — scanning receipts, letters, contracts, or notes and combining them into a single searchable document
  • Creating photo portfolios — assembling a sequence of images into a presentable, paginated format
  • Submitting documentation — many forms and applications require documents in PDF format
  • Archiving photographs — PDFs provide a standardized container with metadata support
  • Creating simple slideshows — each image becomes a page that can be viewed in any PDF reader

Getting good results

The quality of your image-to-PDF conversion depends primarily on the quality of your source images:

For scanned documents:

  • Scan at 300 DPI minimum for text documents
  • Use color scanning only if the color matters (forms, letterhead). Grayscale produces much smaller files.
  • Straighten pages before scanning — most phone scanning apps do this automatically
  • Crop to the page boundaries to avoid wasted space

For photos:

  • Use the original high-resolution images, not thumbnails or compressed previews
  • Consider the page size — a 4000x3000 photo will be scaled to fit the page, and portrait vs. landscape orientation matters
  • If combining many photos, consider whether you want one photo per page or a multi-photo layout

Page sizing: Most tools default to standard page sizes (Letter or A4) and scale each image to fit within the page margins. Some tools let you set custom page sizes that match each image's aspect ratio exactly, which eliminates white borders and gives a cleaner result for photo portfolios.

Supported image formats

Virtually any common image format can be embedded into a PDF:

  • JPEG — the most common source format for photos and scans, embeds directly without re-encoding
  • PNG — lossless, great for screenshots and diagrams with text
  • WebP — may need conversion to JPEG or PNG before embedding, depending on the tool
  • TIFF — standard for professional scanning workflows
  • HEIC/HEIF — iPhone photos, typically need conversion before embedding

Watermarks: Branding and Draft Protection

Watermarks add a text or image overlay to PDF pages, serving two main purposes: branding and document status marking.

Types of watermarks

Text watermarks are the most common. They typically display words like "CONFIDENTIAL," "DRAFT," "SAMPLE," or a company name across each page. They are rendered semi-transparent so the underlying content remains readable.

Image watermarks overlay a logo or graphic. These are common for branded documents, proposal templates, and professional reports.

Diagonal vs. horizontal placement. Diagonal watermarks (rotated 45 degrees across the page) are the classic style for status indicators like "DRAFT." Horizontal watermarks are more common for logos and branding elements, usually placed in a header, footer, or corner.

When to use watermarks

  • Draft documents — clearly marking documents that are not final to prevent premature distribution or reliance on incomplete information
  • Confidential materials — adding a visual reminder that the content is restricted, though note that a watermark does not prevent copying or sharing
  • Branded deliverables — adding your company logo to client-facing reports, proposals, and presentations
  • Sample or preview copies — marking documents that are provided for review but are not the final version
  • Legal documents — some jurisdictions and organizations require specific markings on certain document types

Watermark limitations

It is important to understand what watermarks do and do not provide:

  • Watermarks do not prevent copying. Anyone can screenshot the content or retype it. A watermark is a visual deterrent, not a security mechanism.
  • Watermarks can be removed. A watermark added as a PDF overlay can often be removed with PDF editing tools. If you need tamper-resistant watermarks, you need a more sophisticated approach like baking the watermark into flattened page images.
  • Watermarks affect readability. A watermark that is too opaque or too large will make the document hard to read. Keep opacity at 10-20% for text watermarks and use a light gray color for best results.

Privacy Advantages of Local Processing

Here is where the conversation gets serious. PDFs are not just any file type — they are the format people use for their most sensitive documents. Tax returns, medical records, legal contracts, financial statements, identification documents, employment agreements. When you upload a PDF to an online tool, you are handing someone else a copy of that document.

What happens when you use a cloud-based PDF tool

Most online PDF tools — including some very popular ones — follow this process:

  1. You upload your file to their server
  2. Their server processes it (merge, split, compress, whatever operation you requested)
  3. The result is stored temporarily on their server and you download it
  4. The file is (supposedly) deleted after some time — 1 hour, 24 hours, 7 days, depending on the service

The problems with this approach are substantial:

Your file traverses the internet. During upload and download, the file passes through multiple network nodes. Unless the connection uses end-to-end encryption (not just HTTPS), intermediaries could potentially intercept the data.

Your file exists on someone else's server. Even temporarily, your document is on hardware you do not control. You are trusting the service operator's security practices, their access controls, their employee vetting, and their data deletion procedures.

You cannot verify deletion. When the service says your file is deleted after 24 hours, you have no way to confirm that. Are backups made? Are logs kept? Do they replicate data across multiple data centers? You simply do not know.

Terms of service vary wildly. Some free PDF tools include clauses in their terms of service that grant them rights to use uploaded content for training machine learning models, improving their services, or other purposes. Most people do not read the terms before uploading their tax return.

Data breach risk. Any server that processes and stores millions of documents is a high-value target for attackers. Data breaches at file processing services have exposed millions of users' documents.

How local processing eliminates these risks

Browser-based tools like Fileza process files entirely within your browser using client-side JavaScript. The architecture is fundamentally different:

  1. You select your file — the browser reads it into local memory
  2. JavaScript processes it — libraries like pdf-lib manipulate the PDF structure directly in your browser's memory
  3. The result is generated locally — the output file is created in your browser
  4. You download the result — from your own browser to your own disk

At no point does the file leave your device. There is no upload, no server processing, no temporary storage, no network transmission of your document content. The tool's server delivers the JavaScript code to your browser, but your actual files never touch that server.

This is not a minor distinction — it is a completely different security model. You do not have to trust anyone's data handling practices, because your data never reaches anyone else's infrastructure.

When local processing matters most

  • Tax documents — W-2s, 1099s, and tax returns contain Social Security numbers, income details, and employer information
  • Medical records — HIPAA-regulated documents should never be uploaded to non-compliant services
  • Legal contracts — confidentiality clauses may prohibit uploading agreements to third-party services
  • Financial statements — bank statements and financial reports contain account numbers and transaction histories
  • Identification documents — passport scans, driver's license copies, and similar ID documents are identity theft goldmines
  • Business confidential — internal reports, strategic plans, and proprietary information

For these document types, local processing is not just a convenience — it is the only approach that does not introduce unnecessary risk.

Quick Reference: PDF Operations at a Glance

Here is a side-by-side comparison of every major PDF operation, what it does, whether it modifies content, and when to use it:

Operation What It Does Lossless? Typical Use Case Processing Speed
Merge Combines multiple PDFs into one Yes Application packets, report compilation Fast (seconds)
Split Extracts specific pages Yes Pulling chapters, isolating pages Fast (seconds)
Compress Reduces file size No (images recompressed) Email attachments, upload limits Medium (seconds to minutes)
Password protect Encrypts with password Yes (content unchanged) Securing sensitive documents Fast (seconds)
PDF to images Renders pages as PNG/JPG Depends on format and DPI Social media, presentations Medium
Images to PDF Embeds images as pages Depends on settings Digitizing scans, photo portfolios Fast
Watermark Adds text/image overlay No (overlay added) Branding, draft marking Fast (seconds)
Reorder pages Rearranges page sequence Yes Fixing page order Fast (seconds)
Rotate pages Changes page orientation Yes (metadata change) Fixing scanned orientation Fast (seconds)
Remove pages Deletes specified pages Yes (remaining pages unchanged) Removing irrelevant content Fast (seconds)

The "lossless" column is important. Operations that work with the PDF structure (merge, split, reorder, rotate, remove) do not modify any content and produce bit-perfect copies of the affected pages. Operations that modify content (compress, watermark) alter the file in ways that cannot be perfectly reversed.

Putting It All Together: Real-World Workflows

Understanding individual operations is useful, but most real tasks require combining multiple operations. Here are some common workflows:

Preparing a job application

  1. Convert your resume from DOCX to PDF (preserves formatting across systems)
  2. Convert reference letters and certifications from images to PDF
  3. Merge resume + cover letter + portfolio samples + references into one file
  4. Compress the merged file if it exceeds the upload limit (many job portals cap at 5-10MB)
  5. Name the file clearly: "LastName_FirstName_Application.pdf"

Sending a contract for signature

  1. Split the signature page from the full contract
  2. Send the full contract for review as one file
  3. Send the signature page separately for signing
  4. Merge the signed page back into the full contract
  5. Password protect the final executed version

Archiving financial records

  1. Scan receipts and statements (use your phone's scanning app for automatic edge detection and straightening)
  2. Combine scanned images into category-based PDFs (one per month, one per expense type, etc.)
  3. Compress to reduce storage requirements
  4. Password protect with a strong password using AES-256 encryption
  5. Store with descriptive filenames and organized folder structure

Creating a client deliverable

  1. Merge your report, appendices, and supporting documents
  2. Add a watermark with your company logo or "CONFIDENTIAL" marking
  3. Compress to a reasonable size for email delivery
  4. Password protect if the content is sensitive
  5. Send the PDF with the password communicated through a separate channel

The Bottom Line

PDFs are the universal format for documents that matter, and knowing how to manipulate them efficiently saves real time and prevents real headaches. The operations themselves are straightforward — merge to combine, split to extract, compress to shrink, encrypt to protect.

The more important decision is where you process your PDFs. For casual documents — a recipe collection, a public form, an event flyer — any tool will do. But for anything containing personal, financial, medical, or business-sensitive information, local processing is the only approach that does not require you to trust a third party with your data.

Every PDF operation covered in this guide — merging, splitting, compressing, password protection, image conversion, and watermarking — can be done entirely within your browser. No uploads, no accounts, no trusting someone else's server with your tax returns. Your files stay on your device from start to finish.

That is how it should work.