June 16, 2026

PDF to Excel Formatting Problems and How to Fix

Convert a PDF to Excel right here, no sign-up to try:

Drop your PDF here or click to browse

PDF files up to 50MB

Uploading...

First file free. Files are deleted after processing.

You upload a clean-looking PDF, open the converted Excel file, and the table is a mess. Columns are merged, rows are split across two lines, headers sit in the wrong place, and every dollar amount refuses to add up. This is the most common complaint about PDF to Excel conversion, and it is not your fault. It comes from how PDFs store information.

This guide explains why a PDF to Excel conversion loses formatting, walks through each specific problem (numbers turning into text, data dumping into one column, merged cells, broken scanned tables), and gives you a fix for each one. The converter at the top of this page reads the real table structure instead of guessing from spacing, so you can try your file first and read on if you still need to clean something up.

Why does converting a PDF to Excel mess up the formatting?

A PDF stores text by position on the page, not in a grid of rows and columns. There are no real cells underneath the table you see, only characters placed at x and y coordinates with lines drawn around them. So a converter has to reverse-engineer where the columns and rows are, usually by measuring the gaps between characters. When the spacing is tight, a column is narrow, or a cell wraps onto two lines, the converter guesses wrong and the formatting breaks.

That single fact explains almost every problem you will hit: misaligned columns, split rows, merged cells, and numbers that look right but behave like text. The rest of this article covers each one.

Why are my numbers showing as text after converting a PDF to Excel?

Numbers turn into text because the converter copies the characters exactly as they appear in the PDF, including dollar signs, commas, parentheses for negatives, and trailing spaces. Excel sees "$1,250.00" or "(450.00)" as a label, not a value, so it will not sum or average the column. You usually see a small green triangle in the corner of the cell and a warning that the number is stored as text.

To fix it quickly: select the column, click the warning icon, and choose Convert to Number. If that does not work because of currency symbols or parentheses, use Find and Replace to strip the "$" and "," characters first, then wrap negatives in a formula or replace the parentheses with a minus sign. A converter that recognizes currencies and negative numbers up front, like an accurate PDF to Excel converter, hands back real numbers so you skip this cleanup entirely.

Why does my PDF data go into one column or merged cells in Excel?

Data lands in one column when you copy and paste straight from the PDF. The paste keeps the text as a single block per row because the PDF has no column boundaries to carry over, so a whole row of figures stacks into one cell. The opposite problem, merged and blank cells scattered through the sheet, happens when a spacing-based converter misreads where one column ends and the next begins.

If your data is jammed into one column, select it, go to Data and then Text to Columns, and split it on the delimiter that separates your values. This works maybe 60 to 70 percent of the time and struggles with descriptions that contain spaces. For merged-cell chaos, there is no clean manual fix beyond rebuilding the table by hand. The reliable route is a tool that reads the actual table layout. See how to copy a table from a PDF to Excel without it collapsing into one column.

How do I fix formatting issues when converting a PDF to Excel?

Fix formatting issues by cleaning the output in this order: split merged data with Text to Columns, convert text-numbers back to real numbers, remove blank rows and columns, then reapply headers. Doing it in that sequence stops you from fixing the same cell twice. Here is the practical checklist:

  • Split stuck columns with Data, then Text to Columns, using a space or comma delimiter.
  • Convert text to numbers by selecting the column and choosing Convert to Number, or multiply by 1 in a helper column.
  • Strip stray characters ($, commas, footnote marks) with Find and Replace.
  • Delete empty rows and columns the converter left behind.
  • Rebuild merged headers by hand so each column has one clean label.

The faster path is to avoid the cleanup. A converter that detects tables structurally returns numbers as numbers, dates as dates, and one row per record. If you want to compare methods, our guide on converting a PDF to Excel without losing formatting lays out the options side by side.

How do I keep formatting when converting a PDF to Excel?

To keep formatting, use a tool that reads the PDF's table structure rather than its character spacing, and check the result in a preview before you rely on it. Copy and paste and most free online tools work from spacing, which is why they break on tight or multi-column layouts. A structural converter keeps rows and columns aligned, numbers numeric, and dates in date format so the file is usable the moment it downloads.

No tool is perfect on every layout, so previewing matters. Run your real file through the PDF to Excel converter at the top of this page, look at the preview, and confirm the columns line up before you build formulas on top of it. For complex financial documents, that one check saves an hour of manual repair later.

Why does a scanned PDF convert to Excel so badly?

A scanned PDF converts badly because it is an image, not text. There are no characters to read, just a picture of a page, so a plain converter returns blank cells or garbled rows. The file has to go through OCR (optical character recognition) first to turn the picture into readable text, and only then can the table be rebuilt. Most free tools and Excel's own From PDF import skip OCR entirely.

If your source is a scan or a photo of a statement, use a converter with built-in OCR. Our OCR PDF to Excel tool reads scanned and image-based files and reconstructs the table, which is the only way to get clean rows out of a scanned document. For a deeper walkthrough, see how to convert a scanned PDF to Excel.

The short version

PDF to Excel formatting problems come down to one thing: a PDF has no real table underneath it, so the converter has to guess. Spacing-based tools and copy-paste guess badly, which is why you get merged cells, one-column dumps, and numbers that act like text. You can clean most of it up with Text to Columns, Convert to Number, and Find and Replace, but the cleaner fix is a converter that reads the table structure and shows you a preview. Upload your file above and check the result before you spend time fixing anything by hand.