Monday | 8 September, 2008
ARN
Recovering PDF redaction
PDF redaction exposed by security researcher.
Carl Jongsma (Computerworld) 09 May, 2008 10:08:57

Unintentional exposure of sensitive data through Word files has caused problems for companies in the past, especially when people forget that Track Changes can let document recipients view information that was deleted or sanitised for release.

PDF files have caused similar unintended disclosures. In some cases the attempt to redact information amounted to nothing more than placing a black square or rectangle over the text, which left the original text in the file and made it simple to recover.

Didier Stevens, who gained attention for his recent discoveries relating to hiding content in PDF files, has again discovered a side effect of creating PDF files that might lead to unexpected information disclosure for the unaware.

The concept of an Incremental Update in PDF files is relatively well known: changes to an existing PDF document don't result in the file being completely rewritten on saving. How an incremental update is actually represented in the raw PDF file is less well known. Essentially, the amended data is appended to the end of the original document, and the process repeats for each subsequent update.

Stevens discovered that stripping away an update and recovering the original content is extremely simple. For documents that have been redacted or otherwise modified by replacing text, rather than by drawing a black rectangle over it, the deleted or replaced text can be recovered along with the original unmodified document in a single step. Making the process even simpler, it can often be done with a text editor, and it doesn't matter if the PDF content has been encrypted.
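As a rough illustration of the idea, the Python sketch below treats each %%EOF marker in the raw file as the end of one saved revision and cuts the file back to the previous one. It is not Stevens' tool or method. The function names are hypothetical, and it assumes a non-linearized PDF in which every %%EOF really does close a revision; linearized ("fast web view") files can contain an extra %%EOF near the start, and marker bytes inside a stream would confuse it.

```python
EOF_MARKER = b"%%EOF"


def revision_ends(data: bytes) -> list[int]:
    """Return the byte offset just past each %%EOF marker in the file."""
    ends = []
    pos = data.find(EOF_MARKER)
    while pos != -1:
        ends.append(pos + len(EOF_MARKER))
        pos = data.find(EOF_MARKER, pos + 1)
    return ends


def strip_last_update(data: bytes) -> bytes:
    """Drop the most recent incremental update, if there is one.

    Assumption: each %%EOF closes one revision, so cutting the file
    just after the second-to-last marker leaves the earlier revision.
    """
    ends = revision_ends(data)
    if len(ends) < 2:
        # Only one revision (or no marker at all): nothing to strip.
        return data
    return data[:ends[-2]] + b"\n"
```

To try it, read a file with open(path, "rb").read(), check how many offsets revision_ends returns, and write the result of strip_last_update to a new file. More than one marker is only a hint that earlier revisions may be present, not proof.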

There are some efforts to increase awareness of the risk of document metadata, but this recent rediscovery adds another item to check prior to releasing documents for wider consumption. It is also another simple tool for forensic researchers to help in recovering original data from a document. A saving grace appears to be that many applications that export to PDF as part of their Save process do not support incremental updates. If you want to redact data, the safer route is to do it in the original application and then export the redacted version.

It is nothing that can't be gained from reading the PDF specification, but who takes the time to read in depth the technical specification for the data format that they are using?
