20100628 igc native format

Upload: chuck-rothman

Post on 03-Jun-2018

219 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/12/2019 20100628 IGC Native Format

    1/6

    AN IGC WHITE PAPER

    By Christine Musil

    Director of Marketing

    Informative Graphics Corporation

    S m a r t D o c u m e n t S o f t w a r e

    The Reality of Native Format

    Production and Redaction

  • 8/12/2019 20100628 IGC Native Format

    2/6

  • 8/12/2019 20100628 IGC Native Format

    3/6

    OConnor takes a slightly different view, preferring a more

    inclusive category of reasonably usable that is used in therules and under the Sedona comments noted above. It in-

    cludes documents that are not in their native format, but aresearchable and have the metadata intact so they are highlyfunctional for discovery purposes. Socha has a different take

    on reasonably usable, seeing it as an attribute of one of thefour forms of production summarized above rather than as

    a separate form. Says OConnor, the category reasonablyusable is the one used in the rules so I prefer that phrase, but I

    think that George and I are on the same page with regards tothe handling of documents, he just drills down a little deeper

    with his definitions.

    Preservation and production of metadata

    in a usable form is at the heart of manyarguments against converting native documents to TIFF or

    PDF. OConnor notes, Metadata was originally a term used torefer to computer data such as date created/modified, author,etc. Now, the definition of metadata has expanded to include

    hidden material that does not appear when a document isprinted out. Examples of this would include hidden rows, cells

    and formulas in Excel, and Track Changes and comments andmarkups in Word. Just to show that this, too, is an unsettledarea, according to Socha metadata was originally defined as

    data about data, a definition he says continues to apply to-day. (For those who want to dig deeper into the breadth and

    depth of metadata, Socha suggests searching on metadataas well as terms such as Dublin Core.)

    Converting documents to PDF (even text-searchable ones) orTIFF may alter or fail to include the original creation date for

    the document or strip out all or most unseen content, againcausing some requestors to demand native production with

    metadata intact. However, the created and modified datesare not always relevant, so this does not necessarily preclude

    TIFF or PDF being used as the production format. The unseen

    data, such as text revisions or comments, are also not brought

    through to the new TIFF or PDF file. Since these hidden ele-ments may contain privileged information, this could be help-ful to the producing party. It should be noted, however, that

    the mere suspicion of privileged content is not a sufficientbasis for a blanket withholding of metadata. Spreadsheets are

    a notable exception, where eliminating formulas and hidingcells may be committing spoliation because they are intrinsic

    to the integrity of the document.

    There is clear case law supporting the production of financial

    spreadsheets in their native format and discussion of spread

    sheets warrants particular consideration when determining

    production formats.

    In Williams v. Sprint/United Management Co.Case (230 F.R.D.640) (D. KAN. 2005), Magistrate Judge Waxse from the Kan-sas Federal District Court decided that the defendant, when

    specifically instructed to produce digital spreadsheets in themanner in which they were kept in the ordinary course of

    business, should be subject to sanctions if they had scrubbedmetadata and locked certain spreadsheet cells prior to pro-

    duction. In this case, Judge Waxse ordered a reproduction ofthe documents at the defendants expense, but decided not to

    impose sanctions because the laws on metadata productionat the time were still new and fairly ambiguous.

    Even if you arent sure if metadata will be relevant, you always

    should retain an untainted set of the files, and only processfrom copies. In his 2005 paper Beyond Data about Data:

    The Litigators Guide to Metadata, Craig Ball, trial lawyer andspecial master of ESI for numerous Federal and State courts,

    states, Fail to preserve metadata at the earliest opportunity

    and you may never be able to replicate what was lost.iii

    Proper preservation is important because metadata includesmore than the data within the file itself. System data, like afiles name and location, size, creation, modification and usage

    are also important to assess tampering, for example.

    It is important that both sides understand the potentialimpact of metadata, if any, on the case. Without sufficient evi-

    dence that metadata is relevant, the court may not grant re-quests for it to be produced. Take Dahl v. Bain Capital Partners,

    LLC,2009 U.S. Dist. LEXIS 52551 (D. Mass. Jun. 22, 2009), where

    the requestor sought all metadata associated with emails and

    Metadata Preservation/Production

    Page 3 - The Reality of Native Format Production and Redactio

    Metadata can show history of places the document has been stored. This

    example is from a British dossier on Iraqs security infrastructure and reveals

    that the document was compiled by copying content from outside docu-

    ments, including a post-graduate student.iv

    There is clear case law supporting theproduction of financial spreadsheets in their native format...

  • 8/12/2019 20100628 IGC Native Format

    4/6

    Word documents produced by the producers. Bain Capitalresponded by producing just 12 fields of metadata, which the

    court supported stating that many courts have expressedreservations about the utility of metadata and ultimately find-

    ing that:

    Rather than a sweeping request for metadata, [requestors]

    should tailor their requests to specific word documents,

    specific emails or specific sets of email, an arrangement that,

    according to their memorandum, suits [producers]. This morefocused approach will, the court hopes, reduce the parties

    costs and work. Furthermore, it reflects the general uneasi-

    ness that courts hold over metadatas contribution in assuring

    prudent and efficient litigation.

    The issue of metadata should be discussed up front or the op-posing side may have grounds to claim spoliation or request

    re-production. Look at Bray & Gillespie Mgmt. LLC v. LexingtonIns. Co., 2009 WL 546429 (M.D. Fla. Mar. 4, 2009). Lexington

    requested that Bray & Gillespie (B&G) produce data in nativeformat, and B&G did not comply. While B&G converted theirnative files to TIFF and stored the files metadata separately,

    they gave Lexington only the TIFFs and held back the meta-data. When it was discovered that the metadata had been

    preserved and that they had violated Lexingtons nativeformat request, B&G was subject to sanctions and the courtsordered them to produce the metadata

    to Lexington anyway.

    Discussions about production format,

    metadata and redaction should occur atthe meet and confer session. FRCP rule 26(f), the meet and

    confer rule, requires parties to meet at an early stage in thelitigation process to discuss what information they have and

    how they will share it. Unfortunately, according to OConnor,the meet and confer process often gets short-changed orskipped entirely. This leaves the producing party exposed to

    potentially costly and unexpected demands for native formatslater in the process, perhaps after already producing in PDF

    or TIFF.

    InCovad Communications Co. v. Revonet, Inc., 2008 U.S. Dist.LEXIS 104204 (D.D.C. Dec. 24, 2008), the court ordered theproducer to re-produce data in electronic format after having

    produced it in hard copy, but it ordered the two parties toshare the $4,000 cost of privilege review, concluding:

    This whole controversy could have been eliminated had [re-

    questor] asked for the data in native format in the first place or

    had [producer] asked [requestor] in what format it wanted the

    data before it presumed that it was not native. Two thousand

    dollars is not a bad price for the lesson that the courts have

    reached the limits of their patience with having to resolve

    electronic discovery controversies that are expensive, time

    consuming and so easily avoided by the lawyers conferring

    with each other on such a fundamental question as the format

    of their productions of electronically stored information.

    In a perfect world, the meet and confer would be thorough,

    civil and productive, and parties would clearly understandwhat they were requesting or expected to produce, but this is

    not often the case.

    Browning Marean, Senior Counsel at DLA Piper, notes that Atthese meet and confer sessions, it is crucial to have knowl-

    edgeable technical people present who are familiar with the

    discovery data and who know about file formats, productionand redaction. Without an IT representative who understands

    issues around all document types that may be discoverable,such as the feasibility of producing the entire volume of data,

    and the possible concerns around privacy and proprietaryinformation contained in the documents, any decision madecould be completely unrealistic.

    The meet and confer is not only important for determining

    production formats, but it is also an opportunity to be up frontabout what data will be held back for cause of privilege orprivacy. Marean suggests that parties agree on what data the

    requestor is entitled to and be clear about how it will be madeavailable. If a large H.R. database contains social security

    numbers which clearly constitute privacy information, explainto the other side and tell them which tables youll make avail-

    able to them and in what format, and explain what has been

    redacted or omitted and why. Give them what theyre entitledtono more, no less.

    Even judges are pleading for parties to conduct the meet

    and confer, since the lack of it can cause an unfortunatedomino effect that wastes a lot of time and money. InAguilar

    v. Immigration and Customs Enforcement Div. of the U.S. Dept.

    of Homeland Sec., 2008 WL 5062700 (S.D.N.Y. Nov. 21, 2008),Magistrate Judge Frank Maas emphasized the importance of

    the meet and confer meeting, saying This lawsuit demon-

    strates why it is so important that parties fully discuss their

    ESI early in the evolution of the case. Had that been done, thedefendants might not have opposed the plaintiffs requests fo

    certain metadata. Moreover, the parties might have been ableto work out many, if not all, of their differences without court

    involvement or additional expense, thereby furthering thejust, speedy and inexpensive determination of this case.

    Page 4 - The Reality of Native Format Production and Redaction

    Importance of the Meet and Confer

    The issue of metadata should be discussed up front...

    "Give them what they're entitled to

    no more, no less." - Browning Marean

  • 8/12/2019 20100628 IGC Native Format

    5/6

    Redaction, the removal of privileged orprivacy data from documents, representsanother problem that arises in producing native documents.

    The typical method of redaction has been to print the docu-ments, use a black marker to mask the information, then

    photocopy the marked-up pages several times to ensure com-plete obscuration before re-scanning back into the system. As

    the volume of ESI has escalated, this method of redaction iscumbersome, expensive and even unrealistic given deadlines

    to produce.

    Electronic redaction involves redacting documents using a

    computer application like Adobe Acrobat or InformativeGraphics Redact-It. Such tools can save significant amounts

    of time by allowing users to search for privileged phrases orautomatically find privacy information, and they generally

    create a new, redacted rendition of the original document inTIFF or PDF format. Clearly, the biggest advantage is for text-

    searchable formats like PDF.

    But how do you perform electronic redaction when native

    format is required? Redaction, by its nature, changes thedocument and must be saved to a new version, regardless

    of format. According to Ball, The very nature of redaction isalteration of the document. No mechanism that you could usemaintains an identical hash. (A hash value is a unique identify-

    ing number that defines each digital file and is often used forforensic purposes to validate a documents authenticity.) You

    start with the original document, select areas for redaction,and output the final, redacted version to a new document

    with necessarily different metadata (e.g., time stamps and au-

    thor) and a new hash. No matter which format you choose forthe redacted document, the redacted version must be tracked

    and managed in addition to the unredacted original.

    Spreadsheets pose unique challenges to redaction. Ball

    notes, Native redaction may be best realized in spreadsheetsbecause you have the ability to remove whole categories offielded information by row or column. But because spread-

    sheet data often entails dependenciesvalues that changebased on other valuesredaction can have unforeseen

    consequences if implemented incautiously. Disclosure is es-sential, so lawyers understand they are dealing with an altered

    document. There are no commercial tools to redact spread-sheets in their native format at this time, so its typically doneusing the native applications. Spreadsheets require careful

    attention during the meet and confer.

    Page 5 - The Reality of Native Format Production and Redact

    Electronic Redaction

    Summary

    Socha cautions that with true native redaction, unless yourecareful and savvy, youll likely change things you dont really re

    alize youre changing. He notes, This can break things. If thereare formulas in a spreadsheet, you can ruin the document. Key

    numbers can disappear or change as part of a chain reaction.He continues, There are a series of challenges that differ fromapplication to application; the industry hasnt yet determined

    how to address native redaction. I have yet to see a proprietarapproach or a broadly accepted method. True native review

    can be done with XML versions of Word, but needs to be doneby someone who knows how.

    Redacting to a near-native or near-paper format is still the mos

    prevalent method. Converting native files to TIFF or PDF andredacting them is safe, convenient, inexpensive and allowsyou to use reliable tools that are proven. Should a particular

    document be called into question, you can always produce theoriginal source file with metadata completely intact.

    What is clear, however, is that the meet and confer is the best,

    and in fact the required, setting to discuss issues of produc-tion formats, redaction and metadata so they do not end up

    being decided by the court. Lawyers need to understand morabout the inner workings of the technology or need to bringalong IT personnel that can help them navigate successfully

    through the preliminary stages. Attorneys needs to take eDis-covery technology training seriously and educate themselves

    to protect their own interests and that of their clients. It isirresponsible, perhaps even malpractice, for them to think anddo otherwise.

    Electronic discovery often presentscomplex problems with only partial

    solutions. Parties need to understand that dictating the sameformat across the board may not suit their ultimate goals. Eac

    data set is different, and what must be revealed and concealedis unique for each case. By gaining a greater understanding of

    file formats, electronic redaction tools and metadata, lawyerscan do right by their clients and can avoid regrettable situa-tions in the courtroom which can range from embarrassment

    to sanctions to permanent damage to reputation of the firmand the client. Dont take these risks unnecessarily. Take the

    time and bring the IT talent to the table to have a quality meetand confer. After all,

    an ounce of

    prevention is wortha pound of cure.

    The very nature of redaction

    is alteration of the document.

    -Craig Ball

  • 8/12/2019 20100628 IGC Native Format

    6/6Page 6 - The Reality of Native Format Production and Redact

    iCommittee on the Judiciary.Federal Rules of Civil Procedure. 111th Cong

    1st sess., 2009.www.uscourts.gov/rules/EDiscovery_w_Notes.pdf

    iiA Project of The Sedona Conference Working Group on Electronic

    Document Retention & Production (WG1).The Sedona Principles: SecondEdition, Best Practices Recommendations & Principles for Addressing Elec-tronic Document Production. June 2007.www.thesedonaconference.org/content/miscFiles/TSC_PRINCP_2nd_ed_607.pdf

    iiiCraig Ball. Beyond Data about Data: The Litigators Guide to Metadata.

    2005.www.craigball.com/metadata.pdf

    ivNerino Petro and Bryan Sims, Avoiding ethical pitfalls with electronic

    documents: Part 1 metadata,The State Bar of Wisconsin Inside Track.June 16 2010.

    http://www.wisbar.org/AM/Template.cfm?Section=InsideTrack&Templat

    e=/CustomSource/InsideTrack/contentDisplay.cfm&ContentID=93679

    Copyright 2010 Informative Graphics Corporatio

    About the Author

    Christine Musil is Director of Marketing at Informative Graphics, a leading developer of

    commercial software products for secure content viewing, collaboration and redaction.Founded in 1990, Informative Graphics products are deployed by thousands of

    corporations worldwide.

    For more information,

    please contact:

    Informative Graphics Corp4835 E. Cactus Rd

    Scottsdale, AZ 85254Phone: 800.398.7005 (intl +1.602.971.6061

    URL: www.infograph.comeMail: [email protected]