The East Coast eDiscovery & IG Retreat is a new addition to the series of retreats held by Chris La Cour’s company, Ing3nious. It was held at the Chatham Bars Inn in Chatham, Massachusetts. This is the fourth year Chris has been organizing retreats, with the number of retreats and diversity of themes increasing in recent years, but this is the first one held outside of California. As always, the venue was beautiful (more photos here), and the conference was informative and well-organized. My notes below only capture a small amount of the information presented. There were often two simultaneous sessions, so I couldn’t attend everything.
Keynote: Information Governance in a Predictive World
Big data isn’t just about applying the same old analysis to more data. It’s about real-time or near-real-time, action-oriented analysis. It allows asking new questions. Technology now exists to allow police to scan license plates while driving through a parking lot to check for stolen cars. Amazon may implement predictive shipping, where it ships a product to a customer before the customer orders it, which requires predictions with high confidence. Face recognition technology is already in use at airports to track who is entering and whether they belong there. When the speaker tweeted a complaint about an airline, he got a personalized reply from the airline on Twitter within 30 seconds, thanks to technology.
Proactive information governance will allow problems (sexual harassment, fraud) to be detected immediately so they can be corrected, instead of finding out later when there is a lawsuit. It is possible to predict when someone will leave a company: they may become short with people, or spend more time on LinkedIn.
Change will come to the Federal Rules of Civil Procedure in 2015. Rule 37(e) on sanctions for spoliation will change from “willful or bad faith” to “willful and bad faith” to encourage more deletion. Have a policy to delete as much as possible, follow it, and be prepared to prove that you follow it.
Case Study: The Swiss Army Knife Approach to eDiscovery
Predictive coding failed for a case where the documents were very homogeneous (keywords didn’t work either; the document set was large, but not too large for eyes-on review). It also failed when there were too many issue codes. Predictive coding had problems with Spanish documents where there were many different dialects. Predictive coding works well for prioritizing documents when there is a quick deposition schedule.
Email domain name filtering can be used to remove junk or to detect things like gmail accounts that may be relevant. Also, look for gaps in dates for emails — it could be that the person was on vacation, or maybe something was removed.
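As a rough illustration of both ideas, here is a minimal Python sketch. The function names, the sample addresses, and the seven-day gap threshold are my own assumptions for illustration; they are not something presented in the session, and a real review platform would do this on extracted email metadata.

```python
from collections import Counter
from datetime import date

def domain_counts(addresses):
    """Count emails per sender domain (lower-cased), skipping malformed addresses."""
    return Counter(a.rsplit("@", 1)[-1].lower() for a in addresses if "@" in a)

def date_gaps(dates, min_gap_days=7):
    """Return (last_seen, next_seen) pairs where consecutive email dates
    are more than min_gap_days apart."""
    ordered = sorted(set(dates))
    return [(a, b) for a, b in zip(ordered, ordered[1:])
            if (b - a).days > min_gap_days]

# Hypothetical sample data
senders = ["alice@example.com", "bob@gmail.com", "carol@GMAIL.com"]
sent_on = [date(2014, 1, 1), date(2014, 1, 2), date(2014, 1, 20)]

print(domain_counts(senders).most_common())
print(date_gaps(sent_on))
```

A gap flagged this way is only a prompt for a question to the custodian; it does not say whether the cause was a vacation or a deletion.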
Clustering or near-dupe is useful to ensure consistency of redactions.
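To make the redaction check concrete, here is a small sketch that flags near-duplicate pairs where one document is redacted and the other is not. It uses Jaccard similarity over three-word shingles, which is one common near-dupe technique and not necessarily what any particular vendor uses; the function names and the 0.8 threshold are assumptions.

```python
from itertools import combinations

def shingles(text, k=3):
    """Set of overlapping k-word sequences from the text."""
    words = text.lower().split()
    if len(words) < k:
        return {tuple(words)}
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}

def jaccard(a, b):
    """Jaccard similarity of the shingle sets of two texts (0.0 to 1.0)."""
    sa, sb = shingles(a), shingles(b)
    return len(sa & sb) / len(sa | sb)

def inconsistent_redactions(docs, threshold=0.8):
    """docs: list of (doc_id, text, is_redacted).
    Return ID pairs that are near-dupes but differ in redaction status."""
    return [(id1, id2)
            for (id1, t1, r1), (id2, t2, r2) in combinations(docs, 2)
            if r1 != r2 and jaccard(t1, t2) >= threshold]
```

The pairwise comparison is quadratic in the number of documents, so this is only suitable for small sets; it is meant to show the idea, not to scale.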
There is a benefit to reviewing the documents yourself to understand the case. Not a fan of producing documents you haven’t seen.
May need to do predictive coding or clustering in a foreign country to minimize the amount of data that needs to be brought to the U.S.
Talk to custodians about acronyms. Look at word lists — watch for unusual words, or how word usage corresponds to timing of events.
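A word list of this kind is easy to prototype. The sketch below is my own illustration, with hypothetical function names and a deliberately crude tokenizer: it lists words that occur in very few documents (candidates for code names or jargon) and counts how often a term appears per month.

```python
import re
from collections import Counter

def tokenize(text):
    """Lower-case the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())

def rare_words(texts, max_docs=1):
    """Words that appear in at most max_docs documents."""
    doc_freq = Counter()
    for text in texts:
        doc_freq.update(set(tokenize(text)))
    return sorted(w for w, n in doc_freq.items() if n <= max_docs)

def monthly_counts(dated_texts, term):
    """dated_texts: iterable of (date, text).
    Return {(year, month): occurrences of term}, in chronological order."""
    counts = Counter()
    for d, text in dated_texts:
        counts[(d.year, d.month)] += tokenize(text).count(term.lower())
    return dict(sorted(counts.items()))
```

Comparing the monthly counts for a suspicious term against the case timeline is one way to see whether its usage lines up with key events.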
Real-World Impact of Information Governance on the eDiscovery Process
I couldn’t attend this one.
Future-think: How Will eDiscovery Be Different in 5-10 Years?
Cost cutting: Cull in house, and targeted collections. The big obstacle is getting people to learn technology. Some people still print email to read it.
Analysis of cases is accelerating. Even small cases are impacted by technology. For example, information about a car accident is recorded. In the future, ESI will be collected and a conclusion will be reached without a trial. Human memory is obsolete — everything will be ESI.
Personal devices may be subject to discovery. Employment agreements should make that clear in advance.
Privacy will be a big legal field. Information is even collected about children — what they eat at school and when they are on the bus.
Will schools be held liable for student loans if the school fails to predict that the student will fail?
There are concerns about security of the cloud.
Businesses should demand change to make things more efficient, like arbitration.
Information Governance – Teams, Litigation Holds and Best Practices
I couldn’t attend this one.
Recent Developments in Technology Assisted Review — Is TAR Gaining Traction?
Three panelists said predictive coding didn’t work for them for identifying privileged documents. One panelist (me) said he had a case where it worked well for priv docs. Although I didn’t mention it at the time since I wouldn’t be able to get the reference right off of the top of my head, there is a paper with a nice discussion about the issues around finding priv docs that also claims success at using predictive coding.
What level of recall is necessary? There seemed to be consensus that 75% was acceptable for most purposes, but people sometimes aim higher, depending on the circumstances (e.g., to ward off objections from the other side).
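For readers unfamiliar with the term, recall is the fraction of all relevant documents that the process actually found. One common way to estimate it (an assumption on my part, not something the panel prescribed) is to review a random sample of the documents that were not selected and extrapolate how many relevant documents were missed. The function and parameter names below are hypothetical.

```python
def estimate_recall(found_relevant, discard_size, sample_size, sample_relevant):
    """Point estimate of recall from a random sample of the discard pile.

    found_relevant:  relevant documents identified for production
    discard_size:    number of documents that were not selected
    sample_size:     how many discarded documents were sampled and reviewed
    sample_relevant: how many of the sampled documents turned out relevant
    """
    if sample_size <= 0:
        raise ValueError("sample_size must be positive")
    estimated_missed = discard_size * sample_relevant / sample_size
    total_relevant = found_relevant + estimated_missed
    return found_relevant / total_relevant if total_relevant else 0.0

# Hypothetical numbers: 7,500 relevant found; 50 of 2,000 sampled discards relevant
print(estimate_recall(7500, 100000, 2000, 50))  # 0.75
```

This is a point estimate only; a sample of this kind carries a margin of error, so a defensible process would also report a confidence interval.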
Is it OK to use keyword search to cull down the document population before applying predictive coding? Must be careful not to cull away too many of the relevant documents (e.g., the Biomet case).
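The arithmetic behind that caution is simple: any relevant documents removed by the keyword cull are never seen by predictive coding, so the overall recall is the product of the two stages. The numbers below are made up for illustration and do not come from any case.

```python
def overall_recall(cull_recall, tar_recall):
    """Recall of a two-stage process: keyword cull, then predictive coding.

    cull_recall: fraction of relevant documents that survive the keyword cull
    tar_recall:  recall achieved by predictive coding on the culled set
    """
    return cull_recall * tar_recall

# Hypothetical: keywords keep 80% of relevant docs, TAR then finds 75% of those
print(round(overall_recall(0.80, 0.75), 2))  # 0.6
```

In other words, reporting 75% recall measured only on the culled set can overstate how much of the relevant material in the original collection was found.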
There is a lot of concern about being required to turn over training documents (especially non-responsive ones) to the other side. I pointed out that it is not like turning over search terms. It is very clear whether or not a document matches a search query, but disclosing training documents does not tell what predictions a particular piece of predictive coding software will give. In fact, some software will (hopefully, rarely) fail to produce documents that are near-dupes of relevant training documents, so one should not assume that the disclosure of training documents guarantees anything about what will be produced. There was concern that disclosure of non-relevant training documents by some parties will set a bad precedent.
Top 5 Trends in Discovery for 2014
I couldn’t attend this one.
Recruiting the Best eDiscovery Team
Cybersecurity is a concern. It is important to vet service providers. Many law firms are not as well protected as one would like.
When required to give depositions about e-discovery process, paralegals can do well. IT people tend to get stressed. Lawyers can be too argumentative.
Need a champion to encourage everyone to get things done.
Legal hold is often drafted by outside counsel but enforced by in-house counsel.
Don’t have custodians do their own self-collection (e.g., based on search terms), but may have IT do collection (less expensive than using an outside consultant, but must be able to explain what they did).
Information governance and changes to FRCP will reduce costs over the next five years.