Our next paper — OpenEDGAR – Open Source Software for SEC Edgar Analysis is now available. This paper explores a range of #OpenSource tools we have developed to explore the EDGAR system operated by the US Securities and Exchange Commission (SEC). While a range of more sophisticated extraction and clause classification protocols can be developed leveraging LexNLP and other open and closed source tools, we provide some very simple code examples as an illustrative starting point.
Click here for Paper: < SSRN > < arXiv >
Access Codebase Here: < Github >
Abstract: OpenEDGAR is an open source Python framework designed to rapidly construct research databases based on the Electronic Data Gathering, Analysis, and Retrieval (EDGAR) system operated by the US Securities and Exchange Commission (SEC). OpenEDGAR is built on the Django application framework, supports distributed compute across one or more servers, and includes functionality to (i) retrieve and parse index and filing data from EDGAR, (ii) build tables for key metadata like form type and filer, (iii) retrieve, parse, and update CIK to ticker and industry mappings, (iv) extract content and metadata from filing documents, and (v) search filing document contents. OpenEDGAR is designed for use in both academic research and industrial applications, and is distributed under MIT License at https://github.com/LexPredict/openedgar
Save the DATE ! August 9, 2018 – we will be hosting the BLOCK (Legal) Tech Conference @ Illinois Tech – Chicago Kent College of Law.
Tickets are FREE but registration is required — – http://blocklegaltech.com/
#BlockChain #CryptoInfrastructure #FinTech #LegalTech #ICO
Paper Abstract – LexNLP is an open source Python package focused on natural language processing and machine learning for legal and regulatory text. The package includes functionality to (i) segment documents, (ii) identify key text such as titles and section headings, (iii) extract over eighteen types of structured information like distances and dates, (iv) extract named entities such as companies and geopolitical entities, (v) transform text into features for model training, and (vi) build unsupervised and supervised models such as word embedding or tagging models. LexNLP includes pre-trained models based on thousands of unit tests drawn from real documents available from the SEC EDGAR database as well as various judicial and regulatory proceedings. LexNLP is designed for use in both academic research and industrial applications, and is distributed at https://github.com/LexPredict/lexpredict-lexnlp
The migration from analog to digital to computational contracts is the future arc for the entire field …. (most contracts are barely even digital today) … click here to link to post
Yes – there is no easy button – but far more streamlining is coming and higher order work streams are on the march in data science …
Another version of what we document in our paper … Daniel Katz, Michael J Bommarito II, Tyler Sollinger & Jim Chen, Law on the Market? Abnormal Stock Returns and Supreme Court Decision-Making
Academic Tour Continues – tomorrow I will be giving a talk at Bar Ilan University here in Tel Aviv at their Law & Big Data Workshop – it is looks like an good agenda with proper scientific papers with technical results / or discussions about methodology. #LegalScience #LegalData #LegalInformatics
Tomorrow – it is my great pleasure to deliver the Keynote Address at one the first #LegalTech events in Lithuania. The event will be opened by the Mayor of Vilnius – Remigijus Šimašius and Lyra Jakulevičienė (Dean of the Mykolas Romeris University Law School). LegalTech is a global phenomenon!
We are announcing a new open source offering – OpenEDGAR, for building databases using the #SEC #EDGAR database. Press release here ! See you on Github.
It is my great pleasure to visit with Primerus and its associated law firms and deliver an address at its 2018 Annual PDI Convocation.
Today I am UConn Law speaking at a Conference entitled – Evaluating Litigation Risk in the 21st Century. Thanks to Alexandra Lahav and the UConn Insurance Law Center for hosting me today!
CLOC Panel – Legal AI in Real Life — (Cisco, Liberty Mutual, Spotify) (I was filling in for Julian T. from Google) — I discussed the contract analytics project we are undertaking with Cisco / Elevate and other applied A.I. / Analytics Projects that the LexPredict Team is undertaking with corporate legal departments !