The faculty and data scientists of DBMI are working at the nexus of software engineering, biological sciences, and the clinical practice of medicine.
This Data Portal is a growing catalog of our resources, including:
Our current data challenge is the multi-track 2019 n2c2/OHNLP Shared Task on Challenges in Natural Language Processing for Clinical Data, which continues the NLP work pioneered under the former i2b2 program.
The i2b2 data sets previously released as a result of this series of challenges dating back to 2006 will also be hosted here soon under their new moniker, n2c2.
Our full inventory of projects will be added over time. For now please check out the DBMI Github repository and this selection of resources from our faculty labs: