Contribute Media
A thank you to everyone who makes this possible: Read More

CodaLab: A New Service for Data Exchange, Code Execution, Benchmarks & Reproducible Research

Summary

Learn, Share and Collaborate with CodaLab -- A new open source platform which lets communities create and explore experiments together and engage in benchmarking and competitions to enable true reproducibility and advance the state of the art in data-driven research

Description

CodaLab is a web-based open source platform that allows researchers to share and browse code, data, and create experiments in a truly reproducible manner. CodaLab focuses on accomplishing the following:

  • Serve as a data repository for data sets including large scale data sets that could only be hosted in a cloud computing environment
  • Serve as an algorithm repository that researchers can use in their experimentation, to teach and learn from others
  • Host the execution of experiments as worksheets, sometimes referred to as "executable papers", which are annotated scientific documents that combine textual process descriptions with live data sets and functioning code.
  • Enable the creation of benchmarks

CodaLab is a community-driven effort led by Percy Liang from Stanford University who built the precursor of CodaLab, namely, MLComp. From a development viewpoint CodaLab supports both the Linux and Windows communities with code in GitHub and Python as one of the main language used to support the scientific community.

At SciPi, we invite the community to participate in CodaLab by creating experiments as executable papers and by sharing them with the rest of the community at http://codalab.org. These worksheets or "executable papers" can then be freely reproduced, appended, and otherwise modified to improve productivity and accelerate the pace of discovery and learning among data-driven scientific professionals.

Details

Improve this page