Transcription of Datasheets for Datasets
Datasheets for Datasets

Timnit Gebru 1, Jamie Morgenstern 2, Briana Vecchione 3, Jennifer Wortman Vaughan 1, Hanna Wallach 1, Hal Daumé III 1 4, Kate Crawford 1 5

9 Jul 2018

Abstract

The machine learning community has no standardized way to document how and why a dataset was created, what information it contains, what tasks it should and should not be used for, and whether it might raise any ethical or legal concerns. To address this gap, we propose the concept of Datasheets for Datasets. In the electronics industry, it is standard to accompany every com-

We therefore propose the concept of Datasheets for Datasets. In the electronics industry, every component is accompanied by a datasheet describing standard operating characteristics, test results, and recommended usage. By analogy, we recommend that every dataset be accompanied with a datasheet documenting its motivation, creation, composition, intended uses, distribution, maintenance, and other information. We anticipate that such datasheets will increase transparency and accountability in the machine learning community.
Section 2 provides context for our proposal. Section 3 discusses the evolution of safety standards in other industries, and outlines the concept of datasheets in electronics. We give examples of questions that should be answered in