Transcription of Datasheets for Datasets
{{id}} {{{paragraph}}}
Datasheets for Datasets Timnit Gebru 1 Jamie Morgenstern 2 Briana Vecchione 3 Jennifer Wortman Vaughan 1 Hanna Wallach 1. Hal Daum III 1 4 Kate Crawford 1 5. Abstract We therefore propose the concept of Datasheets for Datasets . The machine learning community has no stan- In the electronics industry, every component is accompanied dardized way to document how and why a dataset by a datasheet describing standard operating characteristics, was created, what information it contains, what test results, and recommended usage. By analogy, we rec- [ ] 9 Jul 2018. tasks it should and should not be used for, and ommend that every dataset be accompanied with a datasheet whether it might raise any ethical or legal con- documenting its motivation, creation, composition, intended cerns. To address this gap, we propose the con- uses, distribution, maintenance, and other information. We cept of Datasheets for Datasets .
machine learning systems exhibit accuracy disparities be-tween subpopulations, and calls for more diverse datasets, inclusive testing, and standards to address these disparities. 3.3. Electrical and Electronic Technologies Like datasets, electronic components are incorporated into a system whose larger goal may be far removed from the
Domain:
Source:
Link to this page:
Please notify us if you found a problem with this document:
{{id}} {{{paragraph}}}