The Data-Production Dispositif

dc.contributor.authorMiceli, Milagros
dc.contributor.authorPosada, Julian
dc.date.accessioned2023-09-19T15:38:48Z
dc.date.available2023-09-19T15:38:48Z
dc.date.issued2022
dc.description.abstractMachine learning (ML) depends on data to train and verify models. Very often, organizations outsource processes related to data work (i.e., generating and annotating data and evaluating outputs) through business process outsourcing (BPO) companies and crowdsourcing platforms. This paper investigates outsourced ML data work in Latin America by studying three platforms in Venezuela and a BPO in Argentina. We lean on the Foucauldian notion of dispositif to define the data-production dispositif as an ensemble of discourses, actions, and objects strategically disposed to (re)produce power/knowledge relations in data and labor. Our dispositif analysis comprises the examination of 210 data work instruction documents, 55 interviews with data workers, managers, and requesters, and participant observation. Our findings show that discourses encoded in instructions reproduce and normalize the worldviews of requesters. Precarious working conditions and economic dependency alienate workers, making them obedient to instructions. Furthermore, discourses and social contexts materialize in artifacts, such as interfaces and performance metrics, limiting workers' agency and normalizing specific ways of interpreting data. We conclude by stressing the importance of counteracting the data-production dispositif by fighting alienation and precarization, and empowering data workers to become assets in the quest for high-quality data.
dc.identifier.citationMiceli, M., & Posada, J. (2022). The Data-Production Dispositif. Proceedings of the ACM on Human-Computer Interaction, 6(CSCW2), 1–37. https://doi.org/10.1145/3555561
dc.identifier.doihttps://doi.org/10.1145/3555561
dc.identifier.issn2573-0142
dc.identifier.urihttps://www.weizenbaum-library.de/handle/id/345
dc.language.isoeng
dc.rightsopen access
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectcrowdsourcing
dc.subjectdata labeling
dc.subjectdata production
dc.subjectdata work
dc.subjectmachine learning
dc.subjectplatform labor
dc.titleThe Data-Production Dispositif
dc.typeArticle
dc.type.statuspublishedVersion
dcmi.typeText
dcterms.bibliographicCitation.booktitleProceedings of the ACM on Human-Computer Interaction
dcterms.bibliographicCitation.doi10.1145/3555561
dcterms.bibliographicCitation.issueCSCW2
dcterms.bibliographicCitation.issueCSCW2
dcterms.bibliographicCitation.journaltitleProceedings of the ACM on Human-Computer Interaction
dcterms.bibliographicCitation.urlhttps://dl.acm.org/doi/10.1145/3555561
dcterms.bibliographicCitation.volume6
local.researchgroupDaten, algorithmische Systeme und Ethik
local.researchtopicDigitale Technologien in der Gesellschaft
Dateien
Originalbündel
Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
Miceli_Data-Production-Dispositif.pdf
Größe:
812.25 KB
Format:
Adobe Portable Document Format
Beschreibung: