Our paper offers design implications to support this, such as:
- Protocols to help data scientists identify minimum standards for validity and other criteria, tailored to their specific application context
- Tools designed to help data scientists identify and apply strategies more effectively
14.10.2025 14:54 β π 1 π 0 π¬ 1 π 0
The challenge for HCI, CSCW, and ML is not to *replace* these bricolage practices with rigid top-down planning, but to develop scaffolding that enhances the rigor of bricolage while preserving creativity and adaptability
14.10.2025 14:54 β π 2 π 0 π¬ 1 π 0
Yet from urban planning to software engineering, history is rife with examples where rigid top-down interventions have failed while bottom-up alternatives designed to better scaffold *existing* practices succeeded
14.10.2025 14:54 β π 1 π 0 π¬ 1 π 0
What do these findings mean for how we improve target variable construction going forward? We might be tempted to more stringently enforce a rigid "top-down planning approach" to measurement, in which data scientists more carefully define construct β design operationalization β collect data
14.10.2025 14:54 β π 1 π 0 π¬ 1 π 0
How do data scientists evaluate validity? They treat their target variable definition as a tangible object to be scrutinized. They "poke holes" in their definition then "patch" them. They apply a variety of "spot checks" to reconcile their theoretical understanding of a concept with observed labels
14.10.2025 14:54 β π 1 π 0 π¬ 1 π 0
Data scientists navigate this balancing act by adaptively applying (re)formulation strategies
For example, they use "swapping" to change target variables when the first has unanticipated challenges, or "composing" to capture complementary dimensions of a concept being captured in a target variable
14.10.2025 14:54 β π 1 π 0 π¬ 1 π 0
An illustration of the target variable construction process presented in our findings. During target variable construction, data scientists specify an initial prediction task based on their available data, then iteratively refine their prediction task by applying (re)formulation strategies. Data scientists proceed with their final prediction task if it satisfies all criteria, or discontinue their project if strategies are exhausted.
While engaging in bricolage, data scientists balance the validity of their target variable with other criteria, such as:
π‘ Simplicity
βοΈ Resource requirements
π― Predictive performance
π Portability
14.10.2025 14:54 β π 0 π 0 π¬ 1 π 0
We find that target variable construction is a *bricolage practice*, in which data scientists creatively "make do" with the limited resources at hand
14.10.2025 14:54 β π 1 π 0 π¬ 1 π 0
To explore this tension, we interviewed 15 data scientists from education and healthcare sectors to understand their practices, challenges, and perceived opportunities for target variable construction in predictive modeling
14.10.2025 14:54 β π 1 π 0 π¬ 1 π 0
Traditional measurement theory assumes a top-down workflow, where data is collected to fit a study's goals (define construct β design operationalization β collect data)
In contrast, data scientists are often forced to reconcile their measurement goals with *existing* data
14.10.2025 14:54 β π 1 π 0 π¬ 1 π 0
A subtle aspect of predictive modeling is target variable construction: the process of translating a latent, unobservable concept like "healthcare need" into a prediction target
But how does target variable construction unfold in practice, and how can we better support it going forward? #CSCW2025 π§΅
14.10.2025 14:54 β π 6 π 1 π¬ 1 π 1
Cella M. Sum β
β¨Iβm on the academic job market β¨
Iβm a PhD candidate at @hcii.cmu.edu studying tech, labor, and resistance π©π»βπ»πͺπ½π₯
I research how workers and communities contest harmful sociotechnical systems and shape alternative futures through everyday resistance and collective action
More info: cella.io
09.10.2025 14:39 β π 61 π 31 π¬ 3 π 4
Screenshot of the CSCW 2025 paper "The Future of Tech Labor: How Workers are Organizing and Transforming the Computing Industry"
CELLA M. SUM, Carnegie Mellon University, USA
ANNA KONVICKA, Princeton University, USA
MONA WANG, Princeton University, USA
SARAH E. FOX, Carnegie Mellon University, USA
Abstract: The tech industryβs shifting landscape and the growing precarity of its labor force have spurred unionization efforts among tech workers. These workers turn to collective action to improve their working conditions and to protest unethical practices within their workplaces. To better understand this movement, we interviewed 44 U.S.-based tech worker-organizers to examine their motivations, strategies, challenges, and future visions for labor organizing. These workers included engineers, product managers, customer support specialists, QA analysts, logistics workers, gig workers, and union staff organizers. Our findings reveal that, contrary to popular narratives of prestige and privilege within the tech industry, tech workers face fragmented and unstable work environments which contribute to their disempowerment and hinder their organizing efforts. Despite these difficulties, organizers are laying the groundwork for a more resilient tech worker movement through community building and expanding political consciousness. By situating these dynamics within broader structural and ideological forces, we identify ways for the CSCW community to build solidarity with
tech workers who are materially transforming our field through their organizing efforts.
What can #CSCW learn from tech workers who have been involved in collective action and unionization about how to make transformative change within our field?
My new #CSCW2025 paper with Mona Wang, Anna Konvicka, and Sarah Fox seeks to answer this question.
Pre-print: arxiv.org/pdf/2508.12579
28.08.2025 14:14 β π 43 π 17 π¬ 3 π 4
You are eligible to participate if you have experience designing evaluations that use both (1) an LLM-as-a-judge and (2) a rubric to rate GenAI outputs. We welcome participants from all professional roles. Participants must be 18+ and be located in the U.S.
19.08.2025 19:46 β π 0 π 0 π¬ 0 π 0
Have you built a generative AI evaluation that uses an LLM-as-a-judge and a rubric to rate model outputs?
Sign up for a 45-minute Zoom session to provide feedback on a new tool for building trustworthy evals.
Learn more at tinyurl.com/llm-as-a-judge - receive $35 for participating in a session!
19.08.2025 19:46 β π 0 π 0 π¬ 1 π 0
phd fellow at university of oslo researching public sector digitalization
contributing editor @culanth's anthropod
anthropology, data studies, work, technology, sts, informatics
#academicsky
A Human-Robot Interaction (HRI) scholar focused on building effective, ethical, and robust Human-Robot collaborations.
Specific foci are: Trust Management, Personality, Individual Differences, and Open Science, and Methodology in Robotics.
PhD @ UMich School of Information β’ researching information manipulation online - particularly on wikipedia, rumble, + tiktok β’ https://laurakurek.github.io/
Design for scientific discovery, robotics and data visualization at the Space Science Institute. Former NASA/JPL, CMU HCII
Professor studying AI and work at Syracuse University.
political scientist @ University of ZΓΌrich
platforms, political violence, social media, computing
research & writing:
henryhenryhenry.com
sometimes land of enchantment, sometimes land of Helvetia
PhD student at the University of Notre Dame studying online communities & social media β¨ previously at @uwischool.bsky.social, @socialfutureslab.bsky.social, & UW CIP and intern with Adobe Research
Computer Scientist. Professor @ Loyola University Chicago. Studying cyberbullying detection, data systems, big data, ML. #BullyBlocker #DBSnap #Latino #firstgen
Assistant Professor in Communication Studies at Northwestern. HCI, Human-Centered AI, AI literacy, co-creativity. Director of Creative Interfaces Research + Design Studio. durilong.com
designer, researcher, ux; dark patterns, ethics, and technology (https://uxp2.com); associate professor and HCI/d director at IU Bloomington; they/them π³οΈβπ
still chasing the high that was space tw*tter circa 2018 π
year 5 phd studying advocacy technology at @berkeleyischool.bsky.social
formerly @aclum.bsky.socialπ½, @stsci.edu π°οΈ
laurenmarietta.github.io
Professor, University of Georgia.
Director, Center for Advanced Computer-Human Ecosystems (CACHE). https://www.ugavr.com
#VR #AR #research #behaviorchange #communication #technology #metaverse
Asst. Prof. of Computer & Data Science, Brown University | Visiting Scholar at The Petrie-Flom Center, Harvard Law School | Faculty Associate, Berkman Klein Center, Harvard | HCI, Computer Security, Privacy, Policy, and Wellbeing
Ph.D. @ CMU HCII. Developing tools and processes to support Responsible AI practices on the ground.
Currently focusing on AI red-teaming, auditing, and impact assessment.
Prev. Microsoft Research, Berkeley EECS.
https://www.wesleydeng.com
Director and Podcast Host @AI for All Tomorrows. Post-Doctoral Researcher @CU Boulder. TEDx Speaker. Researcher of AI safety, mental health, and end-of-life care. Former Co-Host @Radical AI Podcast
PhD student @ Cornell info sci | Sociotechnical fairness & algorithm auditing | Previously Stanford RegLab, MSR FATE, Penn | https://emmaharv.github.io/
We apply HCI methods to solve real-world problems. We are the Human-Computer Interaction Institute at Carnegie Mellon University. #cmuhcii
π΄ https://hcii.cmu.edu/
Assistant prof at UMN CS. Human-centered AI, online communities, risky mental health behaviors. Mom, lifter, nerd, haver of opinions.
Assistant Professor at UW-Madison