This week's inaugural #GWOSCON was a fantastic conference about open source software. The slides for my presentation "Reproducible Data Science with Open Source Tools" are available on GitHub in my talks repository: github.com/jayqi/talks
27.03.2025 23:16
Great post! I appreciate the thorough exploration and discussion of different approaches.
Follow-up question: pandas (as of version 2.0) can use PyArrow as a backend instead of NumPy. Would using PyArrow's list data type address some of the shortcomings you identified for pandas?
26.02.2025 00:33
Open source and Python preacher. Feminist. Opinions are my own. she/her
Senior Data Scientist at BuzzFeed in San Francisco // AI content generation ethics and R&D // plotter of pretty charts
https://minimaxir.com
Dad of two cats. Funemployed ML engineer.
Enjoys learning, running/soccer/hiking/skiing, cooking/baking, and lurking on social media.
In PNW, from France.
I ❤️ open source. Working on LightGBM and RAPIDS.
Keep Chicago out your mouth.
he/him. This is me: https://github.com/jameslamb
Publishes http://commoncog.com. Tweets about books & the art of business, from the perspective of an operator. Also: https://warpcast.com/cedric
Data nerd, “recovering data scientist”, author, podcaster, occasional athlete
DevRel @observablehq.com, previously teaching faculty (Environmental Data Science) at UC Santa Barbara. PhD, Environmental Science and Management. Data science | R | data visualization | education | art | www.allisonhorst.com
Senior Principal Software Engineer at Red Hat AI | Open Source Leader at KServe, Argo, Kubeflow, Kubernetes, CNCF | Maintainer of XGBoost, TensorFlow | Keynote Speaker | Author | Technical Advisor
More info: http://terrytangyuan.xyz
We are hiring!
We are dedicated to fostering community for data visualization professionals.
DrivenData builds AI solutions for social good through data science competitions and our expert in-house project team.
Challenges - https://www.drivendata.org/
Data Consulting - https://drivendata.co/
Alexander von Humboldt Professor for AI and Chair for Societal Computing at Saarland University. Co-Director at https://i2sc.net.
Researcher trying to shape AI towards positive outcomes. ML & Ethics +birds. Generally trying to do the right thing. TIME 100 | TED speaker | Senate testimony provider | Navigating public life as a recluse.
Former: Google, Microsoft; Current: Hugging Face
Book: https://thecon.ai
Web: https://faculty.washington.edu/ebender
Data engineer in practice, data librarian at heart.
Knitter, dog momma, board gamer, and consumer of sci-fi/fantasy of all mediums.
Civic tech, data for good, peace/conflict data, and philosophizing thru data modeling
https://jennajordan.me
Software Engineer at Posit, PBC
https://fosstodon.org/@gaborcsardi
https://github.com/gaborcsardi
Writing modeling packages at @posit.co (née RStudio). Opinions are my own. https://max-kuhn.org/
tada⬢science ⬡⬡ ex(Posit/RStudio, ThinkR, Mango Solutions) ⬡⬡ role(Data Scientist, Software Engineer, R Expert) ⬡⬡
Chief Scientist @ Distributional.com @dbnlAI.bsky.social #MLSky #StatSky
Founder @ datascientific.com
Founder wimlds.org & co-founder rladies.org
PhD @ UC Berkeley
🏡 🌈 Oakland, California.
Visualisation and graphics @posit.co
Classic Generative Art Weirdo using 🖤 and R: http://thomaslinpedersen.art and http://deca.art/thomasp85
he/him