Characterizing Datasets and Building Better Models with Continued Pre-Training
Whatβs the most effective way to add new domain knowledge into an open LLM? A new blog post from my team covers experiments we did at the beginning of the year to start answering this question. It starts, unsurprisingly, with sweeping your learning rateβ¦ www.databricks.com/blog/charact...
25.11.2024 23:28 β π 22 π 8 π¬ 0 π 1
Applied mathematician specializing in data science and scientific computing. Tensor decomposition, numerical optimization, linear algebra, network science, randomized algorithms. https://mathsci.ai #math #tensors #randnla #tikz #texlatex
prev: @BrownUniversity, @uwcse/@uw_wail phd, ex-@cruise, RS @waymo. 0.1x engineer, 10x friend.
spondyloarthritis, cars ruin cities, open source
Staff Research Scientist at Google DeepMind. Artificial and biological brains π€ π§
Computational neuroscientist at the FMI.
www.zenkelab.org
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
Lecturer in Maths & Stats at Bristol. Interested in probabilistic + numerical computation, statistical modelling + inference. (he / him).
Homepage: https://sites.google.com/view/sp-monte-carlo
Seminar: https://sites.google.com/view/monte-carlo-semina
I like tokens! Lead for OLMo data at @ai2.bsky.social (Dolma π) w @kylelo.bsky.social. Open source is fun π€βοΈππ³οΈβπ Opinions are sampled from my own stochastic parrot
more at https://soldaini.net
Decision-making under uncertainty, machine learning theory, artificial intelligence Β· anti-ideological Β· Assistant Research Professor, Cornell
https://avt.im/ Β· https://scholar.google.com/citations?user=EGKYdiwAAAAJ&sortby=pubdate
a mediocre combination of a mediocre AI scientist, a mediocre physicist, a mediocre chemist, a mediocre manager and a mediocre professor.
see more at https://kyunghyuncho.me/
Senior Research Scientist at Google DeepMind. I β Optimization β© Machine Learning. Fan of IronMaidenπ€.Here to discuss research π€
Postdoc at CBS, Harvard University
(New around here)
AI, RL, NLP, Games Asst Prof at UCSD
Research Scientist at Nvidia
Lab: http://pearls.ucsd.edu
Personal: prithvirajva.com
Neuro + AI Research Scientist at DeepMind; Affiliate Professor at Columbia Center for Theoretical Neuroscience.
Likes studying learning+memory, hippocampi, and other things brains have and do, too.
she/her.
All I want in this life of mine is some good clean fun.
Waiting on a robot body. All opinions are universal and held by both employers and family.
Literally a professor. Recruiting students to start my lab.
ML/NLP/they/she.
Information and updates about RLC 2025 at the University of Alberta from Aug. 5th to 8th!
https://rl-conference.cc
CS Faculty at Mila and McGill, interested in Graphs and Complex Data, AI/ML, Misinformation, Computational Social Science and Online Safety
Research Director, Founding Faculty, Canada CIFAR AI Chair @VectorInst.
Full Prof @UofT - Statistics and Computer Sci. (x-appt) danroy.org
I study assumption-free prediction and decision making under uncertainty, with inference emerging from optimality.