for i, (input, target) in enumerate(data):
output = model(input)
loss = loss_fn(output, target)
loss = loss / iters_to_accumulate
loss.backward()
if (i + 1) % iters_to_accumulate == 0:
optimizer.zero_grad()
This is just your standard gradient accumulation?
for i, (input, target) in enumerate(data):
output = model(input)
loss = loss_fn(output, target)
loss = loss / iters_to_accumulate
loss.backward()
if (i + 1) % iters_to_accumulate == 0:
optimizer.zero_grad()
19.12.2024 19:02 β π 2 π 0 π¬ 2 π 0
Still a bit confused on when to use Illuminate vs NotebookLM for getting an audio overview of papers. Currently using Illuminate.
14.12.2024 22:51 β π 1 π 0 π¬ 1 π 0
I find 'uv sync' so fast that I would just change the version in project.toml and .python-version and sync again. The fact that the env is tied to a directory may sometimes be a negative but it also ensures all package versions are tracked in git.
04.12.2024 06:25 β π 0 π 0 π¬ 0 π 0
What seems to be currently the best approach for depth estimation, diffusion models or "old-school" discriminative models? Both seem to claim SOTA models nowadays?
02.12.2024 14:33 β π 1 π 0 π¬ 1 π 0
Advent of Code 2024
This was my tenth(!) year building 25 days of puzzles for #AdventOfCode. You can solve them all for free! Most people write code to solve them, but you can solve them however you like. I hope they help people become better programmers. π
The first puzzle comes out in two hours: adventofcode.com
01.12.2024 02:57 β π 1129 π 208 π¬ 61 π 22
Every once in a re-read Joseph Redmon's YOLOv3 paper. That was really a work of art..
"Sometimes you just kinda phone it in for a year, you know? I didnβt do a whole lot of research this year. [...] I managed to make some improvements to YOLO. But, honestly, nothing like super interesting"
29.11.2024 15:39 β π 4 π 0 π¬ 1 π 0
Would be interesting to see how it would perform for BOP dynamic onboarding.
As you can tell, Iβve started sharing interesting 6D pose estimation papers I come across. I already track these for myself, so why not share them with all of you?
26.11.2024 14:02 β π 0 π 0 π¬ 0 π 0
As you can tell, Iβve started sharing interesting 6D pose estimation papers I come across. I already track these for myself, so why not share them with all of you?
25.11.2024 14:05 β π 0 π 0 π¬ 0 π 0
The GPU poor do not have it easy ;). But usually it is just multiple notebooks and nvidia-smi is handy to see how much each notebook is taking up.
21.11.2024 15:04 β π 1 π 0 π¬ 0 π 0
Haven't switched yet, is there an easy way to see which programs take up how much gpu memory like in nvidia-smi?
21.11.2024 09:16 β π 3 π 0 π¬ 2 π 0
Thats why all my pytorch code looks like:
```
from torchvision.transforms.v2.functional import to_dtype, to_image
img_tensor = to_dtype(to_image(image), scale=True)
```
20.11.2024 19:19 β π 4 π 0 π¬ 1 π 0
Research at Google DeepMind. Ex-Physicist. Controllable World Simulators (GNNs, Structured World Models, Neural Assets). TLM Veo Capabilities (Ingredients & more).
π San Francisco, CA
From SLAM to Spatial AI; Professor of Robot Vision, Imperial College London; Director of the Dyson Robotics Lab; Co-Founder of Slamcore. FREng, FRS.
Researching #ComputerVision at #GoogleDeepMind using JAX/Flax (http://github.com/google/flax). Views are my own.
@PyTorch "My learning style is Horace twitter threads" -
@typedfemale
Member of technical staff @periodiclabs
Open-source/open science advocate
Maintainer of torchrl / tensordict / leanrl
Former MD - Neuroscience PhD
https://github.com/vmoens
Co-Founder & CEO, Sakana AI π β @sakanaai.bsky.social
https://sakana.ai/careers
research scientist at google deepmind.
phd in neural nonsense from stanford.
poolio.github.io
RecSys, AI, Engineering; Principal Applied Scientist @ Amazon. Led ML @ Alibaba, Lazada, Healthtech Series A. Writing @ eugeneyan.com, aiteratelabs.com.
a mediocre combination of a mediocre AI scientist, a mediocre physicist, a mediocre chemist, a mediocre manager and a mediocre professor.
see more at https://kyunghyuncho.me/
GEODE Team Lead (Geometric Deep Learning)
3D vision researcher @NaverLabsEurope
Official account for International Conference on 3D Vision (3DV) #3DV2026 π¨π¦
Website: https://3dvconf.github.io/
Tenured Researcher @INRIA, Ockham team. Teacher @Polytechnique
and @ENSdeLyon
Machine Learning, Python and Optimization
Prof ETH ZΓΌrich, Director Microsoft Spatial AI Lab, CV/ML/Robotics
Professor of Computer Vision/Machine Learning at Imagine/LIGM, Γcole nationale des Ponts et ChaussΓ©es @ecoledesponts.bsky.social Music & overall happiness π³πͺ» Born well below 350ppm π¬ mostly silly personal views
πParis π https://davidpicard.github.io/
The AI community building the future!
Principal Scientist at Naver Labs Europe, Lead of Spatial AI team. AI for Robotics, Computer Vision, Machine Learning. Austrian in France. https://chriswolfvision.github.io/www/
Helping robots see @ http://intrinsic.ai (Alphabet company). Here talking about 3D Computer Vision and everything around it. Views are my own.
3D Computer Vision & ML
Research Scientist @Google
I lead Cohere For AI. Formerly Research
Google Brain. ML Efficiency, LLMs,
@trustworthy_ml.