Guénolé Fiche's Avatar

Guénolé Fiche

@gfiche.bsky.social

Research Scientist at Naver Labs Europe. Human-centric 3D computer vision

84 Followers  |  89 Following  |  6 Posts  |  Joined: 25.11.2024
Posts Following

Posts by Guénolé Fiche (@gfiche.bsky.social)

A new model for human mesh recovery, high-performing and w/o using any 3D scans, has been published by my excellent colleagues at @naverlabseurope.bsky.social. Excellent work!

06.11.2025 11:37 — 👍 24    🔁 3    💬 0    📌 0
Post image

Meet Anny, our Free (Apache 2.0) and Interpretable Human Body Model for all ages.
Anny is built upon #MakeHuman and enables achieving SOTA performance in Human Mesh Recovery.
ArXiv: arxiv.org/abs/2511.03589
Demo: anny-demo.europe.naverlabs.com
Code: github.com/naver/anny

06.11.2025 10:59 — 👍 17    🔁 6    💬 1    📌 2
Post image

MEGA is the first method that achieves SOTA results in both single and multi-output HMR. Want to try it yourself? Code and demo are available at: g-fiche.github.io/research-pag...

Work done in collaboration with @sleglaive.bsky.social , @xavirema.bsky.social , and Francesc Moreno-Noguer. (6/6)

19.03.2025 07:51 — 👍 2    🔁 0    💬 0    📌 0
Post image

We propose 2 generation modes:
- In deterministic mode, MEGA predicts all tokens in a single forward pass, ensuring speed and accuracy.
- In stochastic mode we iteratively sample human mesh tokens, enabling MEGA to produce multiple predictions from a single image. (5/6)

19.03.2025 07:51 — 👍 1    🔁 0    💬 1    📌 0
Post image

Subsequently, we add image conditioning and train MEGA to recover human meshes from image features and partial token sequences. During inference, we begin with a fully masked sequence of tokens and generate a human mesh conditioned on an input image. (4/6)

19.03.2025 07:51 — 👍 1    🔁 0    💬 1    📌 0
Post image

MEGA is first pre-trained on motion capture data to recover human meshes from partial human mesh token sequences with a variable masking rate. Starting from an empty sequence, we are then able to generate random meshes showing high pose and shape diversity. (3/6)

19.03.2025 07:51 — 👍 1    🔁 0    💬 1    📌 0
Post image

MEGA is a MaskEd Generative Autoencoder, which relies on a tokenized representation of human meshes. We frame HMR as generating a sequence of tokens corresponding to a human mesh, conditioned on an input image. (2/6)

19.03.2025 07:51 — 👍 1    🔁 0    💬 1    📌 0
Post image

Reconstructing 3D humans from a single image is highly ambiguous: many 3D poses can explain the same 2D view. Yet, most HMR methods predict a single mesh! 🤯

At #CVPR2025, we present MEGA, a new approach that tackles this challenge. 🧵👇 (1/6)

19.03.2025 07:47 — 👍 2    🔁 0    💬 1    📌 1