That is incredible! Thanks for the explanation.
06.12.2024 08:06 — 👍 0 🔁 0 💬 0 📌 0
This work looks incredible! The Elo rating is that good even considering that the model loses a game if it hallucinates an illegal move.
06.12.2024 08:06 — 👍 0 🔁 0 💬 0 📌 0
Super cool work! Do hallucinations happen during the Elo rating evaluation? How does it handle that case?
05.12.2024 12:15 — 👍 0 🔁 0 💬 1 📌 0
Hi Marc, could you add me? Thanks!!
01.12.2024 07:32 — 👍 1 🔁 0 💬 0 📌 0
Hi, could you add me? Thank you!
22.11.2024 10:10 — 👍 1 🔁 0 💬 1 📌 0
Hi! Would love to be added.
18.11.2024 09:17 — 👍 1 🔁 0 💬 0 📌 0
Thanks!
17.11.2024 11:43 — 👍 0 🔁 0 💬 0 📌 0
Would love to be added!
17.11.2024 11:10 — 👍 0 🔁 0 💬 1 📌 0