Mihai Chirculescu's Avatar

Mihai Chirculescu

@mchirculescu.bsky.social

I make open source projects related to GenAI https://github.com/Mihaiii

583 Followers  |  1,031 Following  |  27 Posts  |  Joined: 08.05.2023  |  2.2854

Latest posts by mchirculescu.bsky.social on Bluesky

Oh, wow, congratulations! 🀯 I admire your determination.

26.12.2024 21:50 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
β€˜Yes, I am a human’: bot detection is no longer working – and just wait until AI agents come along Designed in the early 2000s to stop rogue bots, we could at least justify the infuriation when they did their job properly.

With LLMs becoming mainstream, CAPTCHAs are quickly becoming obsolete. AI agents from big tech, designed to book trips and buy tickets, will seal their fate.

Finding a durable bot detection system that can outsmart advanced AI while remaining user-friendly for humans seems increasingly unlikely.

22.12.2024 13:01 β€” πŸ‘ 59    πŸ” 5    πŸ’¬ 11    πŸ“Œ 4
Post image

Ovis1.6-Gemma2-27B was just released.

I used the 9B version in the past for document understanding and it was similar to Qwen2 VL 7B, so this one should be better (it's also explicitly mentioned "Enhanced Document Understanding" as a key feature).

huggingface.co/AIDC-AI/Ovis1.6-Gemma2-27B

27.11.2024 04:12 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Can I please be added? :)

24.11.2024 18:09 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

πŸ™‹β€β™‚οΈ

22.11.2024 15:57 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
How AI Learned to Reason
YouTube video by Art of the Problem How AI Learned to Reason
22.11.2024 03:53 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Alibaba Group just released Marco-o1, their open source alternative to OpenAI's o1.

Github: github.com/AIDC-AI/Marc...
Paper: arxiv.org/abs/2411.14405
HF: huggingface.co/AIDC-AI/Marc...

22.11.2024 03:26 β€” πŸ‘ 8    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

πŸ™‹β€β™‚οΈ

21.11.2024 23:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I gained a newfound respect for so-called 'prompt engineers'. Trying to approach the LLM with an engineer's mindset (impacting attention outputs, adding steering vectors etc.) feels like the wrong approach.

Each token I put in the prompt matters and I now focus more on that.

21.11.2024 18:38 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Open-source LLMs Join the conversation

We've made a starter pack for researchers/organizations working on open-source LLMS.

Please let us know if we missed you or if you'd like to be added!

go.bsky.app/FELkyDr

20.11.2024 01:33 β€” πŸ‘ 41    πŸ” 14    πŸ’¬ 6    πŸ“Œ 0

πŸ™‹β€β™‚οΈ

20.11.2024 22:25 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Yes, and there's also the human limitation in the equation, which will remain there. I used to buy into the hype about "it's a paradigm shift: instead of the human learning how the business app works, the app will have to learn what the human wants". It an attractive idea, but not realistic.

20.11.2024 15:13 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

"Agentic AI" seems like pure hype to me.

Even when it will be functional and production ready, for real world tasks, RPA solutions will still be superior because: 1. Speed of execution and 2. Clarity of what the human desires and better control.

ELI5 why I'm wrong

20.11.2024 13:34 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

πŸ™‹β€β™‚οΈ

20.11.2024 11:03 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
GitHub - Mihaiii/backtrack_sampler: An easy-to-understand framework for LLM samplers that rewind and revise generated tokens An easy-to-understand framework for LLM samplers that rewind and revise generated tokens - Mihaiii/backtrack_sampler

Thanks! This repo is similar to backtrack_sampler, except it can't backtrack (although backtrack_sampler can do everything that is done here by just ignoring the backtrack functionality).

github.com/Mihaiii/back...

20.11.2024 08:23 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

πŸ™‹β€β™‚οΈ

19.11.2024 13:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I was on X for the AI community, but lately there were less relevant posts in my feed so I decided to give this app another chance.

19.11.2024 11:51 β€” πŸ‘ 7    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Dare is also in the house after a very long break here

19.11.2024 11:32 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Sweet! I'm glad to see you here too!

19.11.2024 11:31 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

How are they preventing bots? Or the API allows only read?

19.11.2024 11:26 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

A very cool read with lots of potential market value. I suspect OpenAI is using something like this internally for their plugins.

https://github.com/newhouseb/clownfish

16.05.2023 19:51 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

And then is the algo that makes recommendations :) https://github.com/bluesky-social/social-app/blob/main/src/lib/constants.ts#L78

16.05.2023 15:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Hmm, indeed, neither GPT4 doesn't output the correct answer. I even tried some prompting, but still no success.

09.05.2023 16:33 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

In my exp, GPT4 is way better than ChatGPT. Let me know the exact prompt you're using and I'll try it out. Bard is also not available in Romania :|

09.05.2023 14:10 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

We now have 3 ggml RedPajama models. They are quantized and some have ~2GB.
HF: https://huggingface.co/keldenl

08.05.2023 17:17 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Github: https://github.com/theislab/single-cell-best-practices

08.05.2023 12:47 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Mass-editing thousands of facts into a transformer memory

Site: https://memit.baulab.info/
Github: https://github.com/kmeng01/memit
Image soutce: Nomic AI discord

08.05.2023 12:15 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I'm going to try to use GPT4 to help me navigate and understand https://github.com/lucidrains/x-transformers .

This is going to take some time, but in theory is should be better than any internet course.

08.05.2023 11:20 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Please don't upload my code on GitHub This is a call to open source developers to not upload the work of others on GitHub.

Over time we’re going to see more people who were once happy to share their work online look for ways to opt out of their writings, code or art being used to train AI.

This is on track to becoming the biggest threat the Open Web has faced in years.

08.05.2023 08:32 β€” πŸ‘ 65    πŸ” 20    πŸ’¬ 7    πŸ“Œ 6
Post image Post image

GPT4ALL is about to announce a ggml adaptation for mpt7b!

The images are form Nomic AI discord.

08.05.2023 10:08 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@mchirculescu is following 17 prominent accounts