's Avatar

@deep-diver.bsky.social

87 Followers  |  15 Following  |  12 Posts  |  Joined: 20.11.2024  |  1.9577

Latest posts by deep-diver.bsky.social on Bluesky

Post image

Simple Summarization on DeepSeek-R1

RL is key
↳ but hard to make it helpful alone.
↳ 4 stage pipeline (good start + reasoning RL + SFT + safety RL) = o1 level performance.
↳ Distilling R1-Zero outputs = o1-mini level.

Model: huggingface.co/deepseek-ai
Paper: github.com/deepseek-ai/...

21.01.2025 13:03 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

and you can also select and use Gemini, Mistral, and LLaMA as a generative model.

Out-of-the-box data sources include Local, Google Cloud Storage, Google Drive, Slack, Jira, making it easy to create PoCs for a wide range of use cases.

20.01.2025 01:29 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

For example, you can select and use GCP's { RagManagedDb, Vector Search, Feature Store } or third-party { Weaviate, pinecone } as underlying DB. In addition, you can select GCP's { text-embedding, gecko } or the open source model { e5-base | large | small } as an embedding model,

20.01.2025 01:29 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

You can configure the desired RAG pipeline with various combinations, and you can also use the backend service developed and provided by Google.

20.01.2025 01:29 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Vertex AI RAG Engine: A developers tool Build robust and grounded generative AI applications with Vertex AI RAG Engine, reducing hallucinations and enhancing accuracy.

Google's Vertex AI RAG Engine

Google launched a RAG-specific service called "Vertex AI RAG Engine." It can be understood as providing infrastructure for RAG on the Google Cloud Platform and supporting libraries that can be easily utilized.

developers.googleblog.com/en/vertex-ai...

20.01.2025 01:29 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
AI Paper Reviews by AI Explore AI papers with thorough reviews generated by AI

blog on Hugging Face Daily Papers that is updated on a daily basis
: deep-diver.github.io/ai-paper-rev...

17.01.2025 08:12 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
GitHub - deep-diver/paper-reviewer: Generate a comprehensive review from an arXiv paper, then turn it into a blog post. This project powers the website below for the HuggingFace's Daily Papers (https:... Generate a comprehensive review from an arXiv paper, then turn it into a blog post. This project powers the website below for the HuggingFace's Daily Papers (https://huggingface.co/papers). - d...

core project (paper-reviewer)
: github.com/deep-diver/p...

Please give ⭐️ to reach 700 on GitHub!!

17.01.2025 08:12 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image Post image

updates on ai-paper-reviewer!

core
✦ supporting open source Layout Parsing model from
@OpenDataLab_AI

✦ scrapping papers from
@openreviewnet

blog
✦ display papers by the dates added in
@huggingface
Daily Papers. Up to 3 latest days are managed, then archived

link πŸ‘‡

17.01.2025 08:12 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I share these kinda contents that I actually build myself with collaborators.

If you are curious and want to know what's coming, please follow me!

Cheers 🍻

20.11.2024 12:30 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
GitHub - deep-diver/paper-reviewer: Generate a comprehensive review from an arXiv paper, then turn it into a blog post. This project powers the website below for the HuggingFace's Daily Papers (https:... Generate a comprehensive review from an arXiv paper, then turn it into a blog post. This project powers the website below for the HuggingFace's Daily Papers (https://huggingface.co/papers). - d...

And this project got 550 @github.com 🌟 in a month. Notably, it comes with audio podcast for every papers whose quality is quite comparable to NotebookLM.

github.com/deep-diver/p...

20.11.2024 12:30 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs The widespread adoption of cloud-based proprietary large language models (LLMs) has introduced significant challenges, including operational dependencies, privacy concerns, and the necessity of contin...

My first ever full paper in the field of AI. This is quite unique exp since I am not ML background at all.

arxiv.org/abs/2408.13467

20.11.2024 12:30 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I am Chansung. I love collab with others for building cool AI project and writing paper

Recent ones πŸ‘‡
1. Paper on @arxiv

LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs

2. OSS
AI Paper Reviewer: gen text and poscast of papers

Find links below πŸ”—

20.11.2024 12:30 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@deep-diver is following 13 prominent accounts