conceptofmind

@conceptofmind.bsky.social

31 Followers  |  5 Following  |  12 Posts  |  Joined: 31.10.2023  |  1.5792

Latest posts by conceptofmind.bsky.social on Bluesky


Awesome to see @nvidia @NVIDIAAI using our research for their open-source long-context models.

13.04.2025 18:19 | 👍 0    🔁 0    💬 0    📌 0

Talking about DeepSeek and their connection to Tsinghua. Tsinghua and CMU have an older (2017) but still great series on high-performance parallel computing. The playlist can be found here: https://buff.ly/4gktKSd

26.01.2025 19:49 | 👍 0    🔁 0    💬 0    📌 0
Bridging the Data Provenance Gap Across Text, Speech and Video
Progress in AI is driven largely by the scale and quality of training data. Despite this, there is a deficit of empirical analysis examining the attributes of well-established datasets beyond text.…

Paper link: https://buff.ly/3WtuwWi

22.01.2025 20:01 | 👍 0    🔁 0    💬 0    📌 0

Happy to announce that our paper, Bridging the Data Provenance Gap Across Text, Speech, and Video, was accepted to @iclr_conf. #ICLR2025

22.01.2025 20:01 | 👍 0    🔁 0    💬 1    📌 0

I have an extremely easy evaluation on which all top models currently score 0%. This is the easiest set of evaluations in our entire suite. AGI would be able to solve the hardest problems effortlessly. Once o3 becomes available in the API, I will put out a public baseline.

21.12.2024 19:31 | 👍 0    🔁 0    💬 0    📌 0

Does anyone have any ideas why T5 or CLIP is being used for text encoding in diffusion training instead of a much stronger encoder or embedding model?

15.12.2024 20:19 | 👍 0    🔁 0    💬 0    📌 0

There is absolutely no shortage of pre-training data.

15.12.2024 00:24 | 👍 0    🔁 0    💬 0    📌 0
GitHub - aws/aws-parallelcluster: AWS ParallelCluster is an AWS supported Open Source cluster management tool to deploy and manage HPC clusters in the AWS cloud.

AWS ParallelCluster is honestly such an incredibly useful tool for large-scale distributed training:

14.12.2024 04:25 | 👍 1    🔁 0    💬 0    📌 0
CompVis/cleandift · Hugging Face: We're on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face repo: huggingface.co/CompVis/clea...

05.12.2024 17:26 | 👍 1    🔁 0    💬 0    📌 0
GitHub - CompVis/cleandift: Contribute to CompVis/cleandift development by creating an account on GitHub.

GitHub repo: github.com/CompVis/clea...

05.12.2024 17:26 | 👍 1    🔁 0    💬 1    📌 0

Paper: compvis.github.io/cleandift/st...

05.12.2024 17:26 | 👍 1    🔁 0    💬 1    📌 0

Be sure to check out this awesome work by @stefanabaumann.bsky.social, @rmsnorm.bsky.social, and @koljabauer.bsky.social.

05.12.2024 17:26 | 👍 4    🔁 0    💬 1    📌 0

@conceptofmind is following 5 prominent accounts