The Safari browser is like a car with one gear that claim it does not pollute...
24.08.2025 14:11 β π 0 π 0 π¬ 0 π 0@sciencialab.com.bsky.social
The Safari browser is like a car with one gear that claim it does not pollute...
24.08.2025 14:11 β π 0 π 0 π¬ 0 π 0Exactly! There is a common misconception that by throwing any kind of crap into a vector it will magically work. Still at the age of AI, metadata information cannot still be ignored.
28.07.2025 07:25 β π 0 π 0 π¬ 0 π 0Yes. The time is now. Vaccines to treat and prevent cancer.
www.jci.org/articles/vie...
Your feedback will help us improve Grobid! π Feel free to share your thoughts, star us on GitHub, and letβs keep building! π¬π
Next up, we're focusing on supporting more platforms (Linux ARM), improving figures and tables extraction, enhancing CJK language support, and providing better handling for more document types like theses, reports, and more.
π½
- π€ Improved recognition of non-standard fonts
- π οΈ Various bug fixes and security vulnerabilities addressed
github.com/kermitt2/...
π½
Grobid 0.8.2 is out! π
- π§ New processing "flavors" for different doc types (e.g. SDO, corrections, editorials)
- π Improved URL extraction
- β
Better text extraction for paragraphs around figures and tables
π§΅π½
I estimate that a few examples for each model would quickly improve the results to an acceptable level.
Feel free to reach out if you are interested, and we can work out a collaboration around it.
I'm not sure Grobid is used in any project targetting any of the CJK languages, as other details might need to be addressed.
We started a branch at low-priority (github.com/kermitt2/gro...) to improve CJK languages at once, but other more urgent issues were prioritized at the time.
π
It's interesting to see this analysis, however, to be fair, Grobid does not have any training data for Japanese. This is valid also for Chinese, Korean, etc.
π
Dear @github, I wonder whether it would be possible to have a way to save certain "search parameters" inside the issues/pulls so that our work may be framed to important tasks. E.g. working on a specific milestone and wanting to know everything that is not yet done:
new demo: kermitt2-grobid.hf.space
07.05.2025 19:30 β π 0 π 0 π¬ 0 π 0GROBID by Patrice Lopez turns messy PDFs into well-structured text in TEI format including references- super useful! https://github.com/kermitt2/grobid
18.12.2013 11:53 β π 0 π 1 π¬ 1 π 0To what extent do researchers funded by Dutch Research Council NWO and ZonMw share the research data and code underlying their publications?
Today we published an analysis based on 10.000+ papers using the open source tool Grobid: www.nwo.nl/en/news/shar...
All underlying data openly available!
Grobid popularity is still growing, despite LM, LLM, LLLM....
06.05.2025 14:57 β π 0 π 0 π¬ 0 π 0Hi, I'm happy to sell my Twitter handler and close up my twitter account, as soon as it's legally allowed π
21.01.2025 08:37 β π 1 π 0 π¬ 0 π 0Hallucinating AI? π«£
09.12.2024 17:02 β π 0 π 0 π¬ 0 π 0install Ublock (ublockorigin.com/) and Ghostery (www.ghostery.com/), or both.. they will increase your security and privacy overall.
2/2
Suggestion not asked. If you don't want advertisements anymore on Twitter, you can pay 200 EUR per year (100 EUR only reduced them by half, lol), or you can pay 0 EUR and
1/2
Ci sono strumenti che per postare in entrambi i social per esempio con fedica.com (uno a caso che sembra fatto bene) π Se i contenuti, senza necessariamente seguire tutte le risposte, si espandono di qui Γ© piΓΊ facile spingerne l'espansione
21.11.2024 18:48 β π 1 π 0 π¬ 0 π 0Alleluja!! www.nature.com/artic...
21.11.2024 15:21 β π 0 π 0 π¬ 0 π 0π
21.11.2024 08:15 β π 1 π 0 π¬ 0 π 0Here a few tips for using Bluesky github.com/JefTek/Blues...
21.11.2024 08:13 β π 0 π 0 π¬ 0 π 0Vedendo come sta andando la sua ricerca, io non mi preoccuperei troppo ;-)
20.11.2024 14:24 β π 1 π 0 π¬ 0 π 0