George Macgregor's Avatar

George Macgregor

@g3om4c.code4lib.social.ap.brid.gy

I'll tend to toot about #repositories, #openscience, structured #data, resource discovery, #inforetrieval -- but may occasionally toot #jazz vibes too […] [bridged from https://code4lib.social/@g3om4c on the fediverse by https://fed.brid.gy/ ]

30 Followers  |  5 Following  |  191 Posts  |  Joined: 05.09.2024  |  1.892

Latest posts by g3om4c.code4lib.social.ap.brid.gy on Bluesky

Original post on code4lib.social

EPrints now supports COAR Notify functionality. Exceptional news! 'Ingredient' or Bazaar plugin details available at the link below.

https://coar-notify.net/catalogue/implementations/platforms/eprints/

https://wiki.eprints.org/w/COAR_Notify_Overview #EPrints #Notify #repository #repositories […]

06.08.2025 13:21 — 👍 1    🔁 2    💬 0    📌 0
Original post on code4lib.social

There must soon surely be an electoral day of reckoning for the current US administration. The anti-intellectual, pseudo-scientific, take-us-back-to-the-dark-ages philosophy which appears to permeate its ranks is breathtaking. It is also all very upsetting to watch from afar. It will not serve […]

06.08.2025 13:08 — 👍 0    🔁 0    💬 1    📌 0
Screenshot from the blog post showing the publisher version of an image and the seriously degraded "archived" version in PubMed Central, where text is even by humans hard to read. The snippet of the image shown shows the names of a number of WikiPathways pathways, with WP-identifier and name.

Screenshot from the blog post showing the publisher version of an image and the seriously degraded "archived" version in PubMed Central, where text is even by humans hard to read. The snippet of the image shown shows the names of a number of WikiPathways pathways, with WP-identifier and name.

new blog post: "Archiving, but not really" https://chem-bla-ics.linkedchemistry.info/2025/08/06/archiving-but-not-really.html https://doi.org/10.59350/vwd81-p8z85

in which I express serious problems with some publishers not ensuring data quality with […]

[Original post on mastodon.social]

06.08.2025 07:32 — 👍 2    🔁 4    💬 0    📌 0
Post image

Gartner Hype Cycle Identifies Top #AI Innovations in 2025
https://www.gartner.com/en/newsroom/press-releases/2025-08-05-gartner-hype-cycle-identifies-top-ai-innovations-in-2025

05.08.2025 21:36 — 👍 0    🔁 2    💬 0    📌 0
Original post on code4lib.social

[Trump, tariffs, automobiles]

It probably goes without saying that #Trump displays almost infantile thinking when it comes to #trade and #tariffs. It's embarrassing. He says about #Japanese #automobiles: "they send out millions to the USA] but they won’t take any of ours" -- as if [#Japan […]

05.08.2025 22:18 — 👍 0    🔁 0    💬 0    📌 0
Original post on code4lib.social

"IFLA —the leading international body representing the interests of library and information services—has signed the Statement on Four #DigitalRights of Memory Institutions, joining more than 130 signatories from around the world who are calling for the legal rights that libraries, archives, and […]

31.07.2025 07:22 — 👍 0    🔁 1    💬 0    📌 0

1st world problem:

You can have a 2.5 Gbps internet connection, but that does not mean the website you're visiting will load faster.

29.07.2025 05:01 — 👍 1    🔁 1    💬 0    📌 0
Original post on code4lib.social

@gedankenstuecke Yes!! You are dead on!! I think a lot of European governments are behaving similarly. My disappointment about the Starmer led Labour Party government is that, rather than demonstrating how they are different to Farage, they are instead leaning into some of the rhetoric, like you […]

29.07.2025 22:42 — 👍 0    🔁 0    💬 0    📌 0
Original post on code4lib.social

@gedankenstuecke Yes, there were some protests. But they involved mere hundreds of protesters, at some hotels in England where asylum seekers are being temporarily housed.

I guess I am simply saying that press reporting of these protests is not proportional. And I dislike it because it gives […]

29.07.2025 22:30 — 👍 0    🔁 0    💬 1    📌 0
Original post on code4lib.social

@gedankenstuecke Agree, it's very frustrating; though I wouldn't describe it as "UK society" thing. This is essentially the leader (Farage) of a far right party -- and its disciples -- trying to whip up hatred about refugees, and turning against the RNLI in process. The last time they tried this […]

29.07.2025 21:19 — 👍 0    🔁 0    💬 1    📌 0
Post image

Major Update to #Unpaywall Database (via #OurResearch) https://blog.ourresearch.org/major-update-to-unpaywall-database/

29.07.2025 20:57 — 👍 0    🔁 2    💬 0    📌 0

How depressing. I'm currently receiving invitations for routine meetings that are scheduled to be held as far away as June 2026... 2026! What an appalling thought. Groan...

#ModernLifeIsRubbish

29.07.2025 08:05 — 👍 0    🔁 0    💬 0    📌 0
Original post on code4lib.social

Black Sabbath are arguably #Birmingham's best musical export; but surely #UB40 are a close 2nd?

It's been years since I have listened to this song but heard it while shopping in Lidl, of all places. What a terrific song. Space #dub #reggae, with a haunting melody and lyrical message.

The Earth […]

28.07.2025 20:28 — 👍 0    🔁 0    💬 0    📌 0
Original post on code4lib.social

Another interesting paper to add to the 'to read' pile. Includes some rich looking analysis!

"This study provides evidence that thematic clustering can be a beneficial #aggregation approach, opening opportunities for studying different ways of representing and #visualizing aggregated #search […]

28.07.2025 19:58 — 👍 0    🔁 0    💬 0    📌 0
Original post on code4lib.social

Universities -- & national scholarly funders more generally -- have been resistant to developing #open, transparent #LLMs that deliver sovereign AI, seeing this as a private gig for big tech only. Pleased therefore to see EPFL, ETH #Zurich, & the Swiss National Supercomputing Centre (CSCS) […]

13.07.2025 19:40 — 👍 0    🔁 3    💬 0    📌 0
Another picture of the statue, Foam, at Greenbank Gardens, standing in the centre of a pool of water. Blues skies and vegetation visible in the background.

Another picture of the statue, Foam, at Greenbank Gardens, standing in the centre of a pool of water. Blues skies and vegetation visible in the background.

Foam, taken from a different, perhaps better, angle.

12.07.2025 16:03 — 👍 0    🔁 0    💬 0    📌 0
Picture of the plaque describing 'Foam'. Charles d’Orville Pilkington Jackson, the sculptor of the equestrian statue of Robert the Bruce at Bannockburn, created Foam, a bronze water nymph, for the Glasgow Empire Exhibition in 1938. Other mythical creatures at the exhibition surrounded the statue, all sold except for Foam, which Pilkington Jackson kept. After his death, the statue was moved to Greenbank, the nearest garden to Bellahouston Park, where the exhibition occurred.

Picture of the plaque describing 'Foam'. Charles d’Orville Pilkington Jackson, the sculptor of the equestrian statue of Robert the Bruce at Bannockburn, created Foam, a bronze water nymph, for the Glasgow Empire Exhibition in 1938. Other mythical creatures at the exhibition surrounded the statue, all sold except for Foam, which Pilkington Jackson kept. After his death, the statue was moved to Greenbank, the nearest garden to Bellahouston Park, where the exhibition occurred.

Picture of the statue, Foam, at Greenbank Gardens, standing in the centre of a pool of water. Blues skies and vegetation visible in the background.

Picture of the statue, Foam, at Greenbank Gardens, standing in the centre of a pool of water. Blues skies and vegetation visible in the background.

Scorching day in Glasgow today -- 29 degrees Celsius. Decided to visit Greenbank Gardens in the city's southside, which includes this beautiful statue, 'Foam'. #Glasgow #NTS #NationalTrust #art #sculpture

12.07.2025 13:48 — 👍 1    🔁 0    💬 1    📌 0
Original post on code4lib.social

'What makes this paper great?' by Jenna Bartel (iSchool, Toronto). This paper won the Best Conference Paper commendation at the Conceptions of #Library and #InformationScience Conference (CoLIS).

Theorising Notions of #Searching, (Re)Sources, & Evaluation in Light of Generative AI […]

11.07.2025 19:54 — 👍 0    🔁 0    💬 0    📌 0

Things LLMs and I have trouble with: 1) I say "write this thing", they hear "give me a structured approach to help me draft this thing". 2) I say "read this data and tell me about it", they hear "hallucinate a bunch more data more or less like this".

10.07.2025 20:45 — 👍 0    🔁 2    💬 0    📌 0
Screen snippet of opening slide delivered by Kathleen Shearer, presented at the soft launch of the COAR International Repository Directory (IRD). Slide explains the motivation for a new, comprehensive, and accurate directory of repositories. It also summarises some of the deficiencies inherent in existing directories.

Screen snippet of opening slide delivered by Kathleen Shearer, presented at the soft launch of the COAR International Repository Directory (IRD). Slide explains the motivation for a new, comprehensive, and accurate directory of repositories. It also summarises some of the deficiencies inherent in existing directories.

Yay! Attending the soft launch of the new International #Repositories Directory (IRD), created under the ausprices of @coar_repositories and led by @paulwalk. Kudos to all involved! The IRD presents a long needed superior model of #repository directory […]

[Original post on code4lib.social]

08.07.2025 10:42 — 👍 0    🔁 0    💬 0    📌 0
Original post on code4lib.social

Another week -- which means another research paper questioning whether #LLMs should go anywhere near the scholarly research process. And the answer is, unsurprisingly, 'no'. #ChatGPT 4.0 and #Bard #hallucinated #references in circa 29% and 91% of cases. But there are many other worrying […]

04.07.2025 07:54 — 👍 2    🔁 1    💬 0    📌 0
Original post on mastodon.social

This is Ben Zhao's talk at Open Repositories 2025: "Dealing with Generative AI, Harms and Mitigation Techniques"

https://zenodo.org/records/15790708

The slides and a recording of the talk are there. Zhao is a clear, authoritative, and engaging speaker.

I think this talk should be *required* […]

02.07.2025 15:10 — 👍 0    🔁 0    💬 0    📌 0
Springer Nature book on machine learning is full of made-up citations Would you pay $169 for an introductory ebook on machine learning with citations that appear to be made up? If not, you might want to pass on purchasing _Mastering Machine Learning: From Basics to Advanced_, published by Springer Nature in April. Based on a tip from a reader, we checked 18 of the 46 citations in the book. Two-thirds of them either did not exist or had substantial errors. And three researchers cited in the book confirmed the works they supposedly authored were fake or the citation contained substantial errors. “We wrote this paper and it was not formally published,” said Yehuda Dar, a computer scientist at Ben-Gurion University of the Negev, whose work was cited in the book. “It is an arXiv preprint.” The citation incorrectly states the paper appeared in _IEEE Signal Processing Magazine_. Aaron Courville, a professor of computer science at Université de Montréal and coauthor on the book _Deep Learning_, was correctly cited for the text itself, but for a section that “doesn’t seem to exist,” he said. “Certainly not at pages 194-201.” And Dimitris Kalles of Hellenic Open University in Greece also confirmed he did not write a cited work listing him as the author. The researcher who emailed us, and asked to remain anonymous, had received an alert from Google Scholar about the book, which cited him. While his name appeared on multiple citations, the cited works do not exist. Nonexistent and error-prone citations are a hallmark of text generated by large language models like ChatGPT. These models don’t search literature databases for published papers like a human author would. Instead, they generate content based on training data and prompts. So LLM-generated citations might look legitimate, but the content of the citations might be fabricated. The book’s author, Govindakumar Madhavan, asked for an additional “week or two” to fully respond to our request for comment. He did not answer our questions asking if he used an LLM to generate text for the book. However, he told us, “reliably determining whether content (or an issue) is AI generated remains a challenge, as even human-written text can appear ‘AI-like.’ This challenge is only expected to grow, as LLMs … continue to advance in fluency and sophistication.” According to his bio in the book, Madhavan is the founder and CEO of SeaportAi and author of about 40 video courses and 10 books. The 257-page book includes a section on ChatGPT that states: “the technology raises important ethical questions about the use and misuse of AI-generated text.” Springer Nature provides policies and guidance about the use of AI to its authors, Felicitas Behrendt, senior communications manager for books at the publisher, told us by email. “Whilst we recognise that authors may use LLMs, we emphasise that any submission must be undertaken with full human oversight, and any AI use beyond basic copy editing must be declared.” _Mastering Machine Learning_ contains no such declaration. When asked about the potential use of AI in the work, Behrendt told us: “We are aware of the text and are currently looking into it.” She did not comment on efforts taken during Springer Nature’s editorial process to ensure its AI policies are followed. LLM-generated citations were at the center of controversies around Robert F. Kennedy Jr.’s “Make America Healthy Again” report and a CDC presentation on the vaccine preservative thimerosal. At Retraction Watch, our cofounders were once cited in a made-up reference in an Australian government report on research integrity. We’ve seen fake citations fell research articles, and our list of papers with evidence of undisclosed ChatGPT use has grown long and almost certainly represents only a fraction of those that actually do. The same day Behrendt replied to our query, Springer Nature published a post on its blog titled, “Research integrity in books: Prevention by balancing human oversight and AI tools.” “All book manuscripts are initially assessed by an in-house editor who decides whether to forward the submission to further review,” Deidre Hudson Reuss, senior content marketing manager at the company, wrote. “The reviewers – subject matter experts – evaluate the manuscript’s quality and originality, to ensure its validity and that it meets the highest integrity and ethics standards.” * * * _Like Retraction Watch? You can make a _tax-deductible contribution to support our work_, follow us _on X_ or Bluesky, like us _on Facebook_, follow us on LinkedIn, add us to your _RSS reader_, or subscribe to our _daily digest_. If you find a retraction that’s _not in our database_, you can _let us know here_. For comments or feedback, email us at [email protected]._ * * * Sign up for our newsletter By clicking submit, you agree to share your email address with the site owner and Mailchimp to receive marketing, updates, and other emails from the site owner. Use the unsubscribe link in those emails to opt out at any time. Processing… Success! You're on the list. Whoops! There was an error and we couldn't process your subscription. Please reload the page and try again. ### Share this: * Click to email a link to a friend (Opens in new window) Email * Click to share on Bluesky (Opens in new window) Bluesky * Click to share on LinkedIn (Opens in new window) LinkedIn * Click to share on Facebook (Opens in new window) Facebook * Click to share on X (Opens in new window) X * ### _Related_

Whoops....

Springer Nature book on machine learning is full of made-up citations
https://retractionwatch.com/2025/06/30/springer-nature-book-on-machine-learning-is-full-of-made-up-citations/ #SpringerNature #MachineLearning #AI #LLM #hallucinations

01.07.2025 07:32 — 👍 0    🔁 3    💬 0    📌 0
Original post on code4lib.social

'Unfair publisher fees for deposit into #repositories highlight the need for #authors to exercise their rights'

New blog post from @coar_repositories once again pushing against the emergence of "publishers which] are audaciously seeking to monetize funder mandates by making it more difficult […]

30.06.2025 18:42 — 👍 0    🔁 0    💬 0    📌 0
Original post on code4lib.social

"Works-magnet addresses challenges related to metadata heterogeneity, complex processing chains, and the need for human curation in a diverse research landscape..."

2506.14430] Works-magnet: Accelerating [#Metadata Curation for Open Science

https://arxiv.org/abs/2506.14430 #OpenScience […]

21.06.2025 16:19 — 👍 0    🔁 0    💬 0    📌 0

@DanielRThomas Holy smokes. Not that it is unexpected -- but the localism of #Glasgow adds to the realism that the #environment is changing. #ClimateChange

21.06.2025 16:11 — 👍 0    🔁 1    💬 0    📌 0
Preview
ChatGPT Gets 'Absolutely Wrecked' in Chess Match With 1978 Atari The chatbot 'made enough blunders to get laughed out of a third-grade chess club,' according to the developer who set up the contest.

Way to go Atari 2600! 😃

"The #chatbot made enough blunders to get laughed out of a third-grade chess club..."

#ChatGPT Gets 'Absolutely Wrecked' in #Chess Match With 1978 #Atari
https://uk.pcmag.com/ai/158596/chatgpt-gets-absolutely-wrecked-in-chess-match-with-1978-atari

19.06.2025 16:02 — 👍 1    🔁 1    💬 0    📌 0
A picture of M. C. Escher's print ‘Stars’, depicting various polyhedra, with two chameleons inside the largest of them

A picture of M. C. Escher's print ‘Stars’, depicting various polyhedra, with two chameleons inside the largest of them

I almost forgot to wish you all a very happy Escher's birthday!

17.06.2025 17:00 — 👍 2    🔁 6    💬 0    📌 0
Original post on code4lib.social

Time travel! Love the retro search UI. Terrific work @internetarchive!

Keep on GIFin’ — A New Version of GifCities, Internet Archive’s GeoCities Animated GIF Search Engine! […]

12.06.2025 08:07 — 👍 0    🔁 0    💬 0    📌 0

@g3om4c.code4lib.social.ap.brid.gy is following 5 prominent accounts