
Crystal Lewis

@cghlewis.bsky.social

Research Data Management Consultant | cghlewis.com Co-organizer @r-ladies-stl.bsky.social Co-organizer POWER Data Management Hub | https://osf.io/ap3tk/ Author of DMLSER: https://datamgmtinedresearch.com/ RDM Weekly: https://rdmweekly.substack.com/

5,785 Followers  |  1,786 Following  |  1,507 Posts  |  Joined: 15.08.2023

Posts by Crystal Lewis (@cghlewis.bsky.social)

RDM Weekly - Issue 034 A weekly roundup of Research Data Management resources.

Issue 34 of #rdmweekly is out! 📬

➡️ Updated Elements of an NIH Data Management and Sharing Plan
➡️ Making Code Ready for Publication @tladeras.bsky.social
➡️ 10 Things for Curating Reproducible and FAIR Research @researchdataall.bsky.social
and more!

rdmweekly.substack.com/p/rdm-weekly...

03.03.2026 14:01 — 👍 8    🔁 5    💬 2    📌 0

I came across this platform that might be of interest to ed researchers.

"A community-powered platform that helps educators, families, researchers, and leaders compare what's working (and what isn't) across real settings — and connect practice to credible evidence."

educationanswers.org/help.html

03.03.2026 13:43 — 👍 2    🔁 0    💬 0    📌 0
Post image

My neighborhood news.

02.03.2026 20:41 — 👍 18    🔁 2    💬 3    📌 1
UAB Libraries-Reference Librarian–RISC (Health Sciences & Systematic Reviews) University of Alabama at Birmingham (UAB) Libraries is seeking a creative, collaborative, and forward-thinking librarian to join our team at Lister Hill Library of the Health Sciences. In this role, y...

🐉 Hey there to all my 5 followers! UAB is seeking a Systematic Review Librarian. Come work with me!! Learn more and apply here: uab.peopleadmin.com/postings/27128 #medlibs

02.03.2026 16:25 — 👍 7    🔁 3    💬 0    📌 0

It is NEVER too late to make a data dictionary. Sometimes it’s the only way to start working with an existing data set.

02.03.2026 15:22 — 👍 17    🔁 4    💬 0    📌 0

Maybe this question was designed for you then. 😅

02.03.2026 15:15 — 👍 0    🔁 0    💬 1    📌 0

😂 Seriously! I was like, I don't know how to answer this question?

02.03.2026 15:07 — 👍 0    🔁 0    💬 0    📌 0
ALT: a cartoon character with orange hair has the words hmmmm on his head

Duo Mobile:
"Are you enjoying Duo Mobile?"

Am I enjoying an authentication app? Such a strange question.

02.03.2026 14:46 — 👍 15    🔁 0    💬 1    📌 0

😂

02.03.2026 00:35 — 👍 1    🔁 0    💬 0    📌 0

😂🥰

01.03.2026 17:46 — 👍 1    🔁 0    💬 0    📌 0
OSF

Template: osf.io/e5g6t/files/...

01.03.2026 15:55 — 👍 5    🔁 1    💬 0    📌 0
Post image

The data dictionary: A simple but effective way to plan, organize, validate, and document your data.

datamgmtinedresearch.com/document#doc...

01.03.2026 15:53 — 👍 50    🔁 8    💬 2    📌 1
“I don’t like tibbles”, “pipes are overrated”, “loops are great actually”, … what’s next?

Ah yes — the R contrarian character arc 😄
If we continue the escalation, the next stages usually look like:
	•	“Actually, base R plotting is more transparent.”
	•	“data.table was right all along.”
	•	“I write my own S3 methods for fun.”
	•	“Vectorization is just premature optimization avoidance.”
	•	“apply() is perfectly readable.”
	•	“Why depend on 40 packages for a left join?”
	•	“I don’t trust NSE.”
	•	“Magrittr changed evaluation semantics and nobody talks about it.”
	•	“One well-written for-loop beats clever code.”
	•	“I opened the tidyverse source code once and never recovered.”

Planning my next conversation starter

26.02.2026 18:06 — 👍 102    🔁 11    💬 22    📌 7

Keep an eye on it. The template is still a draft. :)

26.02.2026 21:40 — 👍 1    🔁 0    💬 0    📌 0

@cghlewis.bsky.social do you have any recommendations for lightweight data dictionaries? I'm thinking about how you record the knowledge you build up over the course of analysis, particularly when the data comes to you only partially documented.

23.02.2026 18:26 — 👍 30    🔁 4    💬 4    📌 0

💯

26.02.2026 13:48 — 👍 0    🔁 0    💬 0    📌 0

While it does reduce burden on applicants, I’m not sure this checkbox model prepares anyone for actual data sharing.
Here is a draft, awaiting OMB clearance:
grants.nih.gov/sites/defaul...

26.02.2026 12:38 — 👍 9    🔁 1    💬 1    📌 0
NOT-OD-26-046: Updated Elements of an NIH Data Management and Sharing Plan NIH Funding Opportunities and Notices in the NIH Guide for Grants and Contracts: Updated Elements of an NIH Data Management and Sharing Plan NOT-OD-26-046. NIH

This is a dramatic update to the NIH Data Management and Sharing Plan template, effective for applications submitted for due dates on or after May 25, 2026.

grants.nih.gov/grants/guide...

26.02.2026 12:34 — 👍 14    🔁 16    💬 2    📌 1
Post image

🎶 Here I go again on my own. Going down the only road I’ve ever known 🎶

26.02.2026 00:11 — 👍 17    🔁 0    💬 0    📌 0

That’s it :)

25.02.2026 18:30 — 👍 1    🔁 0    💬 0    📌 0
A Comparison of Packages to Generate Codebooks in R

There’s a lot of R packages that make codebooks from existing data. I just need something that I can create before data is collected. cghlewis.github.io/rladies-nyc-...

25.02.2026 16:43 — 👍 1    🔁 0    💬 0    📌 0
Chapter 8 Documentation | Data Management in Large-Scale Education Research Figure 8.1: Documentation in the research project life cycle. Documentation is a collection of files containing procedural and descriptive information about your team, project, workflows, and...

Also, here is where I wrote about how I create data dictionaries.
Section 8.4.1
Probably more information than you wanted. 😅
datamgmtinedresearch.com/document

25.02.2026 12:23 — 👍 1    🔁 0    💬 0    📌 0
Sage Journals: Discover world-class research Subscription and open access journals from Sage, the world's leading independent academic publisher.

This starts to get there but I don’t think it fits the human readable piece and I’m not sure it has all the field options I’d want. But frictionless data packages might?

journals.sagepub.com/doi/10.1177/...

25.02.2026 11:46 — 👍 0    🔁 0    💬 2    📌 0

But I’ve definitely been thinking about other ways to create a data dictionary (for spreadsheets) before or after you collect data that documents information in both a human and machine readable way using a standard metadata schema. That can then be used to both understand and validate data.

25.02.2026 11:38 — 👍 1    🔁 0    💬 2    📌 0
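
One way to picture the idea in the post above is the rough Python sketch below. It is not the author's actual workflow. It shows a data dictionary that a person can read and that a script can also use to check data. The variable names (stu_id, grade, frl), the dictionary fields, and the validate() helper are all made up for illustration; a real project would more likely keep the dictionary in a spreadsheet or a standard metadata schema.

```python
# Hypothetical data dictionary: one entry per variable.
# Every name and rule here is illustrative, not taken from a real project.
DATA_DICTIONARY = [
    {"name": "stu_id", "label": "Student ID", "type": int, "min": 1000, "max": 9999},
    {"name": "grade", "label": "Grade level", "type": int, "allowed": [6, 7, 8]},
    {"name": "frl", "label": "Free/reduced lunch status", "type": str, "allowed": ["yes", "no"]},
]


def validate(rows, dictionary):
    """Return a list of human-readable problems found in rows."""
    problems = []
    expected = {entry["name"] for entry in dictionary}
    for i, row in enumerate(rows, start=1):
        # Flag variables that appear in the data but not in the dictionary.
        for extra in sorted(set(row) - expected):
            problems.append(f"row {i}: undocumented variable '{extra}'")
        for entry in dictionary:
            name = entry["name"]
            if name not in row:
                problems.append(f"row {i}: missing variable '{name}'")
                continue
            value = row[name]
            # Check the documented type before any value-level checks.
            if not isinstance(value, entry["type"]):
                problems.append(f"row {i}: {name} should be {entry['type'].__name__}")
                continue
            if "allowed" in entry and value not in entry["allowed"]:
                problems.append(f"row {i}: {name}={value!r} is not an allowed value")
            if "min" in entry and value < entry["min"]:
                problems.append(f"row {i}: {name}={value!r} is below the minimum")
            if "max" in entry and value > entry["max"]:
                problems.append(f"row {i}: {name}={value!r} is above the maximum")
    return problems


if __name__ == "__main__":
    sample = [
        {"stu_id": 1234, "grade": 7, "frl": "yes"},
        {"stu_id": 12, "grade": 9, "frl": "no"},
    ]
    for problem in validate(sample, DATA_DICTIONARY):
        print(problem)
```

The same list of entries doubles as documentation (name, label, allowed values), which is the "understand and validate" pairing the post describes.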
OSF

😂 You got it!
The first tab is a template, the second tab is an example. I keep the columns pretty flexible (adding fields as needed or removing ones that are not needed).

osf.io/e5g6t/files/...

25.02.2026 11:29 — 👍 9    🔁 0    💬 3    📌 0

But also I’ve heard good things about frictionless data packages.

25.02.2026 03:22 — 👍 0    🔁 0    💬 0    📌 0

Hey, sorry I’ve been a little off the grid. I’m not sure you’re going to love my solution. After doing some detective work, I usually just manually create a data dictionary in Excel to document all the logic that I uncover.

25.02.2026 03:10 — 👍 5    🔁 0    💬 3    📌 0

I haven’t heard one word about AI since I’ve been in Costa Rica and it’s been amazing.

23.02.2026 00:14 — 👍 24    🔁 3    💬 2    📌 0
Live coding TidyTuesday. Join us with Hadley Wickham, Tues, Feb 17 at 12pm ET, pos.it/dslab

Join us in a few hours to watch Hadley Wickham live code!

Featuring data from the TidyTuesday project, the new Posit AI in RStudio, and lots of good times. #RStats

Sign up here: pos.it/dslab

17.02.2026 15:08 — 👍 11    🔁 3    💬 0    📌 1