Tag Archives: data hygiene

Spreading the open science love like it’s my job because it is my job.

So, this happened. Apparently, yelling into the void about data management and reproducibility is a very good way to get yourself onto a rag-tag team of open-science proto-superheroes.1 Yes, that’s right, I’m a Mozilla Science fellow, and together, baby, we’re … Continue reading

Posted in Uncategorized | Tagged , , , , , , , , , , , , | 1 Comment

Dealing with dates as data in Excel

This post is not about dating, as in romantic entanglements, although lots of scientists I know collect data on the dates they go on.* But like dates of the human variety, dealing with dates, as in those pesky things that … Continue reading

Posted in Uncategorized | Tagged , , , , , , , , , | 4 Comments

Help me, I’m covered in bees -or- using OpenRefine to clean specimen data

It’s probably time I got to this- I did promise it over three months ago. Yesterday, I finally sat down to revisit a dataset which was created in 2008-2009, but then orphaned when staff moved on to other positions. It’s … Continue reading

Posted in Uncategorized | Tagged , , , , , , , , , , | 1 Comment

Getting meta

Friends, your resident bug-counter has been, unh, busy lately. I’ve been travelling a lot over the past two months, between interviews for faculty positions and conferences. And it’s not over- I’ve got two more trips in May. So far, I’ve … Continue reading

Posted in Uncategorized | Tagged , , , , , , , , | 1 Comment

Bridging the data style gap- Part 1- different disciplines, different data formats

This post is first in a series where I will essentially be live-blogging on working on an #otherpeoplesdata dataset. These data are associated with a project examining bee diversity in bioenergy cropping systems. I’ve already done a little bit of … Continue reading

Posted in Uncategorized | Tagged , , , , , , , , , , , , | 4 Comments

This post is about nothing*

You’ll all be pleased to know that I got my big, Big-Data pre-proposal to NSF sent off.** It’s been occupying about 90% of my brain these last few weeks, so it’s good to send it off, and get myself back … Continue reading

Posted in Uncategorized | Tagged , , , , , , , , , | 2 Comments

Cleaning up your act: simple ways to find errors in your raw data

It’s been an interesting week. We had a massive ice storm in the eastern Great Lakes area, trees and power lines were totaled, power was out everywhere- I didn’t have electricity for four days. Things, basically, are a disordered mess, … Continue reading

Posted in Uncategorized | Tagged , , , , , , , , , , , | 9 Comments