Updated: Jun 4, 2013
Data Sets from Different Research Projects
To organize data sets generated or processed from different research projects, put them in separate folders with names reflecting the research projects. It is good practice to keep a README file under each folder logging any changes.
When the size of the data set is not large, e.g. several megabytes, using Git for version control is a good choice. Git is an open source version control system, widely adopted by programmers. It is also suitable for version control of any documents including data set files. If you are totally new to Git, start here. It is innovative and yet easy to use.
Some Git service providers are:
Research data sets will change as the research project progresses. For efficient management of research data, and for ease the re-use of research data sets, keep a well-maintained record of changes in your data sets. To document data sets, you need to choose a metadata standard and record all changes.
Metadata covers different components and includes:
Choose a metadata standard:
Use metadata tools: