#
Data Preparation
#
General dataset properties
A dataset summarizes research data on a delimited situation in a structured manner. The research data should be prepared in such a way that easy reuse is possible. In principle, the data record should be tabular and column-based. In most cases, soil and agricultural data have a spatial reference. The spatial position of the measuring points or areas should be given in the table as detailed as possible.
A typical dataset for data transfer to the BonaRes Repository has the following properties:
- Each column (attribute) of the table contains the attribute name in the first row and the attribute values in the following rows, which means the data within the table are column-oriented.
- Each table or dataset must be given a short, concise name.
- For widespread reuse, work should be done in English if possible.
Typically, each table contains the following standard column names:
#
Formal criterias for datasets
The following formal criteria for tabular data especially submitted as Excel files must be met:
#
Table criteria
#
Column criteria
#
Cell criteria
#
Preferred file formats
In case of tabular data, the prepared dataset should be submitted in the file formats:
- Textfile (*.txt)
- Comma separated value file (*.csv)
- Excel sheet (*.xls or *.xslx).
Avoid file formats that cannot be read with common programs. For example, formats for special company software for data loggers. In addition to tables, the BonaRes Repository also publishes all formats of research data that are common in science, such as:
- pictures,
- videos,
- texts.
The BonaRes Repository also is able to deal with complex file structures, e.g.:
- MS Access
- SQL
- Shape files
- ...
In this case contact the support team of BonaRes Repository.