I would like to make Hyrax serve tabular data, and I am exploring the best practices for that.
Context
I have some (in situ) sensor data, at the same location, and spans several months. The variables are Temperature (originally Fahrenheit, although I will convert to Celsius), Humidity, Particular Matter 1, 2.5 and 10. And so I would like to run Hyrax and make these available.
The sensor data is stored as csv, and looks like this:
As you can see from the screenshot, the first 7 rows of the csv file contain some metadata, and the values actually begin in row 8 and on. The environmental variables are located:
Temperature
: Column 6.PM1
,PM10
andPM2.5
: Columns 7, 8, and 9, respectively.Humidity
: Column 10.
And so, this dataset needs some cleaning before it is ready for Hyrax (removing empty rows, declaring data types for each column). I am using Pandas for cleaning, and prepping for Hyrax.
Question
Is there a way to prepare my csv file so that:
- There are Global attributes? For example, latitude, longitude, who created it, how to cite, etc.
- Each variables has attributes, for example
units
,long name
, etc?
These are features of more complex (self-describing) file formats. But these types of attributes are long-supported by OPeNDAP data servers, and I am wondering is such a thing is possible.