[FR] Out-of-memory data reduction. #48
Labels
easy: Solving this has easy to medium difficulty!
feature request: A new feature we would like to have!
hard: This is definitely difficult to solve!
While the in-memory functionality is great, it is often the case that you have so much data that it doesn't fit in memory. Typically such data are saved in monthly or yearly files, where each file contains, for example, one year of all the data.
This is good for us, because at the moment it isn't hard to write a simple `for`-loop over your code. However, we can streamline many things. For example, the output `ClimArray` can be pre-initialized and efficiently aggregated over, similarly to how `yearlyagg` works now.

So in principle there are two ways to do out-of-memory data reduction: `yearlyagg` and looping over the files.

The above is in my eyes easy, provided that the required issues are solved first.
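To make the "pre-initialize the output and loop over the files" idea concrete, here is a minimal language-agnostic sketch in Python. It is not the package's API: `reduce_over_files`, `time_mean`, the `load` callback and the file names are all hypothetical, and each file is assumed to hold a time × space table that fits in memory on its own.

```python
from typing import Callable, List, Sequence

# One file's worth of data: rows are time steps, columns are spatial points.
Chunk = List[List[float]]


def time_mean(chunk: Chunk) -> List[float]:
    """Average a chunk over its time dimension (its rows)."""
    ntime = len(chunk)
    return [sum(column) / ntime for column in zip(*chunk)]


def reduce_over_files(
    paths: Sequence[str],
    load: Callable[[str], Chunk],  # hypothetical loader, supplied by the caller
    nspace: int,
) -> List[List[float]]:
    """Reduce each file to one time-mean row, loading one file at a time."""
    # Pre-initialize the output: one row per file (e.g. one row per year).
    out = [[0.0] * nspace for _ in paths]
    for i, path in enumerate(paths):
        chunk = load(path)             # only this file's data is held here
        out[i][:] = time_mean(chunk)   # fill the pre-allocated row in place
    return out


# Usage with an in-memory stand-in for two yearly files (hypothetical names).
fake_files = {
    "1990.nc": [[1.0, 2.0], [3.0, 4.0]],
    "1991.nc": [[5.0, 6.0], [7.0, 8.0]],
}
result = reduce_over_files(list(fake_files), fake_files.__getitem__, nspace=2)
print(result)  # [[2.0, 3.0], [6.0, 7.0]]
```

Because the output is allocated once and each iteration touches only its own row, peak memory is roughly one file plus the reduced output, regardless of how many files there are.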
The hard part is also getting automatic parallelization to work here.