API reference
Contents
API reference#
This page provides an auto-generated summary of Xbatcher’s API.
Dataset.batch and DataArray.batch#
|
Return a BatchGenerator via the batch accessor |
|
Return a BatchGenerator via the batch accessor |
Core#
- class xbatcher.BatchGenerator(ds: xarray.core.dataset.Dataset, input_dims: Dict[Hashable, int], input_overlap: Dict[Hashable, int] = {}, batch_dims: Dict[Hashable, int] = {}, concat_input_dims: bool = False, preload_batch: bool = True)[source]#
Create generator for iterating through xarray datarrays / datasets in batches.
- Parameters
- ds
xarray.Dataset
orxarray.DataArray
The data to iterate over
- input_dimsdict
A dictionary specifying the size of the inputs in each dimension, e.g.
{'lat': 30, 'lon': 30}
These are the dimensions the ML library will see. All other dimensions will be stacked into one dimension calledsample
.- input_overlapdict, optional
A dictionary specifying the overlap along each dimension e.g.
{'lat': 3, 'lon': 3}
- batch_dimsdict, optional
A dictionary specifying the size of the batch along each dimension e.g.
{'time': 10}
. These will always be iterated over.- concat_input_dimsbool, optional
If
True
, the dimension chunks specified ininput_dims
will be concatenated and stacked into thesample
dimension. The batch index will be included as a new levelinput_batch
in thesample
coordinate. IfFalse
, the dimension chunks specified ininput_dims
will be iterated over.- preload_batchbool, optional
If
True
, each batch will be loaded into memory before reshaping / processing, triggering any dask arrays to be computed.
- ds
- Yields
- ds_slice
xarray.Dataset
orxarray.DataArray
Slices of the array matching the given batch size specification.
- ds_slice
Dataloaders#
- class xbatcher.loaders.torch.MapDataset(*args: Any, **kwargs: Any)[source]#
- __init__(X_generator, y_generator, transform: Optional[Callable] = None, target_transform: Optional[Callable] = None) None [source]#
PyTorch Dataset adapter for Xbatcher
- Parameters
- X_generatorxbatcher.BatchGenerator
- y_generatorxbatcher.BatchGenerator
- transformcallable, optional
A function/transform that takes in an array and returns a transformed version.
- target_transformcallable, optional
A function/transform that takes in the target and transforms it.
- class xbatcher.loaders.keras.CustomTFDataset(*args: Any, **kwargs: Any)[source]#
- __init__(X_generator, y_generator, *, transform: Optional[Callable] = None, target_transform: Optional[Callable] = None) None [source]#
Keras Dataset adapter for Xbatcher
- Parameters
- X_generatorxbatcher.BatchGenerator
- y_generatorxbatcher.BatchGenerator
- transformcallable, optional
A function/transform that takes in an array and returns a transformed version.
- target_transformcallable, optional
A function/transform that takes in the target and transforms it.