lsdb.catalog.dataset.dataset
Classes
Dataset | Base HATS Dataset
Module Contents
- class Dataset(ddf: lsdb.nested.NestedFrame, hc_structure: hats.catalog.Dataset)
Base HATS Dataset
- compute() → nested_pandas.NestedFrame
Compute the Dask distributed dataframe into an in-memory pandas dataframe (a nested_pandas.NestedFrame).
- to_delayed(optimize_graph: bool = True) → list[dask.delayed.Delayed]
Get a list of Dask Delayed objects, one for each partition in the dataset.
This is intended for more advanced custom operations. To use the results with LSDB again, the delayed objects must be converted to a Dask DataFrame and combined with extra metadata to construct an LSDB Dataset.
- Parameters:
optimize_graph (bool) – If True (the default), the graph is optimized before being converted into dask.delayed objects.