new_join_dataset#
- geodesic.boson.dataset.new_join_dataset(name, left_dataset, right_dataset, left_field=None, right_field=None, spatial_join=False, left_drop_fields=[], right_drop_fields=[], left_suffix='_left', right_suffix='_right', use_geometry='right', skip_initialize=False, feature_limit=1000, max_left_page_queries=10, right_collection=None, left_collection=None, project=None, middleware={}, cache={}, tile_options={}, domain='*', category='*', type='*', **kwargs)[source]#
Creates a left join of two feature datasets on the values of specific keys.
Currently this is intended for smaller datasets or used in conjuction with the view provider to limit the scope of the join.
- Parameters:
name (str) – the name of the new
Datasetleft_dataset (Dataset) – the left dataset to join
left_field (str | None) – the field in the left dataset to join on
right_dataset (Dataset) – the right dataset to join
right_field (str | None) – the field in the right dataset to join on
spatial_join (bool) – if True, will perform a spatial join
left_drop_fields (List[str]) – fields to drop from the left dataset
right_drop_fields (List[str]) – fields to drop from the right dataset
left_suffix (str) – the suffix to add to the left dataset fields
right_suffix (str) – the suffix to add to the right dataset fields
use_geometry (str) – which geometry to use, either ‘left’ or ‘right’
skip_initialize (bool) – if True, will not initialize the right provider. This is necessary if the right provider is particularly large - all joins will then be dynamic.
feature_limit (int) – the max size of a results page from a query/search
max_left_page_queries (int) – the max number of queries a single join request will make to the left provider. The default is 10. This limit is in place to prevent inefficient join requests. Before adjusting this, consider increasing the max page size of the left provider.
right_collection (str | None) – if the right dataset has multiple collections, the name of the collection to use.
left_collection (str | None) – if the left dataset has multiple collections, the name of the collection to use.
project (Project | str | None) – the name of the project this will be assigned to
middleware (MiddlewareConfig) – configure any boson middleware to be applied to the new dataset.
cache (CacheConfig) – configure caching for this dataset
tile_options (TileOptions) – configure tile options for this dataset
domain (str) – the domain of the dataset
category (str) – the category of the dataset
type (str) – the type of the dataset
**kwargs (dict) – additional properties to set on the new dataset