datasets#
Use datasets to curate spans from your LLM applications for testing. Read our quickstart guide.
To use in your code, import the following:
from arize.experimental.datasets import ArizeDatasetsClient
client#
- class ArizeDatasetsClient(developer_key, api_key, host='flight.arize.com', port=443, scheme='grpc+tls', otlp_endpoint='https://otlp.arize.com/v1')#
Bases:
object
ArizeDatasetsClient is a client for interacting with the Arize Datasets API.
- Parameters:
developer_key (str, required) – Arize provided developer key associated with your user profile, located on the space settings page.
api_key (str, required) – Arize provided API key associated with your user profile, located on the space settings page.
host (str, optional) – URI endpoint host to send your export request to Arize AI. Defaults to “{DEFAULT_ARIZE_FLIGHT_HOST}”.
port (int, optional) – URI endpoint port to send your export request to Arize AI. Defaults to {DEFAULT_ARIZE_FLIGHT_PORT}.
scheme (str, optional) – Transport scheme to use for the connection. Defaults to “{DEFAULT_TRANSPORT_SCHEME}”.
otlp_endpoint (str, optional) – OTLP endpoint to send experiment traces to Arize. Defaults to “{DEFAULT_ARIZE_OTLP_ENDPOINT}”.
- create_dataset(space_id, dataset_name, dataset_type, data, convert_dict_to_json=True)#
Create a new dataset.
- Parameters:
space_id (str) – The ID of the space where the dataset will be created.
dataset_name (str) – The name of the dataset.
dataset_type (DatasetType) – The type of the dataset.
data (pd.DataFrame) – The data to be included in the dataset.
convert_dict_to_json (bool, optional) – Convert dictionary columns to JSON strings for default JSON str columns per Open Inference. Defaults to True.
- Returns:
The ID of the created dataset, or None if the creation failed.
- Return type:
str
- delete_dataset(space_id, dataset_id)#
Delete a dataset.
- Parameters:
space_id (str) – The ID of the space where the dataset is located.
dataset_id (str) – The ID of the dataset to delete.
- Returns:
True if the dataset was successfully deleted, False otherwise.
- Return type:
bool
- get_dataset(space_id, dataset_id=None, dataset_name=None, dataset_version=None, convert_json_str_to_dict=True)#
Get the data of a dataset.
- Parameters:
space_id (str) – The ID of the space where the dataset is located.
dataset_id (str) – Dataset id. Required if dataset_name is not provided.
dataset_name (str) – Dataset name. Required if dataset_id is not provided.
dataset_version (str, optional) – version name of the dataset Defaults to “” and gets the latest version by based on creation time
convert_json_str_to_dict (bool, optional) – Convert JSON strings to Python dictionaries. For default JSON str columns per Open Inference. Defaults to True.
- Returns:
The data of the dataset.
- Return type:
pd.DataFrame
- get_dataset_versions(space_id, dataset_id)#
Get the versions of a dataset.
- Parameters:
space_id (str) – The ID of the space where the dataset is located.
dataset_id (str) – The ID of the dataset to get versions info for.
- Returns:
A list of dictionaries representing the versions of the dataset.
- Return type:
List[Dict[str, Any]]
- get_experiment(space_id, experiment_name=None, dataset_name=None, experiment_id=None)#
Retrieve experiment data from Arize.
- Parameters:
space_id (str) – The ID of the space where the experiment is located.
experiment_name (Optional[str]) – The name of the experiment. Required if experiment_id is not provided.
dataset_name (Optional[str]) – The name of the dataset associated with the experiment. Required if experiment_id is not provided.
experiment_id (Optional[str]) – The ID of the experiment. Required if experiment_name and dataset_name are not provided.
- Returns:
- A pandas DataFrame containing the experiment data,
or None if the retrieval fails.
- Return type:
Optional[pd.DataFrame]
- Raises:
ValueError – If neither experiment_id nor both experiment_name and dataset_name are provided.
RuntimeError – If the experiment retrieval fails.
Note
You must provide either the experiment_id or both the experiment_name and dataset_name.
- list_datasets(space_id)#
List all datasets in a space.
- Parameters:
space_id (str) – The ID of the space to list datasets for.
- Returns:
A list of dictionaries representing the datasets in the space.
- Return type:
List[Dict[str, Any]]
- run_experiment(space_id, experiment_name, task, dataset_df=None, dataset_id=None, dataset_name=None, evaluators=None, dry_run=False, concurrency=3, set_global_tracer_provider=False, exit_on_error=False)#
Run an experiment on a dataset and upload the results.
This function initializes an experiment, retrieves or uses a provided dataset, runs the experiment with specified tasks and evaluators, and uploads the results.
- Parameters:
space_id (str) – The ID of the space where the experiment will be run.
experiment_name (str) – The name of the experiment.
task (ExperimentTask) – The task to be performed in the experiment.
dataset_df (Optional[pd.DataFrame], optional) – The dataset as a pandas DataFrame. If not provided, the dataset will be downloaded using dataset_id or dataset_name. Defaults to None.
dataset_id (Optional[str], optional) – The ID of the dataset to use. Required if dataset_df and dataset_name are not provided. Defaults to None.
dataset_name (Optional[str], optional) – The name of the dataset to use. Used if dataset_df and dataset_id are not provided. Defaults to None.
evaluators (Optional[Evaluators], optional) – The evaluators to use in the experiment. Defaults to None.
dry_run (bool) – If True, the experiment result will not be uploaded to Arize. Defaults to False.
concurrency (int) – The number of concurrent tasks to run. Defaults to 3.
set_global_tracer_provider (bool) – If True, sets the global tracer provider for the experiment. Defaults to False.
exit_on_error (bool) – If True, the experiment will stop running on first occurrence of an error.
- Returns:
A tuple of experiment ID and experiment result DataFrame. If dry_run is True, the experiment ID will be an empty string.
- Return type:
Tuple[str, pd.DataFrame]
- Raises:
ValueError – If dataset_id and dataset_name are both not provided, or if the dataset is empty.
RuntimeError – If experiment initialization, dataset download, or result upload fails.
- update_dataset(space_id, dataset_id, data)#
Update an existing dataset by creating a new version.
- Parameters:
space_id (str) – The ID of the space where the dataset is located.
dataset_id (str) – The ID of the dataset to update.
data (pd.DataFrame) – The updated data to be included in the dataset.
- Returns:
The ID of the updated dataset, or None if the update failed.
- Return type:
str