# HuggingFaceDatasetLoader

> **Class** in `langchain_community`

📖 [View in docs](https://reference.langchain.com/python/langchain-community/document_loaders/hugging_face_dataset/HuggingFaceDatasetLoader)

Loads documents from `Hugging Face Hub` datasets.

## Signature

```python
HuggingFaceDatasetLoader(
    path: str,
    page_content_column: str = 'text',
    name: Optional[str] = None,
    data_dir: Optional[str] = None,
    data_files: Optional[Union[str, Sequence[str], Mapping[str, Union[str, Sequence[str]]]]] = None,
    cache_dir: Optional[str] = None,
    keep_in_memory: Optional[bool] = None,
    save_infos: bool = False,
    use_auth_token: Optional[Union[bool, str]] = None,
    num_proc: Optional[int] = None,
)
```

## Parameters

| Name | Type | Required | Description |
|------|------|----------|-------------|
| `path` | `str` | Yes | Path or name of the dataset. |
| `page_content_column` | `str` | No | Name of the column to use as page content. (default: `'text'`) |
| `name` | `Optional[str]` | No | Name of the dataset configuration. (default: `None`) |
| `data_dir` | `Optional[str]` | No | Data directory of the dataset configuration. (default: `None`) |
| `data_files` | `Optional[Union[str, Sequence[str], Mapping[str, Union[str, Sequence[str]]]]]` | No | Path(s) to source data file(s). (default: `None`) |
| `cache_dir` | `Optional[str]` | No | Directory to read/write data. (default: `None`) |
| `keep_in_memory` | `Optional[bool]` | No | Whether to copy the dataset into memory. (default: `None`) |
| `save_infos` | `bool` | No | Whether to save the dataset information (checksums/size/splits/...). (default: `False`) |
| `use_auth_token` | `Optional[Union[bool, str]]` | No | Bearer token for remote files on the Dataset Hub. (default: `None`) |
| `num_proc` | `Optional[int]` | No | Number of processes. (default: `None`) |
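
The parameters above could be passed as in the following minimal sketch. It assumes the `langchain_community` and `datasets` packages are installed. The dataset name `"imdb"` and the helpers `build_loader` and `show_first_document` are illustrative choices, not part of this reference.

```python
from typing import Any, Dict

# Illustrative arguments (assumption): "imdb" is an example dataset name,
# and "text" is assumed to be the column that holds its content.
loader_kwargs: Dict[str, Any] = {
    "path": "imdb",
    "page_content_column": "text",
}


def build_loader(**kwargs: Any):
    """Hypothetical helper: construct the loader from keyword arguments."""
    # Imported lazily so this snippet can be imported even when
    # langchain_community is not installed.
    from langchain_community.document_loaders import HuggingFaceDatasetLoader

    return HuggingFaceDatasetLoader(**kwargs)


def show_first_document(kwargs: Dict[str, Any]) -> None:
    """Hypothetical helper: print the first document the loader yields."""
    loader = build_loader(**kwargs)
    # lazy_load() yields documents one at a time; stop after the first.
    for doc in loader.lazy_load():
        print(doc.page_content[:80])
        print(doc.metadata)
        break
```

Calling `show_first_document(loader_kwargs)` needs the dataset to be available, so it may trigger a download on first use.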

## Extends

- `BaseLoader`

## Constructors

```python
__init__(
    self,
    path: str,
    page_content_column: str = 'text',
    name: Optional[str] = None,
    data_dir: Optional[str] = None,
    data_files: Optional[Union[str, Sequence[str], Mapping[str, Union[str, Sequence[str]]]]] = None,
    cache_dir: Optional[str] = None,
    keep_in_memory: Optional[bool] = None,
    save_infos: bool = False,
    use_auth_token: Optional[Union[bool, str]] = None,
    num_proc: Optional[int] = None,
)
```

| Name | Type |
|------|------|
| `path` | `str` |
| `page_content_column` | `str` |
| `name` | `Optional[str]` |
| `data_dir` | `Optional[str]` |
| `data_files` | `Optional[Union[str, Sequence[str], Mapping[str, Union[str, Sequence[str]]]]]` |
| `cache_dir` | `Optional[str]` |
| `keep_in_memory` | `Optional[bool]` |
| `save_infos` | `bool` |
| `use_auth_token` | `Optional[Union[bool, str]]` |
| `num_proc` | `Optional[int]` |


## Properties

- `path`
- `page_content_column`
- `name`
- `data_dir`
- `data_files`
- `cache_dir`
- `keep_in_memory`
- `save_infos`
- `use_auth_token`
- `num_proc`

## Methods

- [`lazy_load()`](https://reference.langchain.com/python/langchain-community/document_loaders/hugging_face_dataset/HuggingFaceDatasetLoader/lazy_load)
- [`parse_obj()`](https://reference.langchain.com/python/langchain-community/document_loaders/hugging_face_dataset/HuggingFaceDatasetLoader/parse_obj)
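
Since `lazy_load()` produces documents through an iterator, a caller can take a small sample without reading the whole dataset. The sketch below shows that pattern with a stand-in generator; `take` and `fake_lazy_load` are hypothetical names used only for illustration, and the stand-in yields plain strings rather than real documents.

```python
from itertools import islice
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def take(items: Iterable[T], n: int) -> List[T]:
    """Return at most n items without consuming the rest of the iterable."""
    return list(islice(items, n))


def fake_lazy_load() -> Iterator[str]:
    # Stand-in for loader.lazy_load() (assumption): yields placeholder
    # strings instead of documents so the example runs offline.
    for i in range(1000):
        yield f"document {i}"


# Only the first three items are ever generated.
first_three = take(fake_lazy_load(), 3)
print(first_three)
```

With a real loader, the same helper could be called as `take(loader.lazy_load(), 3)`.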

---

[View source on GitHub](https://github.com/langchain-ai/langchain-community/blob/a6a6079511ac8a5c1293337f88096b8641562e77/libs/community/langchain_community/document_loaders/hugging_face_dataset.py#L9)