Buckets:

rtrm's picture
|
download
raw
9.35 kB
# Utilities
## Configure logging[[datasets.utils.logging.get_verbosity]]
🤗 Datasets strives to be transparent and explicit about how it works, but this can be quite verbose at times. We have included a series of logging methods which allow you to easily adjust the level of verbosity of the entire library. Currently the default verbosity of the library is set to `WARNING`.
To change the level of verbosity, use one of the direct setters. For instance, here is how to change the verbosity to the `INFO` level:
```py
import datasets
datasets.logging.set_verbosity_info()
```
You can also use the environment variable `DATASETS_VERBOSITY` to override the default verbosity, and set it to one of the following: `debug`, `info`, `warning`, `error`, `critical`:
```bash
DATASETS_VERBOSITY=error ./myprogram.py
```
All the methods of this logging module are documented below. The main ones are:
- [logging.get_verbosity()](/docs/datasets/pr_7835/en/package_reference/utilities#datasets.utils.logging.get_verbosity) to get the current level of verbosity in the logger
- [logging.set_verbosity()](/docs/datasets/pr_7835/en/package_reference/utilities#datasets.utils.logging.set_verbosity) to set the verbosity to the level of your choice
In order from the least to the most verbose (with their corresponding `int` values):
1. `logging.CRITICAL` or `logging.FATAL` (int value, 50): only report the most critical errors.
2. `logging.ERROR` (int value, 40): only report errors.
3. `logging.WARNING` or `logging.WARN` (int value, 30): only reports error and warnings. This the default level used by the library.
4. `logging.INFO` (int value, 20): reports error, warnings and basic information.
5. `logging.DEBUG` (int value, 10): report all information.
<div class="docstring border-l-2 border-t-2 pl-4 pt-3.5 border-gray-100 rounded-tl-xl mb-6 mt-8">
<docstring><name>datasets.utils.logging.get_verbosity</name><anchor>datasets.utils.logging.get_verbosity</anchor><source>https://github.com/huggingface/datasets/blob/r_7835/src/datasets/utils/logging.py#L94</source><parameters>[]</parameters><retdesc>Logging level, e.g., `datasets.logging.DEBUG` and `datasets.logging.INFO`.</retdesc></docstring>
Return the current level for the HuggingFace datasets library's root logger.
> [!TIP]
> HuggingFace datasets library has following logging levels:
> - `datasets.logging.CRITICAL`, `datasets.logging.FATAL`
> - `datasets.logging.ERROR`
> - `datasets.logging.WARNING`, `datasets.logging.WARN`
> - `datasets.logging.INFO`
> - `datasets.logging.DEBUG`
</div>
<div class="docstring border-l-2 border-t-2 pl-4 pt-3.5 border-gray-100 rounded-tl-xl mb-6 mt-8">
<docstring><name>datasets.utils.logging.set_verbosity</name><anchor>datasets.utils.logging.set_verbosity</anchor><source>https://github.com/huggingface/datasets/blob/r_7835/src/datasets/utils/logging.py#L110</source><parameters>[{"name": "verbosity", "val": ": int"}]</parameters><paramsdesc>- **verbosity** --
Logging level, e.g., `datasets.logging.DEBUG` and `datasets.logging.INFO`.</paramsdesc><paramgroups>0</paramgroups></docstring>
Set the level for the Hugging Face Datasets library's root logger.
</div>
<div class="docstring border-l-2 border-t-2 pl-4 pt-3.5 border-gray-100 rounded-tl-xl mb-6 mt-8">
<docstring><name>datasets.utils.logging.set_verbosity_info</name><anchor>datasets.utils.logging.set_verbosity_info</anchor><source>https://github.com/huggingface/datasets/blob/r_7835/src/datasets/utils/logging.py#L119</source><parameters>[]</parameters></docstring>
Set the level for the Hugging Face datasets library's root logger to `INFO`.
This will display most of the logging information and tqdm bars.
Shortcut to `datasets.logging.set_verbosity(datasets.logging.INFO)`.
</div>
<div class="docstring border-l-2 border-t-2 pl-4 pt-3.5 border-gray-100 rounded-tl-xl mb-6 mt-8">
<docstring><name>datasets.utils.logging.set_verbosity_warning</name><anchor>datasets.utils.logging.set_verbosity_warning</anchor><source>https://github.com/huggingface/datasets/blob/r_7835/src/datasets/utils/logging.py#L129</source><parameters>[]</parameters></docstring>
Set the level for the Hugging Face datasets library's root logger to `WARNING`.
This will display only the warning and errors logging information and tqdm bars.
Shortcut to `datasets.logging.set_verbosity(datasets.logging.WARNING)`.
</div>
<div class="docstring border-l-2 border-t-2 pl-4 pt-3.5 border-gray-100 rounded-tl-xl mb-6 mt-8">
<docstring><name>datasets.utils.logging.set_verbosity_debug</name><anchor>datasets.utils.logging.set_verbosity_debug</anchor><source>https://github.com/huggingface/datasets/blob/r_7835/src/datasets/utils/logging.py#L139</source><parameters>[]</parameters></docstring>
Set the level for the Hugging Face datasets library's root logger to `DEBUG`.
This will display all the logging information and tqdm bars.
Shortcut to `datasets.logging.set_verbosity(datasets.logging.DEBUG)`.
</div>
<div class="docstring border-l-2 border-t-2 pl-4 pt-3.5 border-gray-100 rounded-tl-xl mb-6 mt-8">
<docstring><name>datasets.utils.logging.set_verbosity_error</name><anchor>datasets.utils.logging.set_verbosity_error</anchor><source>https://github.com/huggingface/datasets/blob/r_7835/src/datasets/utils/logging.py#L149</source><parameters>[]</parameters></docstring>
Set the level for the Hugging Face datasets library's root logger to `ERROR`.
This will display only the errors logging information and tqdm bars.
Shortcut to `datasets.logging.set_verbosity(datasets.logging.ERROR)`.
</div>
<div class="docstring border-l-2 border-t-2 pl-4 pt-3.5 border-gray-100 rounded-tl-xl mb-6 mt-8">
<docstring><name>datasets.utils.logging.disable_propagation</name><anchor>datasets.utils.logging.disable_propagation</anchor><source>https://github.com/huggingface/datasets/blob/r_7835/src/datasets/utils/logging.py#L159</source><parameters>[]</parameters></docstring>
Disable propagation of the library log outputs.
Note that log propagation is disabled by default.
</div>
<div class="docstring border-l-2 border-t-2 pl-4 pt-3.5 border-gray-100 rounded-tl-xl mb-6 mt-8">
<docstring><name>datasets.utils.logging.enable_propagation</name><anchor>datasets.utils.logging.enable_propagation</anchor><source>https://github.com/huggingface/datasets/blob/r_7835/src/datasets/utils/logging.py#L166</source><parameters>[]</parameters></docstring>
Enable propagation of the library log outputs.
Please disable the Hugging Face datasets library's default handler to prevent double logging if the root logger has
been configured.
</div>
## Configure progress bars[[datasets.enable_progress_bars]]
By default, `tqdm` progress bars will be displayed during dataset download and preprocessing. You can disable them globally by setting `HF_DATASETS_DISABLE_PROGRESS_BARS`
environment variable. You can also enable/disable them using [enable_progress_bars()](/docs/datasets/pr_7835/en/package_reference/utilities#datasets.enable_progress_bars) and [disable_progress_bars()](/docs/datasets/pr_7835/en/package_reference/utilities#datasets.disable_progress_bars). If set, the environment variable has priority on the helpers.
<div class="docstring border-l-2 border-t-2 pl-4 pt-3.5 border-gray-100 rounded-tl-xl mb-6 mt-8">
<docstring><name>datasets.enable_progress_bars</name><anchor>datasets.enable_progress_bars</anchor><source>https://github.com/huggingface/datasets/blob/r_7835/src/datasets/utils/tqdm.py#L77</source><parameters>[]</parameters></docstring>
Enable globally progress bars used in `datasets` except if `HF_DATASETS_DISABLE_PROGRESS_BAR` environment
variable has been set.
Use [disable_progress_bars()](/docs/datasets/pr_7835/en/package_reference/utilities#datasets.disable_progress_bars) to disable them.
</div>
<div class="docstring border-l-2 border-t-2 pl-4 pt-3.5 border-gray-100 rounded-tl-xl mb-6 mt-8">
<docstring><name>datasets.disable_progress_bars</name><anchor>datasets.disable_progress_bars</anchor><source>https://github.com/huggingface/datasets/blob/r_7835/src/datasets/utils/tqdm.py#L60</source><parameters>[]</parameters></docstring>
Disable globally progress bars used in `datasets` except if `HF_DATASETS_DISABLE_PROGRESS_BAR` environment
variable has been set.
Use [enable_progress_bars()](/docs/datasets/pr_7835/en/package_reference/utilities#datasets.enable_progress_bars) to re-enable them.
</div>
<div class="docstring border-l-2 border-t-2 pl-4 pt-3.5 border-gray-100 rounded-tl-xl mb-6 mt-8">
<docstring><name>datasets.are_progress_bars_disabled</name><anchor>datasets.are_progress_bars_disabled</anchor><source>https://github.com/huggingface/datasets/blob/r_7835/src/datasets/utils/tqdm.py#L94</source><parameters>[]</parameters></docstring>
Return whether progress bars are globally disabled or not.
Progress bars used in `datasets` can be enable or disabled globally using [enable_progress_bars()](/docs/datasets/pr_7835/en/package_reference/utilities#datasets.enable_progress_bars)
and [disable_progress_bars()](/docs/datasets/pr_7835/en/package_reference/utilities#datasets.disable_progress_bars) or by setting `HF_DATASETS_DISABLE_PROGRESS_BAR` as environment variable.
</div>
<EditOnGithub source="https://github.com/huggingface/datasets/blob/main/docs/source/package_reference/utilities.mdx" />

Xet Storage Details

Size:
9.35 kB
·
Xet hash:
2041c66506f3b8d5208c17e5d16eecc99450f7120a6b953bae758bc94e6c7295

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.