Buckets:

hf-doc-build/doc / transformers /main /ja /internal /modeling_utils.md
|
download
raw
2.93 kB
# カスタムレイヤーとユーティリティ
このページには、ライブラリで使用されるすべてのカスタム レイヤーと、モデリングに提供されるユーティリティ関数がリストされます。
これらのほとんどは、ライブラリ内のモデルのコードを研究する場合にのみ役に立ちます。
## Pytorch custom modules[[transformers.Conv1D]]
#### transformers.Conv1D[[transformers.Conv1D]]
[Source](https://github.com/huggingface/transformers/blob/main/src/transformers/pytorch_utils.py#L97)
1D-convolutional layer as defined by Radford et al. for OpenAI GPT (and also used in GPT-2).
Basically works like a linear layer but the weights are transposed.
**Parameters:**
nf (`int`) : The number of output features.
nx (`int`) : The number of input features.
## PyTorch Helper Functions[[transformers.apply_chunking_to_forward]]
#### transformers.apply_chunking_to_forward[[transformers.apply_chunking_to_forward]]
[Source](https://github.com/huggingface/transformers/blob/main/src/transformers/pytorch_utils.py#L126)
This function chunks the `input_tensors` into smaller input tensor parts of size `chunk_size` over the dimension
`chunk_dim`. It then applies a layer `forward_fn` to each chunk independently to save memory.
If the `forward_fn` is independent across the `chunk_dim` this function will yield the same result as directly
applying `forward_fn` to `input_tensors`.
Examples:
```python
# rename the usual forward() fn to forward_chunk()
def forward_chunk(self, hidden_states):
hidden_states = self.decoder(hidden_states)
return hidden_states
# implement a chunked forward function
def forward(self, hidden_states):
return apply_chunking_to_forward(self.forward_chunk, self.chunk_size_lm_head, self.seq_len_dim, hidden_states)
```
**Parameters:**
forward_fn (`Callable[..., torch.Tensor]`) : The forward function of the model.
chunk_size (`int`) : The chunk size of a chunked tensor: `num_chunks = len(input_tensors[0]) / chunk_size`.
chunk_dim (`int`) : The dimension over which the `input_tensors` should be chunked.
input_tensors (`tuple[torch.Tensor]`) : The input tensors of `forward_fn` which will be chunked
**Returns:**
``torch.Tensor``
A tensor with the same shape as the `forward_fn` would have given if applied`.
#### transformers.pytorch_utils.prune_linear_layer[[transformers.pytorch_utils.prune_linear_layer]]
[Source](https://github.com/huggingface/transformers/blob/main/src/transformers/pytorch_utils.py#L63)
Prune a linear layer to keep only entries in index.
Used to remove heads.
**Parameters:**
layer (`torch.nn.Linear`) : The layer to prune.
index (`torch.LongTensor`) : The indices to keep in the layer.
dim (`int`, *optional*, defaults to 0) : The dimension on which to keep the indices.
**Returns:**
``torch.nn.Linear``
The pruned layer as a new layer with `requires_grad=True`.

Xet Storage Details

Size:
2.93 kB
·
Xet hash:
e2d03f88a1c39818a8f3b7463f31dee0376a08110a087e05893751299de72411

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.