bohatey
/

DiCoW_v3_2_SF

Automatic Speech Recognition

speaker-diarization

meeting-transcription

target-speaker-asr

DiCoW_v3_2_SF / utils.py

Upload DiCoWForConditionalGeneration

d7e9d80 verified 17 days ago

import torch
from transformers import WhisperTimeStampLogitsProcessor


class WhisperTimeStampLogitsProcessorCustom(WhisperTimeStampLogitsProcessor):
    """Timestamp logits processor that still allows EOS as the first generated token."""

    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> torch.FloatTensor:
        scores_processed = super().__call__(input_ids, scores)

        # At the first decoding step the parent processor suppresses non-timestamp
        # tokens, EOS included. Restore the original EOS score so the model can
        # exit early on silence.
        if input_ids.shape[1] == self.begin_index:
            scores_processed[:, self.eos_token_id] = scores[:, self.eos_token_id]

        return scores_processed
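
The effect of the override can be illustrated without `torch` or `transformers`. The following is a toy sketch only: `mask_first_step` and `restore_eos` are hypothetical helpers, the 6-token vocabulary is invented, and the masking rule is a simplified stand-in for what the parent processor is assumed to do at the first decoding step (allow only timestamp tokens).

```python
import math

NEG_INF = -math.inf


def mask_first_step(scores, timestamp_begin):
    """Toy stand-in for the parent's first-step rule: only timestamp tokens stay allowed."""
    return [NEG_INF if i < timestamp_begin else s for i, s in enumerate(scores)]


def restore_eos(original, processed, eos_token_id):
    """Copy the unmasked EOS score back into the processed scores."""
    restored = list(processed)
    restored[eos_token_id] = original[eos_token_id]
    return restored


# Hypothetical vocabulary: ids 0-3 are text tokens (id 2 plays the role of EOS),
# ids 4-5 are timestamp tokens.
scores = [0.1, 0.3, 2.5, 0.2, 1.0, 0.4]
masked = mask_first_step(scores, timestamp_begin=4)
restored = restore_eos(scores, masked, eos_token_id=2)

# Greedy pick before and after restoring the EOS score.
best_masked = max(range(len(masked)), key=masked.__getitem__)
best_restored = max(range(len(restored)), key=restored.__getitem__)
print(best_masked, best_restored)
```

With masking alone, the greedy choice is forced onto a timestamp token. Once the EOS score is copied back, EOS can win whenever the model already preferred it, which is the early exit on silence described in the comment.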