python/FunASR-XL.git

python / FunASR-XL

FUNASR训练

概况
操作记录
提交次数
目录
文档
派生
对比

parent: aab1fcd6 | 补丁 | 提交 | ignore whitespace

Merge pull request #1161 from alibaba-damo-academy/dev_lhn

hnluo

2023-12-08 202ab8a2c9e2af5c147faf080f96e97abbb7be42

Merge pull request #1161 from alibaba-damo-academy/dev_lhn

fix loss normalization for ddp training

1个文件已修改

funasr/models/e2e_uni_asr.py

2 ●●●●● 补丁 | 查看 | 原始文档 | blame | 历史

 funasr/models/e2e_uni_asr.py

@@ -442,7 +442,7 @@
        stats["loss"] = torch.clone(loss.detach())
        # force_gatherable: to-device and to-tensor if scalar for DataParallel
        if self.length_normalized_loss:
            batch_size = (text_lengths + 1).sum().type_as(batch_size)
            batch_size = int((text_lengths + 1).sum())
        loss, stats, weight = force_gatherable((loss, stats, batch_size), loss.device)
        return loss, stats, weight

			@@ -442,7 +442,7 @@
			stats["loss"] = torch.clone(loss.detach())
			# force_gatherable: to-device and to-tensor if scalar for DataParallel
			if self.length_normalized_loss:
			batch_size = (text_lengths + 1).sum().type_as(batch_size)
			batch_size = int((text_lengths + 1).sum())
			loss, stats, weight = force_gatherable((loss, stats, batch_size), loss.device)
			return loss, stats, weight