transformers.generator_utils函数源码解析之RepetitionPenaltyLogitsProcessor - 代码天地

transformers.generator_utils函数源码解析之RepetitionPenaltyLogitsProcessor

企业开发 2023-08-01 19:58:10 阅读次数: 0

主要记录源码中解决文本生成中词组重复出现的问题，代码中有具体操作解析。

class RepetitionPenaltyLogitsProcessor(LogitsProcessor):
    r"""
    :class:`transformers.LogitsProcessor` enforcing an exponential penalty on repeated sequences.

    Args:
        repetition_penalty (:obj:`float`):
            The parameter for repetition penalty. 1.0 means no penalty. See `this paper
            <https://arxiv.org/pdf/1909.05858.pdf>`__ for more details.
    """

    def __init__(self, penalty: float):
        if not isinstance(penalty, float) or not (penalty > 0):
            raise ValueError(f"`penalty` has to be a strictly positive float, but is {penalty}")

        self.penalty = penalty

    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> torch.FloatTensor:
        #scores为cur-step的词表分布[batch,seq,vocab_size]，input_ids为输入decoder的文本序列[batch,seq]，则score则是获取当前已经生成文本序列的token概率
        score = torch.gather(scores, 1, input_ids) 

        # if score < 0 then repetition penalty has to be multiplied to reduce the previous token probability
        #减少已经出现的token的概率
        score = torch.where(score < 0, score * self.penalty, score / self.penalty) 
        
        #将减少后的概率重分配到原始的cur-step词表分布中
        scores.scatter_(1, input_ids, score) 
        return scores

猜你喜欢

转载自blog.csdn.net/yangyanbao8389/article/details/121651056

transformers.generator_utils函数源码解析之RepetitionPenaltyLogitsProcessor

transformers库源码解析:transformers/src/transformers/models/flaubert/modeling_flaubert.py

Transformers源码解析：transformers/src/transformers/models/llama/modeling_llama.py RotaryEmbedding

can-utils源码解析cansend

PyTorch 源码解读之 torch.utils.data：解析数据处理全流程

transformers库源码解析/src/transformers/tools/agents.py（二）from_pretrained()方法

JavaScript异步之generator函数

源码解析ChatGLM Efficient Tuning/src/utils/config.py

源码解析 ChatGLM Efficient Tuning utils/common.py

Transformers源码阅读——BertModel

PyTorch源码解读之torch.utils.data.DataLoader(转)

ES6之generator函数

ES6语法之 Generator 函数

js之generator函数简单学习/复习

ES6之Generator 函数

解析Excel文件 Utils

torchvision.utils的解析

ImportError: cannot import name ‘GenerationConfig‘ from ‘transformers.generation.utils‘

Generator函数

Generator 函数

js 常用的utils函数

源码解析之HashMap源码

源码解析之Hashtable

源码解析之mybatis

Tensorflow版Faster RCNN源码解析（TFFRCNN）（7） utils/blob.py

ChatGLM Efficient Tuning源码解析 src/utils/peft_trainer.py

ChatGLM Efficient Tuning源码解析src/utils/seq2seq.py (二)

ChatGLM Efficient Tuning源码解析 src/utils/peft_triner.py

ChatGLM Efficient Tuning源码解析 src/utils/common.py(二)load_pretrained

源码解析之AQS源码解析

今日推荐

NetBSD 禁止提交由 AI 生成的代码

Apache Doris 2.0.10 版本正式发布！

开源日报 | 大模型开战；大模型独角兽被曝卖身；周鸿祎建议谷歌开源所有产品；最大开源AI社区提供1000万美元共享GPU

开源日报 | Chrome内置Gemini的意义不在于Gemini；中国AI追随之路的五大误区；ECharts创始人“下海”养鱼；谷歌I/O开发者大会什么都有，只是没有惊喜

微软回应中国区AI团队“打包赴美”传闻

基于大语言模型的开源知识库问答系统 MaxKB GitHub Star 数量突破 5,000 个！

周排行

static方法和非static方法的区别（java）

如何查找计算机专业paper

java.lang.ClassFormatError: Incompatible magic value 0 in class file com/sitecha

跳跃游戏II

stm32_之【建立工程】

TeaWeb v0.0.9 发布，统计底层优化、主机监控功能改进

事件分发 -----控制字体大小

JavaScript DOM练习（动态表格添加） December 25，2019

JSF Scope & CDI

实现从零搭建一个登录注册页面（附源代码）

每日归档

更多

2024-05-19(0)

2024-05-18(4)

2024-05-17(34)

2024-05-16(6)

2024-05-15(24)

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)