Skip to content

关于模型的Loss #1

@Protostars

Description

@Protostars

文章中公式(12)和代码似乎不一致?
代码中似乎按公式(2)计算了每个序列的ground truth对应的Loss:

loss = F.nll_loss((scores+1e-8).log(), data['item_tgt'].reshape(-1), ignore_index=0)

而并非文章中对每个序列的每个元素τ=1,...,|I_S|计算概率,这样做也没有意义
请问是否是文章公式(12)有误?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions