Currently the attention example is not very efficient, particularly on GPUs. For example, this for loop could be replaced with a single matrix expression over all source positions, so the input-side projection only needs to be computed once per sentence instead of once per decoding step (see the sketch below):
https://github.com/clab/dynet/blob/master/examples/python/attention.py#L75
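For reference, a minimal sketch of what the batched version might look like, assuming the example's module-level parameter names (`attention_w1`, `attention_w2`, `attention_v`) and using DyNet's `concatenate_cols` and `colwise_add`:

```python
import dynet as dy

def attend(input_mat, state, w1dt):
    # input_mat: encoder states concatenated once per sentence with
    #            dy.concatenate_cols(input_vectors)
    # w1dt:      attention_w1 * input_mat, also computed once per sentence
    w2 = dy.parameter(attention_w2)
    v = dy.parameter(attention_v)
    w2dt = w2 * dy.concatenate(list(state.s()))
    # one matrix expression over all source positions, no per-word loop
    unnormalized = dy.transpose(v * dy.tanh(dy.colwise_add(w1dt, w2dt)))
    att_weights = dy.softmax(unnormalized)
    return input_mat * att_weights  # weighted sum of the encoder states
```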
Also, here the attention model generates its output by sampling randomly from the output distribution instead of greedily selecting the highest-probability symbol, which would be more in line with what we would expect:
https://github.com/clab/dynet/blob/master/examples/python/attention.py#L105
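A minimal change along these lines would do greedy selection; `out_vector` here stands for the decoder's pre-softmax output at each step, as in the example's generate loop:

```python
import numpy as np
import dynet as dy

def pick_next(out_vector):
    """Greedy decoding: take the argmax of the output distribution
    instead of sampling a character from it at random."""
    probs = dy.softmax(out_vector).npvalue()
    return int(np.argmax(probs))
```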