Daniel Voigt Godoy - Deep Learning with PyTorch Step-by-Step: A Beginner’s Guide (Leanpub)

As expected, the attention score of the second data point was set to zero, leaving all of the attention on the first data point.
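The effect described above can be reproduced in isolation. The snippet below is a minimal sketch, not the chapter's Attention class: the score values and the mask convention (True means "attend") are assumptions chosen for illustration. Masked positions are filled with negative infinity before the softmax, so their attention scores come out as exactly zero.

```python
import torch
import torch.nn.functional as F

# Hypothetical alignment scores for one query over two source data points.
# Shape: N=1 sequence, 1 query, L=2 source positions.
scores = torch.tensor([[[0.8, 0.3]]])

# Assumed mask convention: True = attend, False = ignore.
# Here the second data point is masked out.
mask = torch.tensor([[[True, False]]])

# Replace masked positions with -inf so the softmax assigns them zero weight
masked_scores = scores.masked_fill(~mask, float('-inf'))
alphas = F.softmax(masked_scores, dim=-1)

print(alphas)  # tensor([[[1., 0.]]])
```

Since the remaining scores must still sum to one, the only unmasked data point receives all of the attention, whatever its raw score was.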


We also need to make some small adjustments to the decoder:

Decoder + Attention

import torch
import torch.nn as nn

# Attention is the attention module defined earlier in the chapter

class DecoderAttn(nn.Module):
    def __init__(self, n_features, hidden_dim):
        super().__init__()
        self.hidden_dim = hidden_dim
        self.n_features = n_features
        self.hidden = None
        self.basic_rnn = nn.GRU(self.n_features,
                                self.hidden_dim,
                                batch_first=True)
        self.attn = Attention(self.hidden_dim)              # (1)
        self.regression = nn.Linear(2 * self.hidden_dim,
                                    self.n_features)        # (1)

    def init_hidden(self, hidden_seq):
        # the output of the encoder is N, L, H
        # and init_keys expects batch-first as well
        self.attn.init_keys(hidden_seq)                     # (2)
        hidden_final = hidden_seq[:, -1:]                   # N, 1, H
        self.hidden = hidden_final.permute(1, 0, 2)         # L, N, H (L = 1)

    def forward(self, X, mask=None):
        # X is N, 1, F
        batch_first_output, self.hidden = \
            self.basic_rnn(X, self.hidden)
        query = batch_first_output[:, -1:]
        # Attention
        context = self.attn(query, mask=mask)               # (3)
        concatenated = torch.cat([context, query],
                                 dim=-1)                    # (3)
        out = self.regression(concatenated)

        # N, 1, F
        return out.view(-1, 1, self.n_features)
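To see why the regression layer takes 2 * hidden_dim inputs, it may help to follow the tensor shapes through a single decoding step. The snippet below is a rough sketch under stated assumptions: the sizes are arbitrary, the encoder output is random, and a plain dot-product attention stands in for the chapter's Attention module, which is not reproduced here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
# Illustrative sizes: batch, source length, hidden dim, number of features
N, L, H, n_feat = 4, 2, 3, 2

# Pretend encoder output: one hidden state per source step, batch-first
hidden_seq = torch.randn(N, L, H)                       # N, L, H

# Same steps as init_hidden: keep the last hidden state, make it sequence-first
hidden_final = hidden_seq[:, -1:]                       # N, 1, H
hidden = hidden_final.permute(1, 0, 2)                  # 1, N, H

# One decoding step through the GRU
gru = nn.GRU(n_feat, H, batch_first=True)
X = torch.randn(N, 1, n_feat)                           # N, 1, F
output, hidden = gru(X, hidden)
query = output[:, -1:]                                  # N, 1, H

# Stand-in attention (assumption): dot-product scores over the encoder states
scores = torch.bmm(query, hidden_seq.transpose(1, 2))   # N, 1, L
alphas = F.softmax(scores, dim=-1)                      # N, 1, L
context = torch.bmm(alphas, hidden_seq)                 # N, 1, H

# Context and query are stacked along the feature dimension
concatenated = torch.cat([context, query], dim=-1)      # N, 1, 2H
regression = nn.Linear(2 * H, n_feat)
out = regression(concatenated)                          # N, 1, F

print(concatenated.shape, out.shape)
```

The context vector and the query each have H features, so their concatenation has 2H, which is the input size the regression layer must accept.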

728 | Chapter 9 — Part I: Sequence-to-Sequence
