電子發(fā)燒友網(wǎng)>電子資料下載>電子資料>PyTorch教程16.2之情感分析:使用遞歸神經(jīng)網(wǎng)絡(luò)

PyTorch教程16.2之情感分析:使用遞歸神經(jīng)網(wǎng)絡(luò)

2512950 2023-06-05 | pdf | 0.20 MB | 次下載 | 免費(fèi)

資料介紹

與詞相似度和類比任務(wù)一樣，我們也可以將預(yù)訓(xùn)練詞向量應(yīng)用于情感分析。由于第 16.1 節(jié)中的 IMDb 評(píng)論數(shù)據(jù)集不是很大，使用在大規(guī)模語料庫上預(yù)訓(xùn)練的文本表示可能會(huì)減少模型的過度擬合。作為圖 16.2.1所示的具體示例，我們將使用預(yù)訓(xùn)練的 GloVe 模型表示每個(gè)標(biāo)記，并將這些標(biāo)記表示輸入多層雙向 RNN 以獲得文本序列表示，并將其轉(zhuǎn)換為情感分析輸出（Maas等，2011）。對(duì)于相同的下游應(yīng)用程序，我們稍后會(huì)考慮不同的架構(gòu)選擇。

https://file.elecfans.com/web2/M00/A9/CD/poYBAGR9PJyAB1TZAAKGTdnYvUk151.svg

圖 16.2.1本節(jié)將預(yù)訓(xùn)練的 GloVe 提供給基于 RNN 的架構(gòu)進(jìn)行情緒分析。

						import torch
from torch import nn
from d2l import torch as d2l

batch_size = 64
train_iter, test_iter, vocab = d2l.load_data_imdb(batch_size)

						 

						from mxnet import gluon, init, np, npx
from mxnet.gluon import nn, rnn
from d2l import mxnet as d2l

npx.set_np()

batch_size = 64
train_iter, test_iter, vocab = d2l.load_data_imdb(batch_size)

						 

16.2.1。用 RNN 表示單個(gè)文本

在文本分類任務(wù)中，例如情感分析，變長的文本序列將被轉(zhuǎn)換為固定長度的類別。在下面的BiRNN類中，雖然文本序列的每個(gè)標(biāo)記都通過嵌入層 ( self.embedding) 獲得其單獨(dú)的預(yù)訓(xùn)練 GloVe 表示，但整個(gè)序列由雙向 RNN ( self.encoder) 編碼。更具體地說，雙向 LSTM 在初始和最終時(shí)間步的隱藏狀態(tài)（在最后一層）被連接起來作為文本序列的表示。然后通過具有兩個(gè)輸出（“正”和“負(fù)”）的全連接層 ( self.decoder) 將該單一文本表示轉(zhuǎn)換為輸出類別。

							class BiRNN(nn.Module):
  def __init__(self, vocab_size, embed_size, num_hiddens,
         num_layers, **kwargs):
    super(BiRNN, self).__init__(**kwargs)
    self.embedding = nn.Embedding(vocab_size, embed_size)
    # Set `bidirectional` to True to get a bidirectional RNN
    self.encoder = nn.LSTM(embed_size, num_hiddens, num_layers=num_layers,
                bidirectional=True)
    self.decoder = nn.Linear(4 * num_hiddens, 2)

  def forward(self, inputs):
    # The shape of `inputs` is (batch size, no. of time steps). Because
    # LSTM requires its input's first dimension to be the temporal
    # dimension, the input is transposed before obtaining token
    # representations. The output shape is (no. of time steps, batch size,
    # word vector dimension)
    embeddings = self.embedding(inputs.T)
    self.encoder.flatten_parameters()
    # Returns hidden states of the last hidden layer at different time
    # steps. The shape of `outputs` is (no. of time steps, batch size,
    # 2 * no. of hidden units)
    outputs, _ = self.encoder(embeddings)
    # Concatenate the hidden states at the initial and final time steps as
    # the input of the fully connected layer. Its shape is (batch size,
    # 4 * no. of hidden units)
    encoding = torch.cat((outputs[0], outputs[-1]), dim=1)
    outs = self.decoder(encoding)
    return outs

							 

							class BiRNN(nn.Block):
  def __init__(self, vocab_size, embed_size, num_hiddens,
         num_layers, **kwargs):
    super(BiRNN, self).__init__(**kwargs)
    self.embedding = nn.Embedding(vocab_size, embed_size)
    # Set `bidirectional` to True to get a bidirectional RNN
    self.encoder = rnn.LSTM(num_hiddens, num_layers=num_layers,
                bidirectional=True, input_size=embed_size)
    self.decoder = nn.Dense(2)

  def forward(self, inputs):
    # The shape of `inputs` is (batch size, no. of time steps). Because
    # LSTM requires its input's first dimension to be the temporal
    # dimension, the input is transposed before obtaining token
    # representations. The output shape is (no. of time steps, batch size,
    # word vector dimension)
    embeddings = self.embedding(inputs.T)
    # Returns hidden states of the last hidden layer at different time
    # steps. The shape of `outputs` is (no. of time steps, batch size,
    # 2 * no. of hidden units)
    outputs = self.encoder(embeddings)
    # Concatenate the hidden states at the initial and final time steps as
    # the input of the fully connected layer. Its shape is (batch size,
    # 4 * no. of hidden units)
    encoding = np.concatenate((outputs[0], outputs[-1]), axis=1)
    outs = self.decoder(encoding)
    return outs

							 

讓我們構(gòu)建一個(gè)具有兩個(gè)隱藏層的雙向 RNN 來表示用于情感分析的單個(gè)文本。

							embed_size, num_hiddens, num_layers, devices = 100, 100, 2, d2l.try_all_gpus()
net = BiRNN(len(vocab), embed_size, num_hiddens, num_layers)

def init_weights(module):
  if type(module) == nn.Linear:
    nn.init.xavier_uniform_(module.weight)
  if type(module) == nn.LSTM:
    for param in module._flat_weights_names:
      if "weight" in param:
        nn.init.xavier_uniform_(module._parameters[param])
net.apply(init_weights);

							 

							embed_size, num_hiddens, num_layers, devices = 100, 100, 2, d2l.try_all_gpus()
net = BiRNN(len(vocab), embed_size, num_hiddens, num_layers)

net.initialize(init.Xavier(), ctx=devices)

16.2.2。加載預(yù)訓(xùn)練詞向量

embed_size下面我們?yōu)樵~匯表中的標(biāo)記加載預(yù)訓(xùn)練的 100 維（需要與一致）GloVe 嵌入。

							glove_embedding = d2l.TokenEmbedding('glove.6b.100d')

							 

							Downloading ../data/glove.6B.100d.zip from http://d2l-data.s3-accelerate.amazonaws.com/glove.6B.100d.zip...

						

							glove_embedding = d2l.TokenEmbedding('glove.6b.100d')

							 

打印詞匯表中所有標(biāo)記的向量形狀。

							embeds = glove_embedding[vocab.idx_to_token]
embeds.shape

							torch.Size([49346, 100])

						

							embeds = glove_embedding[vocab.idx_to_token]
embeds.shape

							(49346, 100)

						

我們使用這些預(yù)訓(xùn)練的詞向量來表示評(píng)論中的標(biāo)記，并且不會(huì)在訓(xùn)練期間更新這些向量。

							net.embedding.weight.data.copy_(embeds)
net.embedding.weight.requires_grad = False

							net.embedding.weight.set_data(embeds)
net.embedding.collect_params().setattr('grad_req', 'null')

16.2.3。訓(xùn)練和評(píng)估模型

現(xiàn)在我們可以訓(xùn)練雙向 RNN 進(jìn)行情感分析。

							lr, num_epochs = 0.01, 5
trainer = torch.optim.Adam(net.parameters(), lr=lr)
loss = nn.CrossEntropyLoss(reduction="none")
d2l.train_ch13(net, train_iter, test_iter, loss, trainer, num_epochs, devices)

							 

							loss 0.311, train acc 0.872, test acc 0.850
574.5 examples/sec on [device(type='cuda', index=0), device(type='cuda', index=1)]

https://file.elecfans.com/web2/M00/A9/CD/poYBAGR9PJ6AJIk8AAECA4Wy71Y322.svg

							lr, num_epochs = 0.01, 5
trainer = gluon.Trainer(net.collect_params(), 'adam', {'learning_rate': lr})
loss = gluon.loss.SoftmaxCrossEntropyLoss()
d2l.train_ch13(net, train_iter, test_iter, loss, trainer, num_epochs, devices)

							 

							loss 0.428, train acc 0.806, test acc 0.791
488.5 examples/sec on [gpu(0), gpu(1)]

https://file.elecfans.com/web2/M00/AA/48/pYYBAGR9PKGAE9v0AAEB8Qpd38M668.svg

我們定義了以下函數(shù)來使用經(jīng)過訓(xùn)練的模型預(yù)測(cè)文本序列的情緒net。

神經(jīng)網(wǎng)絡(luò)rnn pytorch

下載該資料的人也在下載下載該資料的人還在閱讀

更多 >

PyTorch如何實(shí)現(xiàn)多層全連接神經(jīng)網(wǎng)絡(luò) 532次閱讀
遞歸神經(jīng)網(wǎng)絡(luò)和循環(huán)神經(jīng)網(wǎng)絡(luò)的模型結(jié)構(gòu) 295次閱讀
遞歸神經(jīng)網(wǎng)絡(luò)的實(shí)現(xiàn)方法 186次閱讀
BP神經(jīng)網(wǎng)絡(luò)在語言特征信號(hào)分類中的應(yīng)用 181次閱讀
BP神經(jīng)網(wǎng)絡(luò)和卷積神經(jīng)網(wǎng)絡(luò)的關(guān)系 530次閱讀
BP神經(jīng)網(wǎng)絡(luò)和人工神經(jīng)網(wǎng)絡(luò)的區(qū)別 340次閱讀
PyTorch神經(jīng)網(wǎng)絡(luò)模型構(gòu)建過程 276次閱讀
人工神經(jīng)網(wǎng)絡(luò)的案例分析 473次閱讀
深度神經(jīng)網(wǎng)絡(luò)與基本神經(jīng)網(wǎng)絡(luò)的區(qū)別 307次閱讀
卷積神經(jīng)網(wǎng)絡(luò)與循環(huán)神經(jīng)網(wǎng)絡(luò)的區(qū)別 878次閱讀
使用PyTorch構(gòu)建神經(jīng)網(wǎng)絡(luò) 412次閱讀
神經(jīng)網(wǎng)絡(luò)架構(gòu)有哪些 324次閱讀
教你用PyTorch快速準(zhǔn)確地建立神經(jīng)網(wǎng)絡(luò) 3186次閱讀
BP神經(jīng)網(wǎng)絡(luò)概述 4.4w次閱讀
卷積神經(jīng)網(wǎng)絡(luò)CNN架構(gòu)分析-LeNet 2687次閱讀

評(píng)論

資料 -- | 積分 --

查看他上傳的所有資料

+關(guān)注個(gè)人主頁

上傳資料賺積分

下載排行

本周

1山景DSP芯片AP8248A2數(shù)據(jù)手冊(cè)
1.06 MB | 532次下載 | 免費(fèi)
2RK3399完整板原理圖（支持平板，盒子VR）
3.28 MB | 339次下載 | 免費(fèi)
3TC358743XBG評(píng)估板參考手冊(cè)
1.36 MB | 330次下載 | 免費(fèi)
4DFM軟件使用教程
0.84 MB | 295次下載 | 免費(fèi)
5元宇宙深度解析—未來的未來-風(fēng)口還是泡沫
6.40 MB | 227次下載 | 免費(fèi)
6迪文DGUS開發(fā)指南
31.67 MB | 194次下載 | 免費(fèi)
7元宇宙底層硬件系列報(bào)告
13.42 MB | 182次下載 | 免費(fèi)
8FP5207XR-G1中文應(yīng)用手冊(cè)
1.09 MB | 178次下載 | 免費(fèi)

本月

1OrCAD10.5下載OrCAD10.5中文版軟件
0.00 MB | 234315次下載 | 免費(fèi)
2555集成電路應(yīng)用800例(新編版)
0.00 MB | 33566次下載 | 免費(fèi)
3接口電路圖大全
未知 | 30323次下載 | 免費(fèi)
4開關(guān)電源設(shè)計(jì)實(shí)例指南
未知 | 21549次下載 | 免費(fèi)
5電氣工程師手冊(cè)免費(fèi)下載(新編第二版pdf電子書)
0.00 MB | 15349次下載 | 免費(fèi)
6數(shù)字電路基礎(chǔ)pdf(下載)
未知 | 13750次下載 | 免費(fèi)
7電子制作實(shí)例集錦下載
未知 | 8113次下載 | 免費(fèi)
8《LED驅(qū)動(dòng)電路設(shè)計(jì)》溫德爾著
0.00 MB | 6656次下載 | 免費(fèi)

總榜

1matlab軟件下載入口
未知 | 935054次下載 | 免費(fèi)
2protel99se軟件下載(可英文版轉(zhuǎn)中文版)
78.1 MB | 537798次下載 | 免費(fèi)
3MATLAB 7.1 下載 (含軟件介紹)
未知 | 420027次下載 | 免費(fèi)
4OrCAD10.5下載OrCAD10.5中文版軟件
0.00 MB | 234315次下載 | 免費(fèi)
5Altium DXP2002下載入口
未知 | 233046次下載 | 免費(fèi)
6電路仿真軟件multisim 10.0免費(fèi)下載
340992 | 191187次下載 | 免費(fèi)
7十天學(xué)會(huì)AVR單片機(jī)與C語言視頻教程下載
158M | 183279次下載 | 免費(fèi)
8proe5.0野火版下載(中文版免費(fèi)下載)
未知 | 138040次下載 | 免費(fèi)

搜索歷史