Pytorch在NLP中的简单应用详解

2023-08-03 09:53:04 441

因为之前在项目中一直使用Tensorflow，最近需要处理NLP问题，对Pytorch框架还比较陌生，所以特地再学习一下pytorch在自然语言处理问题中的简单使用，这里做一个记录。

一、Pytorch基础

首先，第一步是导入pytorch的一系列包

importtorch
importtorch.autogradasautograd#Autograd为Tensor所有操作提供自动求导方法
importtorch.nnasnn
importtorch.nn.functionalasF
importtorch.optimasoptim

1）Tensor张量

a)创建Tensors

#tensor
x=torch.Tensor([[1,2,3],[4,5,6]])
#size为2x3x4的随机数随机数
x=torch.randn((2,3,4))

b)Tensors计算

x=torch.Tensor([[1,2],[3,4]])
y=torch.Tensor([[5,6],[7,8]])
z=x+y

c)ReshapeTensors

x=torch.randn(2,3,4)
#拉直
x=x.view(-1)
#4*6维度
x=x.view(4,6)

2）计算图和自动微分

a)Variable变量

#将Tensor变为Variable
x=autograd.Variable(torch.Tensor([1,2,3]),requires_grad=True)
#将Variable变为Tensor
y=x.data

b)反向梯度算法

x=autograd.Variable(torch.Tensor([1,2]),requires_grad=True)
y=autograd.Variable(torch.Tensor([3,4]),requires_grad=True)
z=x+y
#求和
s=z.sum()
#反向梯度传播
s.backward()
print(x.grad)

c)线性映射

linear=nn.Linear(3,5)#三维线性映射到五维
x=autograd.Variable(torch.randn(4,3))
#输出为（4,5）维
y=linear(x)

d)非线性映射（激活函数的使用）

x=autograd.Variable(torch.randn(5))
#relu激活函数
x_relu=F.relu(x)
print(x_relu)
x_soft=F.softmax(x)
#softmax激活函数
print(x_soft)
print(x_soft.sum())

output:

Variablecontaining:
-0.9347
-0.9882
1.3801
-0.1173
0.9317
[torch.FloatTensorofsize5]

Variablecontaining:
0.0481
0.0456
0.4867
0.1089
0.3108
[torch.FloatTensorofsize5]

Variablecontaining:
1
[torch.FloatTensorofsize1]

Variablecontaining:
-3.0350
-3.0885
-0.7201
-2.2176
-1.1686
[torch.FloatTensorofsize5]

二、Pytorch创建网络

1)wordembedding词嵌入

通过nn.Embedding(m,n)实现，m表示所有的单词数目，n表示词嵌入的维度。

word_to_idx={'hello':0,'world':1}
embeds=nn.Embedding(2,5)#即两个单词，单词的词嵌入维度为5
hello_idx=torch.LongTensor([word_to_idx['hello']])
hello_idx=autograd.Variable(hello_idx)
hello_embed=embeds(hello_idx)
print(hello_embed)

output:

Variablecontaining:
-0.69820.3909-1.0760-1.62150.4429
[torch.FloatTensorofsize1x5]

2)N-Gram语言模型

先介绍一下N-Gram语言模型，给定一个单词序列，计算，其中是序列的第个单词。

importtorch
importtorch.nnasnn
importtorch.nn.functionalasF
importtorch.autogradasautograd
importtorch.optimasoptim

fromsix.movesimportxrange

对句子进行分词：

context_size=2
embed_dim=10
text_sequence="""Whenfortywintersshallbesiegethybrow,
Anddigdeeptrenchesinthybeauty'sfield,
Thyyouth'sproudliverysogazedonnow,
Willbeatotter'dweedofsmallworthheld:
Thenbeingasked,whereallthybeautylies,
Whereallthetreasureofthylustydays;
Tosay,withinthineowndeepsunkeneyes,
Wereanall-eatingshame,andthriftlesspraise.
Howmuchmorepraisedeserv'dthybeauty'suse,
Ifthoucouldstanswer'Thisfairchildofmine
Shallsummycount,andmakemyoldexcuse,'
Provinghisbeautybysuccessionthine!
Thisweretobenewmadewhenthouartold,
Andseethybloodwarmwhenthoufeel'stitcold.""".split()
#分词
trigrams=[([text_sequence[i],text_sequence[i+1]],text_sequence[i+2])foriinxrange(len(text_sequence)-2)]
trigrams[:10]

分词的形式为：

#建立vocab索引
vocab=set(text_sequence)
word_to_ix={word:ifori,wordinenumerate(vocab)}

建立N-GramLanguagemodel

#N-GramLanguagemodel
classNGramLanguageModeler(nn.Module):
def__init__(self,vocab_size,embed_dim,context_size):
super(NGramLanguageModeler,self).__init__()
#词嵌入
self.embedding=nn.Embedding(vocab_size,embed_dim)
#两层线性分类器
self.linear1=nn.Linear(embed_dim*context_size,128)
self.linear2=nn.Linear(128,vocab_size)

defforward(self,input):
embeds=self.embedding(input).view((1,-1))#2,10拉直为20
out=F.relu(self.linear1(embeds))
out=F.relu(self.linear2(out))
log_probs=F.log_softmax(out)
returnlog_probs

输出模型看一下网络结构

#输出模型看一下网络结构
model=NGramLanguageModeler(96,10,2)
print(model)

定义损失函数和优化器

#定义损失函数以及优化器
loss_function=nn.NLLLoss()
optimizer=optim.SGD(model.parameters(),lr=0.01)
model=NGramLanguageModeler(len(vocab),embed_dim,context_size)
losses=[]

模型训练

#模型训练
forepochinxrange(10):
total_loss=torch.Tensor([0])
forcontext,targetintrigrams:
#1.处理数据输入为索引向量
#print(context)
#注：python3中map函数前要加上list()转换为列表形式
context_idxs=list(map(lambdaw:word_to_ix[w],context))
#print(context_idxs)
context_var=autograd.Variable(torch.LongTensor(context_idxs))


#2.梯度清零
model.zero_grad()

#3.前向传播，计算下一个单词的概率
log_probs=model(context_var)

#4.损失函数
loss=loss_function(log_probs,autograd.Variable(torch.LongTensor([word_to_ix[target]])))

#反向传播及梯度更新
loss.backward()
optimizer.step()

total_loss+=loss.data
losses.append(total_loss)
print(losses)

以上这篇Pytorch在NLP中的简单应用详解就是小编分享给大家的全部内容了，希望能给大家一个参考，也希望大家多多支持毛票票。

声明：本文内容来源于网络，版权归原作者所有，内容由互联网用户自发贡献自行上传，本网站不拥有所有权，未作人工编辑处理，也不承担相关法律责任。如果您发现有涉嫌版权的内容，欢迎发送邮件至：czq8825#qq.com（发邮件时，请将#更换为@）进行举报，并提供相关证据，一经查实，本站将立刻删除涉嫌侵权内容。

Pytorch在NLP中的简单应用详解

热门推荐

随机推荐