Created
April 12, 2016 02:26
The Data set of Neural Responding Machine for Short-Text Conversation
Thank you so much!
mark
The dataset link is not active.
Original link is now dead. Found a copy here: http://61.93.89.94/Noah_NRM_Data/
None of the links are available now. I hope someone who has already downloaded this data set can provide a new download link.
@clearmymind hi Shang, where could we upload the dataset now?
More than thankful!
This dataset consists of 4.4 million message-response pairs crawled from Weibo. It can be used to train a neural dialogue system. You can get the dataset for research purposes from Noah_NRM_Data (link: https://pan.baidu.com/s/1x4MD5OL-ewxvcCS6d0j5Jw password: 3n82). If you have any questions about the dataset, please contact Lifeng Shang (lifengshang@gmail.com) or Zhengdong Lu.
Please cite the following paper if you use the data in your work.
Neural Responding Machine for Short-Text Conversation. Lifeng Shang, Zhengdong Lu, and Hang Li. ACL 2015.
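For anyone who wants to feed the message-response pairs into a training pipeline, the following is a minimal Python sketch of a pair reader. The file layout is not described here, so the one-pair-per-line, tab-separated format is purely an assumption for illustration, and the function name `iter_pairs` and the sample lines are hypothetical. Inspect the downloaded files first and adjust the separator or parsing to match what you actually find.

```python
from typing import Iterable, Iterator, Tuple


def iter_pairs(lines: Iterable[str], sep: str = "\t") -> Iterator[Tuple[str, str]]:
    """Yield (message, response) tuples from lines shaped like 'message<sep>response'.

    ASSUMPTION: one pair per line, split on the first occurrence of `sep`.
    The real dataset files may be laid out differently.
    """
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip():
            continue  # skip blank lines
        message, found, response = line.partition(sep)
        if not found:
            continue  # skip lines that do not contain the separator
        yield message.strip(), response.strip()


# Tiny in-memory sample standing in for a real file (hypothetical data).
sample = ["hello there\thi, how are you?\n", "\n", "malformed line\n"]
pairs = list(iter_pairs(sample))
```

With a real file, the same function can be applied to an open file handle, for example `with open(path, encoding="utf-8") as f: pairs = list(iter_pairs(f))`, where `path` and the UTF-8 encoding are likewise assumptions to verify against the download.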