RUHSOLD dataset.
时间: 2023-10-16 21:06:59 浏览: 105
RUHSOLD dataset is a publicly available dataset that contains Russian hotel reviews. It consists of 100,000 hotel reviews in Russian language and covers a wide range of hotels across Russia. The dataset is divided into two subsets: a training set with 80,000 reviews and a test set with 20,000 reviews. Each review is labeled with a rating on a scale of 1 to 5, where 1 is the lowest and 5 is the highest rating. The dataset can be used for sentiment analysis and other natural language processing tasks.
阅读全文