Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. HotelRec: a Novel Very Large-Scale Hotel Recommendation Dataset
 
Loading...
Thumbnail Image
conference paper

HotelRec: a Novel Very Large-Scale Hotel Recommendation Dataset

Antognini, Diego  
•
Faltings, Boi  
January 1, 2020
Proceedings Of The 12Th International Conference On Language Resources And Evaluation (Lrec 2020)
12th International Conference on Language Resources and Evaluation (LREC)

Today, recommender systems are an inevitable part of everyone's daily digital routine and are present on most internet platforms. State-of-the-art deep learning-based models require a large number of data to achieve their best performance. Many datasets fulfilling this criterion have been proposed for multiple domains, such as Amazon products, restaurants, or beers. However, works and datasets in the hotel domain are limited: the largest hotel review dataset is below the million samples. Additionally, the hotel domain suffers from a higher data sparsity than traditional recommendation datasets and therefore, traditional collaborative-filtering approaches cannot be applied to such data. In this paper, we propose HotelRec, a very large-scale hotel recommendation dataset, based on TripAdvisor, containing 50 million reviews. To the best of our knowledge, HotelRec is the largest publicly available dataset in the hotel domain ( 50M versus 0:9M) and additionally, the largest recommendation dataset in a single domain and with textual reviews ( 50M versus 22M). We release HotelRec for further research: https://github.com/Diego999/HotelRec.

  • Details
  • Metrics
Type
conference paper
Web of Science ID

WOS:000724697205111

Author(s)
Antognini, Diego  
•
Faltings, Boi  
Date Issued

2020-01-01

Publisher

EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA

Publisher place

Paris

Journal
Proceedings Of The 12Th International Conference On Language Resources And Evaluation (Lrec 2020)
ISBN of the book

979-10-95546-34-4

Start page

4917

End page

4923

Subjects

reviews

•

recommender systems

•

text mining

•

sentiment analysis

Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIA  
Event nameEvent placeEvent date
12th International Conference on Language Resources and Evaluation (LREC)

Marseille, FRANCE

May 11-16, 2020

Available on Infoscience
January 15, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/184462
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés