• Login
    View Item 
    •   DBS eSource Home
    • Masters Dissertations
    • Information & Communications Technology
    • View Item
    •   DBS eSource Home
    • Masters Dissertations
    • Information & Communications Technology
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Yelp rating classification using connected graph feature extraction and feature importance in machine learning workflow

    View/Open
    msc_shaikh_a_2019.pdf (2.030Mb)
    Author
    Shaikh, Aquib Hassan
    Date
    2019
    Degree
    MSc in Data Analytics
    URI
    https://esource.dbs.ie/handle/10788/3962
    Publisher
    Dublin Business School
    Rights holder
    http://esource.dbs.ie/copyright
    Rights
    Items in eSource are protected by copyright. Previously published items are made available in accordance with the copyright policy of the publisher/copyright holder.
    Metadata
    Show full item record
    Abstract
    This thesis titled- “Yelp Rating Classification Using Connected Graph Feature Extraction and Feature Importance In Machine Learning Workflow” focused on Yelp’s Challenge Dataset Round 13, we analyze data about restaurants from Yelp, specifically the reviews, to classify the star-ratings of the restaurants based on the contents of the reviews. In this thesis, I focus on improving the ML workflow using graph algorithms: connected feature extraction and feature importance in classification. Graph-enhanced ML can help fill in that missing contextual information that is so important for better decisions. ML pipeline was build using a few classification algorithms and H2O AutoML: Automatic Machine Learning interface for automating the machine learning workflow. The results obtained reveal that connected graph features played an important role in enhanced machine learning workflow. H2O’s Stacked Ensemble best able to classify the yelp rating with use of business influential rating obtained from Page Rank graph algorithm.
    Collections
    • Information & Communications Technology

    Browse

    All of DBS eSourceCommunities & CollectionsBy Issue DateAuthorsSupervisorTitlesSubjectsThis CollectionBy Issue DateAuthorsSupervisorTitlesSubjects

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    DSpace software copyright © 2002-2022  DuraSpace
    Contact Us | Send Feedback
    DSpace Express is a service operated by 
    Atmire NV