Читать книгу Advanced Analytics and Deep Learning Models - Группа авторов - Страница 64

3.4.1.1 Discussion and Result

In this demonstration, they analyzed discrete arrangements. Those are mainly based on aspect-based sentiment analysis. The results we can see in Tables 3.2 and 3.3. They stated the results picked up with AFINN sentiment analysis algorithm, due to space reasons. It did not come out with any major dissimilarity with the CoreNLP algorithm. As it is based on CF user-based approach, on Yelp dataset and Tripadvisor dataset, they took top 10 aspects from the datasets. Besides, the above results are better than the previous 50 aspects. Accordingly, they did not take a bigger space.

One more attractive result comes from the Yelp and TripAdvisor by use of sub-aspect which gave a significant improvement in performance. Here, the maximum efficiency came by using the top 50 aspects. For a better understanding of this, we need to do further investigations [5, 20].

Table 3.2 Result comparison.

Result of MCRS-Based CF Experiment 1

Result of Experiment 2

Configuration	Dataset			Dataset
#neigh.	#asp.	Sub-asp	Yelp	TripAdvisor	Amazon	Configuration	Yelp	TripAdvisor	Amazon
10	10	Y	0.8362	0.7111	0.6464	Multi-U2U	0.8362	0.7111	0.6276
10	10	N	0.841	0.7564	0.6335	U2U-Euclidean	0.886	0.8337	0.7254
10	50	Y	0.841	0.7269	0.6346	U2U-Pearson	0.964	1.1222	0.9789
10	50	N	0.8364	0.8007	0.6276	Static-Multi-U2U	N.A.	0.798	N.A.
30	10	Y	0.8461	0.7677	0.712	Multi-I2I	0.864	0.8245	0.811
30	10	N	0.8473	0.7722	0.7122	I2I-Euclidean	0.8745	0.8429	0.8117
30	50	Y	0.8474	0.7743	0.7101	I2I-Pearson	1.1794	0.8644	0.9679
30	50	N	0.8494	0.8003	0.714	Static-Multi-I2I	N.A.	0.8474	N.A.
80	10	Y	0.8579	0.7971	0.7584	RatingSGD	0.8409	0.745	0.8859
80	10	N	0.8592	0.7953	0.7554	ParallelSGD	0.8409	0.7449	0.8852
80	50	Y	0.859	0.7907	0.7544	ALSWR	0.9545	0.9053	1.0354
80	50	N	0.8597	0.7995	0.7544

Result of MCRS Item–Based CF

Top 10 Main Aspects Extracted

#asp.	Sub-asp	Yelp	TripAdvisor	Amazon
Configuration	Dataset		Yelp	Place, food, service, restaurant, price menu, staff, drink, and lunch
10	Y	0.864	0.8245	0.811	TripAdvisor	Hotel, room, staff, location, service, breakfast, restaurant, bathroom, price, view
10	N	0.8643	0.8252	0.8117
50	Y	0.8641	0.8254	0.8118	Amazon	game, graphic, story, character, player price, gameplay, controller level, and music
50	N	0.8648	0.826	0.8124

It is noted that, to get better efficiency, we need to configure all the datasets in top 10 neighbors.

In the next step, they compared the algorithm with matrix factorization (MF) algorithms for getting better baselines. Besides, it is specified that TripAdvisor dataset has an unchangeable set of six features. For every analysis like “Cleanliness”, “location”, “value”, “service”, “sleep quality”, and “overall”. They have differentiated their way with a MCRS algorithm that depends on those aspects. In Table 3.5, we can see that their method gets the better of against all the baselines. These results surely established that their perception, since they proved that their approach could overpower both strategies exploiting single ratings and MCRS algorithms.

When multiple-criteria user-to-user CF is used as recommender algorithm, then the best overall results are obtained [5].

Advanced Analytics and Deep Learning Models

Подняться наверх