Computer Science > QUESTIONS & ANSWERS > UIUC-CS 412 Introduction To Data Mining - University of Illinois (Fall 2015) Final Exam Solution. 1 (All)

UIUC-CS 412 Introduction To Data Mining - University of Illinois (Fall 2015) Final Exam Solution. 150 points

Document Content and Description Below

UIUC-CS412 \Introduction to Data Mining" (Fall 2015) Final Exam Solution 180 minutes, 150 points 1. Short answer questions [30’]. (a) [3’] What is the advantage of cosine similarity over E... uclidean distance in measuring the document similarity? (b) [3’] Consider a transaction database in which for each single item x, there is at least one transaction that only contains x. Can the number of closed frequent patterns be smaller than the number of frequent patterns? (Answer yes or no; no need to explain.) (c) [3’] In graph mining algorithm gSpan, what key steps were used to avoid redundant computation but not to miss any possible frequent graph? (d) [3’] In GSP, for a sequence pattern of length 100, what is the lower bound of the size of length- 1 candidates? How about that of length-2? And what is the number for all candidates? (Use the tightest lower bound estimation you can get.) (e) [3’] The larger k is, the more accurate the kNN classifier will be. (Choose from True/False, and briefly explain.) (f) [3’] Decision trees are superior to Naive Bayes classifiers since they give more interpretable results. (Choose from True/False, and briefly explain.) (g) [3’] A paper suggests running a single-link hierarchical clustering algorithm for a few iterations on the K-means clustering result. List one advantage of this approach over using K-Means alone. (h) [3’] In some cases, using PCA to reduce dimension can make classification much less precise than it would have been in the original feature space. Explain how this can happen. (i) [3’] In our guest lecture by Matt Ahrens, according to the speaker, what is the takeaway of the lecture, if there is only one thing? (j) [3’] In our guest lecture by Matt Ahrens, he explained that fraudulent advertisements are rather significant. What is the typical rate he reported? a) 1-3% b) 5-10% c) 15-20% d) 30-40% [Show More]

Last updated: 2 years ago

Preview 1 out of 17 pages

Buy Now

Instant download

We Accept:

We Accept
document-preview

Buy this document to get the full access instantly

Instant Download Access after purchase

Buy Now

Instant download

We Accept:

We Accept

Reviews( 0 )

$6.50

Buy Now

We Accept:

We Accept

Instant download

Can't find what you want? Try our AI powered Search

53
0

Document information


Connected school, study & course


About the document


Uploaded On

Apr 02, 2023

Number of pages

17

Written in

Seller


seller-icon
PAPERS UNLIMITED™

Member since 3 years

509 Documents Sold

Reviews Received
55
20
8
2
8
Additional information

This document has been written for:

Uploaded

Apr 02, 2023

Downloads

 0

Views

 53

Document Keyword Tags

More From PAPERS UNLIMITED™

View all PAPERS UNLIMITED™'s documents »

$6.50
What is Scholarfriends

In Scholarfriends, a student can earn by offering help to other student. Students can help other students with materials by upploading their notes and earn money.

We are here to help

We're available through e-mail, Twitter, Facebook, and live chat.
 FAQ
 Questions? Leave a message!

Follow us on
 Twitter

Copyright © Scholarfriends · High quality services·