- Split the set of instances in subsets such that the variation within each subset becomes
smaller
Entropy = degree of uncertainty
,First number means total classified here, second number means incorrectly classified ones of the
total
Exercise
What is tree depth? how many squares there are, 3
Would you further grow tree? yes, ‘young’ has 55 incorrectly classified out of 381. Further
growing could prove usefu
Confusion matrix and measures
Quality measures
Error
, - (FP + FN) / total
How many of the actual negative instances did the model identify?
Accuracy
- (TP + TN) / total
How many instances did the model classify correctly over all instances?
Precision
- TP / (TP + FP)
How many of the predicted positive instances are actually positive?
Recall
- TP / (TP + FN)
How many of the actual positive instances did the model identify?
F1 score
- 2 * (precision * recall) / (precision + recall)
A balance between precision and recall
Exercise
What setting would you choose for the tree size?
Answer between 10 and 20 are the best results, smaller tree is generally better so 10
Lecture 3
Overfitting
- The model is too specific for the data set used to learn the model and performs poorly on
new instances
- High variance
Underfitting
- The model is too general and does not exploit the data
- High bias
The benefits of buying summaries with Stuvia:
Guaranteed quality through customer reviews
Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.
Quick and easy check-out
You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.
Focus on what matters
Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!
Frequently asked questions
What do I get when I buy this document?
You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.
Satisfaction guarantee: how does it work?
Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.
Who am I buying these notes from?
Stuvia is a marketplace, so you are not buying this document from us, but from seller timb3. Stuvia facilitates payment to the seller.
Will I be stuck with a subscription?
No, you only buy these notes for $6.56. You're not tied to anything after your purchase.