100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached
logo-home
BIDA 630 Data Analytics Study Guide with Complete Solutions $9.99   Add to cart

Exam (elaborations)

BIDA 630 Data Analytics Study Guide with Complete Solutions

 0 view  0 purchase
  • Course
  • BIDA 630
  • Institution
  • BIDA 630

BIDA 630 Data Analytics Study Guide with Complete Solutions Identify whether the task required is supervised or unsupervised learning: Deciding whether to issue a loan to an applicant based on demographic and financial data (with reference to a database of similar data on prior customers). - S...

[Show more]

Preview 3 out of 18  pages

  • October 3, 2024
  • 18
  • 2024/2025
  • Exam (elaborations)
  • Questions & answers
  • BIDA 630
  • BIDA 630
avatar-seller
EmillyCharlotte
EMILLYCHARLOTTE 2024/2025 ACADEMIC YEAR ©2024 ALL RIGHTS RESERVED,
FIRST PUBLISH SEPTEMBER 2024

BIDA 630 Data Analytics Study Guide
with Complete Solutions

Identify whether the task required is supervised or unsupervised learning: Deciding

whether to issue a loan to an applicant based on demographic and financial data (with

reference to a database of similar data on prior customers).




- Supervised

- Unsupervised - Answer✔✔-Supervised



This is supervised learning, because the database includes whether the loan was

approved or not.

Identify whether the task required is supervised or unsupervised learning: Printing of

custom discount coupons at the conclusion of a grocery store checkout based on what

you just bought and what others have bought previously.



- Supervised

- Unsupervised - Answer✔✔-Unsupervised



This is unsupervised learning, if we assume that we do not know what will be purchased

in the future.

The test data are used to build models, or to further tweak the model or improve its fit.


1/18

, EMILLYCHARLOTTE 2024/2025 ACADEMIC YEAR ©2024 ALL RIGHTS RESERVED,
FIRST PUBLISH SEPTEMBER 2024



- True

- False - Answer✔✔-False



The test data are not used to build models, or to further tweak the model or improve its

fit. (If the test data were used for these purposes, they would play a role in building or

selecting the best model, and would no longer provide an unbiased assessment of the

chosen model's performance with completely new data.)

_____________ of data is used to assess the performance of each supervised learning

model so that we can compare models and pick the best one.



- The test partition

- The validation partition - Answer✔✔-Validation



The validation partition is used to assess the performance of each supervised learning

model so that we can compare models and pick the best one. In some algorithms (e.g.,

classification and regression trees, k-nearest neighbors) the validation partition may be

used in automated fashion to tune and improve the model. This means that the

validation data are actually used to help build the model.

When a model is fit to training data, zero error with those data is not necessarily good.

This special case is called ______.



- Overestimating

2/18

, EMILLYCHARLOTTE 2024/2025 ACADEMIC YEAR ©2024 ALL RIGHTS RESERVED,
FIRST PUBLISH SEPTEMBER 2024
- Good fit

- Overfitting - Answer✔✔-Overfitting



Overfitting occurs when the model captures not only the generalizeable pattern in the

data, but also the error. When we split the data into training and validation sets, we

assume that the same pattern (if there is a pattern) exists in both, and that they differ

only in the error that they contain. An absurd and false model may fit perfectly (on

training data set) if the model has enough complexity. Therefore, we may get zero error

for such a model using the training dataset. Such a model, however, is not likely to give

useful results on the validation data set.

Bar charts are useful for comparing a single statistic (e.g. average, count, percentage)

across groups. The height of the bar represents the value of statistic, and different bars

correspond to different groups.



- True

- False - Answer✔✔-True

Which of the following are the most popular visualization tools in JMP_Pro? (3 correct

answers)



- Distribution

- Fit Y by X

- Graph Builder

- Data visualizer

3/18

The benefits of buying summaries with Stuvia:

Guaranteed quality through customer reviews

Guaranteed quality through customer reviews

Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.

Quick and easy check-out

Quick and easy check-out

You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.

Focus on what matters

Focus on what matters

Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!

Frequently asked questions

What do I get when I buy this document?

You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.

Satisfaction guarantee: how does it work?

Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.

Who am I buying these notes from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller EmillyCharlotte. Stuvia facilitates payment to the seller.

Will I be stuck with a subscription?

No, you only buy these notes for $9.99. You're not tied to anything after your purchase.

Can Stuvia be trusted?

4.6 stars on Google & Trustpilot (+1000 reviews)

79064 documents were sold in the last 30 days

Founded in 2010, the go-to place to buy study notes for 14 years now

Start selling
$9.99
  • (0)
  Add to cart