Business Intelligence: summary of the PowerPoint slides

Course
Business Intelligence

Institution
Universiteit Gent (UGent)

I have put the PowerPoint slides together into a clear overview. It contains the theory material that needs to be known. It is useful to go through this document alongside the 'te kennen leerstof' (material to know) document on Ufora and pull out what is relevant.

Published on 31 May 2022
Number of pages: 60
Written in 2021/2022
Type: Summary

Seller: femkedw1

1.2. Different data mining tasks

Each data-driven business problem is unique, but a set of common tasks underlies these problems.

E.g. churn at MegaTelco: the business problem itself is unique, but identifying which customers are more likely to terminate their contracts is a standard probability estimation problem.

Critical skill: decompose a problem into pieces such that each piece matches a known task.

1) Classification & class probability estimation

Attempt to predict, for each individual in a population, which class the individual belongs to.

Purpose

• Classification: which group does this individual belong to?
• Class probability estimation: how likely is it that this individual belongs to group X or Y?

2) Regression

Attempt to estimate or predict, for each individual, the numerical value of some variable.

Classification vs. regression

Classification: will something happen?

Regression: to what degree will something happen?
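
To make the distinction concrete, here is a minimal sketch using a hand-written k-nearest-neighbours rule. The customer data, the feature names, and the helper functions are all hypothetical and only serve as an illustration; the slides do not prescribe this technique.

```python
from math import dist

# Hypothetical toy data: (monthly_minutes, support_calls), churned?, monthly_spend
customers = [
    ((120.0, 0.0), False, 20.0),
    ((150.0, 1.0), False, 25.0),
    ((400.0, 4.0), True, 60.0),
    ((420.0, 5.0), True, 65.0),
    ((380.0, 3.0), True, 55.0),
]

def nearest(features, k=3):
    """Return the k training rows closest to `features` (Euclidean distance)."""
    return sorted(customers, key=lambda row: dist(row[0], features))[:k]

def churn_probability(features, k=3):
    """Class probability estimation: share of the k neighbours that churned."""
    rows = nearest(features, k)
    return sum(1 for _, churned, _ in rows if churned) / len(rows)

def will_churn(features, k=3):
    """Classification: will something happen? (categorical target)"""
    return churn_probability(features, k) >= 0.5

def expected_spend(features, k=3):
    """Regression: to what degree? (numerical target)"""
    rows = nearest(features, k)
    return sum(spend for _, _, spend in rows) / len(rows)

new_customer = (390.0, 4.0)
print(will_churn(new_customer))         # a class label
print(churn_probability(new_customer))  # a probability between 0 and 1
print(expected_spend(new_customer))     # a numerical value
```

The same neighbours answer all three questions; only the target (a category, a probability, or a number) changes.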

3) Similarity matching
Look at two objects and determine how similar they are; identify similar objects
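
As an illustration, similarity between two objects is often expressed as a distance or a similarity score between their feature vectors. The sketch below uses Euclidean distance and cosine similarity; the profiles and the function names are assumptions made for this example.

```python
from math import dist, sqrt

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def most_similar(target, candidates):
    """Return the name of the candidate with the smallest Euclidean distance."""
    return min(candidates, key=lambda name: dist(candidates[name], target))

# Hypothetical customer profiles described by three numeric features.
profiles = {
    "A": (1.0, 0.0, 2.5),
    "B": (5.0, 2.0, 0.0),
    "C": (1.0, 1.0, 2.0),
}
target = (1.0, 0.0, 2.0)

print(most_similar(target, profiles))
print(cosine_similarity(target, profiles["B"]))
```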

4) Clustering
Group by similarity, without a specific purpose
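
One common way to group by similarity is k-means. The sketch below is a deliberately small one-dimensional version with fixed starting centroids; the spending figures and the function name are hypothetical, and the slides do not state which clustering algorithm is meant.

```python
def k_means_1d(values, centroids, iterations=10):
    """Tiny k-means for one-dimensional data with given starting centroids."""
    clusters = []
    for _ in range(iterations):
        # Assignment step: each value joins its nearest centroid.
        clusters = [[] for _ in centroids]
        for v in values:
            idx = min(range(len(centroids)), key=lambda i: abs(v - centroids[i]))
            clusters[idx].append(v)
        # Update step: move each centroid to the mean of its cluster.
        new_centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
        if new_centroids == centroids:
            break  # converged: assignments no longer change the centroids
        centroids = new_centroids
    return centroids, clusters

# Hypothetical monthly spend per customer; note there is no target variable.
spend = [10, 12, 11, 95, 100, 105]
centroids, clusters = k_means_1d(spend, centroids=[0.0, 50.0])
print(centroids, clusters)
```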

5) Co-occurrence grouping
Find associations between items, based on transactions involving them.
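
A minimal sketch of co-occurrence grouping counts how often items appear together in transactions. The baskets below are made up, and `support` and `confidence` are the usual association-rule measures, introduced here as an assumption about what the slides have in mind.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets (one set of items per transaction).
baskets = [
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"bread", "jam"},
    {"milk", "cereal"},
]

item_counts = Counter(item for basket in baskets for item in basket)
pair_counts = Counter(
    pair for basket in baskets for pair in combinations(sorted(basket), 2)
)

def support(a, b):
    """Share of all baskets that contain both items."""
    return pair_counts[tuple(sorted((a, b)))] / len(baskets)

def confidence(a, b):
    """Share of baskets containing `a` that also contain `b`."""
    return pair_counts[tuple(sorted((a, b)))] / item_counts[a]

print(support("bread", "butter"))
print(confidence("butter", "bread"))
```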

6) Profiling
Aims to gain better insight into the profile of customers

7) Link prediction
Try to predict a link between two people (e.g. Facebook's suggested friends)

8) Data reduction
Attempt to replace a large set of data with a smaller set of data that contains as much
of the original information as possible

9) Causal modelling
Attempts to help us understand what events or actions influence others

Expensive techniques:
• Investment in data
• Randomized controlled experiments

Counterfactual analysis

Two high-level primary goals

Prediction
• Using some variables to predict unknown or future values of other variables

Description
• Using some variables to find human-interpretable patterns describing the data
• Often used to work toward a causal understanding of the data

Techniques

1) Classification
2) Regression
3) Clustering
4) Summarization
5) Dependency modelling
6) Change and deviation detection

Supervised vs. unsupervised

“Do our customers naturally fall into different groups?”

> Has no target variable (unsupervised)

“Can we find groups of customers who have a particularly high likelihood of… (defaulting/denying)?”

> Has target variable (supervised)

Supervised
• Has a target variable
• Requires target data
• More meaningful results
• E.g. cat or not a cat

Unsupervised
• Has no target variable
• No guarantee that results are meaningful or useful
• E.g. cat, dog, chicken, …

Classification vs. regression

Will something happen?

(Target is categorical variable, classification)

To what degree will something happen?

(Target is numerical variable, regression)

THE DATA MINING PROCESS

An important difference

1. Building the model: this is done on historical data. You create a model (e.g. what
you do in Weka) using classifiers (which you then translate into a model).
2. Using the model: the translated classifier is put to use in a company. For example,
software can take new data and the built model and make decisions that
management or marketing can work with.
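
The two phases can be sketched as two separate functions: one that learns from labelled historical data and one that applies the result to new, unlabelled data. The single-threshold model and the numbers below are hypothetical and much simpler than a Weka classifier.

```python
def build_model(history):
    """Build phase: pick the threshold on one feature that best separates the labels."""
    best_threshold, best_correct = None, -1
    for threshold in sorted({x for x, _ in history}):
        # Count how many historical rows the rule `x >= threshold` gets right.
        correct = sum((x >= threshold) == label for x, label in history)
        if correct > best_correct:
            best_threshold, best_correct = threshold, correct
    return best_threshold

def use_model(threshold, new_values):
    """Use phase: apply the learned rule to new data without known labels."""
    return [x >= threshold for x in new_values]

# Historical data: (number of support calls, did the customer churn?)
history = [(1, False), (2, False), (3, False), (6, True), (7, True), (8, True)]

threshold = build_model(history)         # done once, on historical data
decisions = use_model(threshold, [2, 9])  # done repeatedly, on new data
print(threshold, decisions)
```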

Knowledge discovery in databases

‘Two’ biggest players:

CRISP-DM

Cross Industry Standard Process for Data Mining

SEMMA

Sample, Explore, Modify, Model and Assess

CRISP-DM

Iteration is the rule

Process is an exploration of data

Business understanding:

• Craft: importance of the analyst's creativity
• Toolset of techniques
• Think about the use scenario

Data understanding:

• Material from which solution will be
constructed
• Strengths and limitations
• Availability and cost of data
• Think about: fraud detection

Data preparation:

• Techniques impose certain requirements on the data
• Think about: missing values, conversions, symbolic or categorical data, numerical values,
normalization of values
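
Three of these preparation steps can be sketched in a few lines. The helper names and the sample values are assumptions for illustration; mean imputation, one-hot encoding, and min-max scaling are just one possible choice for each step.

```python
def impute_mean(values):
    """Missing values: replace None with the mean of the known values."""
    known = [v for v in values if v is not None]
    mean = sum(known) / len(known)
    return [mean if v is None else v for v in values]

def min_max(values):
    """Normalization: rescale values to the range [0, 1] (assumes max > min)."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def one_hot(values):
    """Categorical data: convert each category into a 0/1 indicator column."""
    categories = sorted(set(values))
    return [[1 if v == c else 0 for c in categories] for v in values]

ages = [20, None, 40]                   # hypothetical numerical column
plans = ["basic", "premium", "basic"]   # hypothetical categorical column

clean_ages = min_max(impute_mean(ages))
encoded_plans = one_hot(plans)
print(clean_ages, encoded_plans)
```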

Think about: leaks (= a variable collected in the historical data that gives information about
the target variable, but is not actually available at the time the decisions are made!)
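
One simple guard against leaks is to record, per variable, whether it is known at decision time and to drop the rest before modeling. The column names and the availability flags below are hypothetical.

```python
# Hypothetical columns, flagged by whether they are known when the decision is made.
available_at_decision_time = {
    "monthly_minutes": True,
    "support_calls": True,
    "contract_end_reason": False,  # only recorded after the customer has churned
}

def drop_leaks(row):
    """Keep only the variables that are available at decision time."""
    return {k: v for k, v in row.items() if available_at_decision_time[k]}

historical_row = {
    "monthly_minutes": 400,
    "support_calls": 4,
    "contract_end_reason": "switched provider",
}
print(drop_leaks(historical_row))
```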

Modeling:

• Primary place to apply data mining techniques

Evaluation:

• Assess data mining results
• Test model
• Satisfy business goals?
• Sign-off by stakeholders > comprehensibility
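
Testing a model typically means comparing its predictions with known outcomes on data it has not seen. The sketch below computes confusion counts and accuracy; the labels are made up, and accuracy is only one of several possible measures.

```python
def confusion_counts(actual, predicted):
    """Return (true positives, true negatives, false positives, false negatives)."""
    pairs = list(zip(actual, predicted))
    tp = sum(a and p for a, p in pairs)
    tn = sum((not a) and (not p) for a, p in pairs)
    fp = sum((not a) and p for a, p in pairs)
    fn = sum(a and (not p) for a, p in pairs)
    return tp, tn, fp, fn

def accuracy(actual, predicted):
    """Share of predictions that match the actual outcome."""
    tp, tn, fp, fn = confusion_counts(actual, predicted)
    return (tp + tn) / (tp + tn + fp + fn)

# Hypothetical hold-out set: did the customer churn, and what did the model say?
actual = [True, True, False, False, True]
predicted = [True, False, False, True, True]

print(confusion_counts(actual, predicted))
print(accuracy(actual, predicted))
```

Whether a given accuracy satisfies the business goals still has to be judged against the costs of false positives and false negatives.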

Deployment:

• May be a model
