100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached
logo-home
Summary Data Engineering (MSc Marketing Analytics & Data Science) $10.43   Add to cart

Summary

Summary Data Engineering (MSc Marketing Analytics & Data Science)

 20 views  1 purchase
  • Course
  • Institution

Summary of Data Engineering (EBM213A05). This summary is advised to use by the professor. It covers all mandatory material for the exam.

Preview 3 out of 21  pages

  • January 28, 2023
  • 21
  • 2022/2023
  • Summary
avatar-seller
Summary Data Engineering
Week 1

Different steps to structure the business challenge
1. define managerial/research dilemma
2. define managerial question
3. define research question
4. refine research questions (opportunity tree is used here)
5. define research proposal

What is a management/research dilemma?
- Usually a symptom of an underlying problem
- Usually not hard to identify
- Research dilemma may be either a problem or an opportunity. At this stage you may even have identified
symptoms rather than problems or opportunities.

What is a management question?
- Management dilemma, restated in question form
- Defined in terms of the underlying problem
- Preferably clearly linked to an important KPI (key performance indicator)
- Still managerial in nature; does not specify the research that needs to be done; questions are still abroad
(Further discussion is needed)

What is the difference between a top-down and a bottom-up approach?
- Top-down approach: with a data warehouse increasing data is cleaned and organized into a single
consistent schema before being put into the warehouse.
- Analysis is done directly on the curated warehouse data.
- Pro: Consistency consensus and shared best practices
- Con: No domain knowledge and responsiveness

- Bottom-up approach: with a data lake, incoming data goes into the lake in its raw form.
- We select and organize data for each need.
- Pro: Autonomy, agility, innovation, domain expertise
- Con: Lack of: management, consensus, analytical models, governance

What are research questions? And how do they differ from management questions?
- To find the research question, you have to think about possible management actions to solve the dilemma.
- RQ: asks what research should be conducted; information oriented
- MQ: asks what the decision maker needs to do; action oriented

,What are the three domains of which data science consists of? (guest lecture)

1. Domain Expertise (business): This domain involves having a deep
understanding of the subject matter that the data relates to, in
order to effectively analyze and interpret it. This can also be
considered as Business Intelligence, which is the ability to use data
and analysis to drive business decisions.

2. Technical data engineering: This domain involves using
programming and database management skills to extract, clean,
and organize large sets of data. This includes Data Management,
Data Integration, Data Governance, and Data Quality.

3. Math & Statistical knowledge: This domain involves using statistical
techniques and algorithms to analyze and understand patterns in
data. This includes statistical modeling, probability theory,
optimization, and hypothesis testing.

Explain the opportunity tree and its five components
- From research question to analysis questions
- Sub-questions & factors: who, what, where, when, why
o Use the five W’s for the sub-questions & factors
- Opportunity tree:
1. Business challenge (MQ)
2. Sub-business challenge (sub-MQ’s)
3. Sub-questions (RQ; who, what, where, when, why, which)
4. Factors
5. Hypotheses

What are the four steps discussed when defining a problem? Regarding HBR article.
Step 1: Establish the need for a solution
- What is the basic need?
- What is the desired outcome?
- Who stands to benefit and why?
Step 2: Justify the need
- Is the effort aligned with our strategy?
- What are the desired benefits for the company, and how will we measure them?
- How will we ensure that a solution is implemented?
Step 3: Conceptualize the problem
- What approaches have we tried?
- What have others tried?
- What are the internal and external constraints on implementing a solution?
Step 4: Write the problem statement
- Is the problem actually many problems?
- What requirements must a solution meet?
- Which problem solvers should we engage?
- What information and language should the problem statement conclude?

, Week 2

Name the four components of the ‘Data Science Value Creation Model” and explain how value for a firm is
created by using data science

1. Value objectives (V2F, V2C)
2. Data assets
3. Analytics
4. Value creation
- Capabilities

This model starts with value objectives
that have to be set before developing a
data science strategy. The core data
science strategy elements are data assets
and analytics, which then should lead to
value creation. The data science strategy
should be enabled by data science
capabilities.

Why can’t you just use excel/a spreadsheet for all that data?
- Excel has limited rows; storage space is limited
- Efficiency issues: due to duplicates
- Storage space is limited

What are the two perspectives on value creation?
1. Value to the customer (V2C)
2. Value to the firm (V2F)

What is a database?
A database is a collection of information that is organized so that it can easily be accessed, managed, and updated.

Explain how to balance both value creation perspectives on one dimension each (four cases)




What are fields and records?
A database consists of multiple tables: ‘spreadsheets’ with columns (fields) and rows (records)

What are the three characteristics of big data?
Big data itself has also changed the data landscape. Big data has specific characteristics known as the 3Vs of big data,
posing specific challenges for researchers and managers.
1. Increasing data Volume
2. Increasing data Velocity
3. Increasing data Variety

The benefits of buying summaries with Stuvia:

Guaranteed quality through customer reviews

Guaranteed quality through customer reviews

Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.

Quick and easy check-out

Quick and easy check-out

You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.

Focus on what matters

Focus on what matters

Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!

Frequently asked questions

What do I get when I buy this document?

You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.

Satisfaction guarantee: how does it work?

Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.

Who am I buying these notes from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller SDaan99. Stuvia facilitates payment to the seller.

Will I be stuck with a subscription?

No, you only buy these notes for $10.43. You're not tied to anything after your purchase.

Can Stuvia be trusted?

4.6 stars on Google & Trustpilot (+1000 reviews)

79650 documents were sold in the last 30 days

Founded in 2010, the go-to place to buy study notes for 14 years now

Start selling

Recently viewed by you


$10.43  1x  sold
  • (0)
  Add to cart