Business Analytics for professional People

alatifs1 1 views 80 slides Oct 09, 2025
Slide 1
Slide 1 of 80
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56
Slide 57
57
Slide 58
58
Slide 59
59
Slide 60
60
Slide 61
61
Slide 62
62
Slide 63
63
Slide 64
64
Slide 65
65
Slide 66
66
Slide 67
67
Slide 68
68
Slide 69
69
Slide 70
70
Slide 71
71
Slide 72
72
Slide 73
73
Slide 74
74
Slide 75
75
Slide 76
76
Slide 77
77
Slide 78
78
Slide 79
79
Slide 80
80

About This Presentation

What is Business Analysis from AI perspective and showcasing


Slide Content

Business Analytics Creating frontiers for Business

Course Duration: 3 Days 8 hours per day Participants: 18-20 Course Outline: Day 1 : Introduction to Business Analysis Day 2 : Learning the Tools for Business Analysis Day 3: Understanding the Process of Business Analysis

Day 1

Nol card transactions Ticket machines 01 Bus-/Metro and Taxi vehicle tracking 02 RTA app, S'hail , real-time feedback. 04 Surveys Complaints Reviews (Call Centers, Digital) 05 Passenger Counters Gate Sensors Escalator Activity 03 Expo Events Dubai Run Public Holidays 06 Ticketing Systems GPS & Telematics Mobile Apps Customer Feedback Sensors External Calendars 01 02 03 04 05 06 Together, these sources create a 360° view of operational performance and customer behavior in RTA services. Transportation Data Sources

🔐 Data Governance Accurate Secure Timely Documented C onsistent 3. Timely 2. C onsistent Formats and definitions are standardized across systems and departments. Enables seamless integration and reliable comparisons. Data is updated regularly. Ensures relevance for real-time and strategic decisions. 1. Accurate Data is correct, validated, and free from errors or inconsistencies. 5. Documented 4. Secure Protected from unauthorized access. Ensures compliance with privacy laws and internal policies. Ownership, data sources, and definitions are clearly recorded. Supports traceability, auditing, and informed usage Data Governance 🧠 Example: ❌ A missing GPS timestamp → inaccurate ETA → poor rider experience ✅ Clean trip records → better traffic forecasting and resource planning

Data Preparation & Cleaning Techniques Combine data from multiple sources (e.g., Nol + GPS + feedback) Consolidate Remove duplicates Handle missing or inaccurate values Correct entry errors (e.g., route codes, timestamps) Clean • Keep relevant periods (e.g., past 90 days) • Exclude test data or cancelled trips Filter • Check logic rules (e.g., trip start time < end time) • Detect anomalies (e.g., 500+ passengers in 1-minute window) Validate • Standardize formats (e.g., dates, station names) • Create calculated columns (e.g., delay duration) Transform “Data is like garbage, You'd better know what you are going to do with it before you collect it” Mark Twain

KPI Dashboards & Visualization Dashboards transform transportation data into quick, actionable insights . The Right KPIs, when visualized correctly, will improve service delivery, efficiency, and customer satisfaction. “Good KPIs tell the story in seconds. Great dashboards help teams act on it instantly.”

Show trends over time Ridership by hour/day Compare across categories On-time rate across bus routes Show part-to-whole (limited categories) Passenger type distribution (Adult/Student) Show volume + trend together Passenger flow through stations Display a single metric clearly Avg wait time today Show intensity across locations Complaint volume per station Show correlation between 2 variables Delay vs. distance traveled Show parts of a total and compare across categories. Ridership split by type (regular, senior, student) across metro lines Choosing the Right Chart Type Before building dashboards, it's important to match each KPI with the best chart type The wrong visual can confuse — the right one reveals patterns instantly. Show parts of a total and compare

What is Business Analysis? How to improve business Using Analytics for improvement Gaining insights from business Continuous improvement

Individual Activity

SCRS

Individual Activity

PESTLE P – Political E – Economic S – Social T – Technology L – Legal E – Environment

Group Activity

Porter’s 5 Force Bargaining Power of Customers Buyer Concentration Buyer Volume Buyer Switching Costs Buyer Information Ability to integrate backward Substitute products Price/ Total Purchases Product Differences Brand Identity Impact of Quality/ Performance Buyer Profits Threat Of New Entry Economies of Scale Proprietary Product Differences Brand Identity Switching Costs Capital Requirements Access to distribution Absolute Cost Advantages Government Policy Expected Retaliation Bargaining Power of Suppliers Differentiation of inputs Switching Costs Presence of substitute inputs Supplier Concentration Importance of Volume to supplier Cost Relative to Total Purchase Impact of inputs on cost or difference Threat of forward integration Threat of Substitutes Differentiation of inputs Relative Price Performance of Substitutes Switching Costs Buyer Propensity to substitute Existing Competitors Industry Growth Fixed Costs/ Value Add Over Capacity Product Differences Brand Identity Switching costs Concentration and Balance Informational Complexity Diversity of Competitors Corporate Stakes Exit Barriers

Data & Analytics What is data? How is data related to business? Real Life Analytics Understanding the data

Data Information in raw or unorganized form (such as alphabets, numbers, or symbols) that refer to, or represent, conditions, ideas, or objects. Data is limitless and present everywhere in the universe. Reference: http://www.businessdictionary.com/definition/data.html

Edward Deming quote 'In God we trust, everyone else bring data"

Data and business business needs valuable data and insights Understanding your target audience and customers preferences Data Analytics is a combination of all the processes and tools related to utilizing and managing large/Small data sets Integrating physical and digital shopping spheres Assessing Trends and measures to reduce deviations

Data and business Improved Service Level Performance Better Order Fulfillment Improved Supplier Management Maximise Customer Value Driving Down Costs Improved Advertising Better Product Management

Data data ‘ deɪtə ’ noun the quantities, characters, or symbols on which operations are performed by a computer, which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media.

Understanding data Unbiased Data Achieve a large sample Ask the right questions (Quantitative & Qualitative) Interpret the data correctly Consider Margin of Error Create a Data Checklist Use Statistical Analysis

Activity (25 Minutes) Objective: To collect data to benefit the learning of Business Analytics Target Group: Attendees of the workshop Team Size: 4-5 Members

Video The Blind Men and the Elephant

Data Modeling Types of Data Models Data Modeling Data Modeling Process Constructing Data Models

Types of Data models

Hierarchical data model

Relational Model

Network model

object oriented model

Object-relational model

Data Modeling Data Modeling is a method of defining and analyzing data requirements needed to support the business functions of an enterprise. Data modeling is the act of exploring, understanding and designing data-oriented structures. You identify entity types their purpose and then relationships among them. Structuring the data to understand it’s value Pictorial and visual aides to help in the data framing

Exercise (1 hour) Create your data base on the models shared Object is to learn the patterns of the persons who are attending this course What is the objective of collecting the data? What are the data points you will collect? What is the interpretation of the data collected?

Points to be noted Identify entities and classify them into their types Identify attributes of each entity Define and apply naming conventions Create standards for commonly used attributes Identifying relationships among entities Applying data model patterns Create and assigning key attributes Perform Normalization to maintain data integrity and reduce redundancy De-normalizing to improve performance

Day 2

Analytics Descriptive analytics – Past Data Predictive analytics – Future data Prescriptive analytics – Past and Future data Diagnostic analytics – Root Cause Analysis

Predictive Analytics

Correlation The correlation is one of the most common and most useful statistics. A correlation is a single number that describes the degree of relationship between two variables. Let's work through an example to show you how this statistic is computed. Seek the height of each individual in your class and correlate the data with a histrogram .

Regression Regression  analysis is a statistical process for estimating the relationships among variables. It includes many techniques for modeling and analyzing several variables, when the focus is on the relationship between a dependent variable and one or more independent variables (or 'predictors').

Scatter Plots Scatter plots are used to establish relationship between two characteristics. The Factor which supposed to determine the value of other factor is called Independent variable. Factor which is assumed to be impacted by the other factor is called Dependent variable. In the equation, y = f (x). X is assiumed to affect the value of y. ie ., y is depending on x Hence, x is the independent variable and y is dependent variable. Always plot independent variable in x axis . In this example, each dot shows one person's weight versus his height.

Relationships

Regression Analysis – Linear Model A linear relationship simply means that a change of a given size in x produces a proportionate change in y. Described by the line equation y = a + bx

Regression - Exercise A Gold ornaments selling company is interested in increasing the sales of it’s outlet and cost is a concern. The company has collected data from it’s past sales on the number of products sold and period it sold it in. The company is looking to increase their sales in the coming year. Chart the data points and data to be captured from the customers to improve the sales based on past data.

Multiple R.  This is the correlation coefficient. It tells you how strong the linear relationship is. For example, a value of 1 means a perfect positive relationship and a value of zero means no relationship at all. It is the square root of r squared (see #2). R squared . This is r 2 , the Coefficient of Determination. It tells you how many points fall on the regression line. for example, 80% means that 80% of the variation of y-values around the mean are explained by the x-values. In other words, 80% of the values fit the model. Adjusted R square.  The adjusted R-square adjusts for the number of terms in a model. You’ll want to use this instead of #2 if you have more than one x variable. Standard Error of the regression:  An estimate of the standard deviation of the error μ. This is  not  the same as the standard error in descriptive statistics! The standard error of the regression is the precision that the regression coefficient is measured; if the coefficient is large compared to the standard error, then the coefficient is probably different from 0. Observations . Number of observations in the sample.

EXPLAINED PART TWO: ANOVA SS = Sum of Squares. Regression MS = Regression SS / Regression degrees of freedom. Residual MS = mean squared error (Residual SS / Residual degrees of freedom). F: Overall F test for the null hypothesis. Significance F: The significance associated P-Value.

INTERPRET REGRESSION COEFFICIENTS Coefficient: Gives you the least squares estimate. Standard Error: the least squares estimate of the standard error. T Statistic: The T Statistic for the null hypothesis vs. the alternate hypothesis. P Value: Gives you the p-value for the hypothesis test. Lower 95%: The lower boundary for the confidence interval. Upper 95%: The upper boundary for the confidence interval.

Time Series Forecasting An ordered sequence of values of a variable at equally spaced time intervals Can be used for: Sales Forecasting Budgetary Analysis Stock Market Analysis Yield Projections Process and Quality Control Inventory Studies Workload Projections

https://www.youtube.com/watch?v=gHdYEZA50KE

Standard Deviation

Hypothesis Testing Hypothesis testing is a statistical procedure in which a choice is made between a null hypothesis and an alternative hypothesis based on information in a sample. The null hypothesis, denoted H0, is the statement about the population parameter that is assumed to be true unless there is convincing evidence to the contrary. The alternative hypothesis, denoted Ha, is a statement about the population parameter that is contradictory to the null hypothesis, and is accepted as true only if there is convincing evidence in favor of it.

Hypothesis Testing Null hypothesis stands true until and otherwise proved wrong. Accused is considered as Innocent until and otherwise proven guilty.

Hypothesis Testing We want to take a practical problem and change it to a statistical problem We use relatively small samples to estimate population parameters There is always a chance that we can select a “weird” sample Sample may not represent a “typical” set of observations Inferential statistics allows us to estimate the probability of getting a “weird” sample

Why do Hypothesis Testing? To determine whether there is a difference between processes To validate process improvements (to prove that the process improvements done are yielding results) To identify the factors which impact the Mean or Standard deviation

Errors Type-1 (a) Error: Rejecting the Null hypothesis when it is true a-value indicate the probability of Type-1 error happening Type-2 (b) Error: Failing to reject Null hypothesis when it is false b-value indicates the probability of Type-2 error happening

Hypothesis & Risk When accepting or rejecting a hypothesis, we do so with a known degree of risk and confidence To do so, we specify in advance of the investigation the magnitude of decision risk and test sensitivity which is acceptable

Hypothesis and Decision Risk At  level of significance, the degree of confidence in our decision Is (1- ) which is called confidence coefficient.

Prescriptive Analytics

FMEA Failures are any errors or defects, especially ones that affect the customer, and can be potential or actual. Failure Mode is the ways or modes by which something might fail. Effect Analysis refers to studying the consequences of those failures

Steps to construct FMEA For each of failure modes identified, give scoring (from 1 to 10) based on below table Category Criteria Question Highest Score (10) Lowest Score (1) Severity How severe are the consequences of this failure mode? Fatal / very high No risk Occurrence How frequently it happens? Always Never / Very rare Detection How quickly the failure could be identified by our system? We cant identify, we will come to know this upon consumption by customer The failure will be detected as it happens and will not move further

Failure Mode & Effects Analysis Process / product FMEA Date (Original) FMEA Team (Revised) Project Name Page: Of Process Actions Results Process steps / requirements Potential failure mode Potential effects of failure Severity Potential cause(s) of failure Occurrence Current controls Detection Risk priority number Recommended action Responsibility and target completion date Action taken Severity Occurrence Detection Risk priority number Text Text Text Number Text Number Text Number calculation Text Text Text Number Number Number Calculation

Cause and Effect Diagram This is also known as Ishikawa Diagram or Fish-bone Diagram. Dr Ishikawa suggested that every problem we face in our shop floor could be caused by any of these six factors, viz., Machine, Method, Material, Measurement, Man and Mother Nature. It is not recommended to add a seventh factor. Machine Method Material Measurement Man Mother Nature 6 Category of causes Effect Poor Process Capability

Cause and Effect Diagram In true sense, Fishbone diagram is a stand alone tool for root cause analysis, where brainstorming is the start point, followed as category brainstorming. Root cause analysis is done by drawing branches to the main 6 bone of fish. But in DMAIC, we spilt the functions as Brainstorming or FMEA Fish-bone for categorical analysis and Root Cause analysis using Why-Why Analysis Hence, after brainstorming, fit every primary cause in appropriate category on fishbone diagram and ensure all the 6 bones of the fish is having adequate flesh (number of causes)

Root Cause Analysis using Why-Why Why-Why Analysis is a simple and effective method of finding out root causes of a problem. It is a method of questioning all what we know about the problem with why (all potential causes we identified in brainstorming or FMEA). This relentless questioning opens up our thinking process and lead to root cause of the problem. Each of the possible causes listed in the brainstorming or FMEA is questioned and we need to have at least 2 root causes per possible cause.

Prioritisation of Root Causes The scoring criteria are as below. Category Criteria Question Highest Score (10) Lowest Score (1) Severity How strongly the root cause and the problem (project y) are connected? Or how much % of problem is associated with this cause? They are strongly related. This root cause give raise to 100% of the problem. Seemingly no connection Occurrence How frequently it happens? Always Never / Very rare Controllability How quickly and completely the team can take actions on this root cause. (team’s authority, expertise and technology) Team can act right away. They are adequately authorised and skilful. Team has to cross multiple approvals and also needs additional expertise

Day 3

Test Assumptions

Analyzing Quantitative Data Frequency Analysis Difference Analysis Qualitative Data Descriptive Analysis Interpretive Analysis

Brain Storming SCOT Cause and Effect Comparative Analysis

Data Visualization Charts Flow Charts Graphs Pictures Map

Software SAS Python Programming R programming Microsoft Excel Google Spreadsheets/ Fusion Tables Tableau (Online tool)

MaxStat  by MaxStat Software Complete statistics package with intuitive user interface and easily understandable results. Designed for researchers and students SPSS  by IBM Predictive Analytics can uncover unexpected patterns and associations and develop models to guide front-line interaction Minitab 17  by Minitab Analyze your data and improve your products and services with the leading statistical software used for quality improvement worldwide. DataMelt (" Dmelt ")  by jWork.ORG Data analysis, math and data visuzalization program which combines the power of Python and Java (free) Analytica  by Lumina Decision Systems Analytica is a powerful, stand-alone application for visual quantitative modeling with a full array of statistical analysis functions.

Statwing  by Statwing Statwing chooses statistical tests automatically, then reports results in plain English. Statwing is delightful and efficient analysis. Stata  by StataCorp Stata statistical software is a complete, integrated statistical software package. STEM  by Princeton National Surveys Excel macro package for running statistical tests on summary data. Output arranged to easily produce graphs in PowerPoint. XLSTAT  by Addinsoft Variety of tools to enhance the analytical capabilities of Excel, making it the ideal for data analysis and statistics requirements. AcaStat  by AcaStat Software Statistical software and instructional aids to help you quickly organize and analyze data. AlterWind Log Analyzer  by AlterWind A log analyzer tool for determining the basic characteristics of the hits on your site. Analyse-it  by Analyse-it Software Statistical analysis software for researchers in environmental & life sciences, engineering, manufacturing and education. Analysis Studio  by Appricon Provides an end to end model generation process designed for fast development, analysis and deployment.

ChemStat  by Starpoint Software Full featured RCRA compliant statistical analysis of ground water data. CoPlot  by CoHort Software A program for making publication-quality maps, technical drawings, and 2D and 3D scientific graphs; includes a statistical add-on. Decision Analyst STATS  by Decision Analyst Windows-based statistical software for marketing research.

Decision Science  by Stone Analytics Embeddable analytic engines designed for integration into a wide variety of enterprise applications. Develve  by Develve Statistical Software Statistical software for fast and easy analysis. Basic statistics, Design of Experiments, Gauge R&R and Sample size calculations. EasyFit  by MathWave Technologies Data analysis and simulation software with data management, reporting functionality for probability distribution selection

ESBStats  by ESB Consultancy Statistical Analysis and Inference Software Package for Windows. Forecast Pro  by Business Forecast Systems A standalone analytic tool for business forecasting that combines proven statistical methods with an intuitive interface. JMP Statistical Software  by JMP Statistical Software JMP, data analysis software for scientists and engineers, links dynamic data visualization with powerful statistics, on the desktop. KnowledgeSTUDIO  by Angoss Business intelligence and predictive analytics software suite with decision tress and data visualization.

MATLAB  by The MathWorks A programming environment for algorithm development, data analysis, visualization, and numerical computation. MedCalc  by MedCalc Software A complete Windows-based statistical program for biomedical researchers. Number Analytics  by Number Analytics Provides statistical analytics software for business users, pricing & promotion optimization, conjoint analysis, new product design. PolyAnalyst  by Megaputer Intelligenc Offers a comprehensive selection of algorithms for automated analysis of text and structured data. Predictive Suite  by Predictive Dynamix Computational intelligence software for data mining analysis and predictive modeling . Scilab  by Scilab Enterprises Open source software for numerical computation providing a computing environment for engineering and scientific applications.

SigmaPlot  by Systat Software Systat Software presents award winning scientific data analysis software. The R Project  by R- Project.Org Software environment for statistical computing and graphics (free). Orange  by Biolab.Si Open source data visualization and data analysis for novice and expert (free) Statistix  by Analytical Software Easy-to-use, comprehensive statistics and data manipulation Weka  by Waikato Machine Learning Group Machine learning algorithms for data mining tasks software (free). UNISTAT  by UNISTAT A statistical software package featuring a statistics add-in for Excel data analysis, charting and presentation-quality reporting
Tags