What is data analytics ,Data science,Data processing chain,regression,decision Tress.ppt

hannahroseline2 97 views 33 slides Jun 30, 2024
Slide 1
Slide 1 of 33
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33

About This Presentation

about Data analytics.


Slide Content

What is data analytics?
Mostcompaniesarecollectingloadsofdataallthetime—but,initsrawform,thisdata
doesn’treallymeananything.Thisiswheredataanalyticscomesin.Dataanalyticsisthe
processofanalyzingrawdatainordertodrawoutmeaningful,actionableinsights,
whicharethenusedtoinformanddrivesmartbusinessdecisions.
Adataanalystwillextractrawdata,organizeit,andthenanalyzeit,transformingitfrom
incomprehensiblenumbersintocoherent,intelligibleinformation.Havinginterpretedthe
data,thedataanalystwillthenpassontheirfindingsintheformofsuggestionsor
recommendationsaboutwhatthecompany’snextstepsshouldbe.
,businessesandorganizationsareabletodevelopamuchdeeperunderstandingoftheir
audience,theirindustry,andtheircompanyasawhole—and,asaresult,aremuchbetter
equippedtomakedecisionsandplanahead.

Youcanthinkofdataanalyticsasaformofbusinessintelligence,usedtosolvespecificproblemsand
challengeswithinanorganization.
It’sallaboutfindingpatternsinadatasetwhichcantellyousomethingusefulandrelevantabouta
particularareaofthebusiness—howcertaincustomergroupsbehave,forexample,orhowemployees
engagewithaparticulartool.
Dataanalyticshelpsyoutomakesenseofthepastandtopredictfuturetrendsandbehaviors;ratherthan
basingyourdecisionsandstrategiesonguesswork,you’remakinginformedchoicesbasedonwhatthedata
istellingyou.
Howbusinessesusedataanalytics
Armedwiththeinsightsdrawnfromthedata,businessesandorganizationsareabletodevelopamuch
deeperunderstandingoftheiraudience,theirindustry,andtheircompanyasawhole—and,asaresult,are
muchbetterequippedtomakedecisionsandplanahead.

What’s the difference between data analytics
and data science?
Dataanalytics
Adataanalystwillseektoanswerspecificquestionsoraddressparticular
challengesthathavealreadybeenidentifiedandareknowntothebusiness.
Todothis,theyexaminelargedatasetswiththegoalofidentifyingtrends
andpatterns.Theythen“visualize”theirfindingsintheformofcharts,
graphs,anddashboards.Thesevisualizationsaresharedwithkey
stakeholdersandusedtomakeinformed,data-drivenstrategicdecisions.

Data science:
•Adatascientist,ontheotherhand,considerswhatquestionsthebusiness
shouldorcouldbeasking.
•Theydesignnewprocessesfordatamodeling,writealgorithms,devise
predictivemodels,andruncustomanalyses.
•Forexample:Theymightbuildamachinetoleverageadatasetandautomate
certainactionsbasedonthatdata—and,withcontinuousmonitoringand
testing,andasnewpatternsandtrendsemerge,improveandoptimizethat
machinewhereverpossible.

What are the different types of data analysis?
Now we have a working definition of data analytics,
let’s explore the four main types of data
analysis:descriptive,diagnostic,predictive,
andprescriptive.

What is the typical process that a data analyst will
follow?
Nowwe’vesetthesceneintermsoftheoveralldataanalystrole,
let’sdrilldowntotheactualprocessofdataanalysis.Here,we’ll
outlinethefivemainstepsthatadataanalystwillfollowwhen
tacklinganewproject:

Define the question(s) you want to answer
•The first step is to identifywhy you are conducting
analysisandwhat question or challenge you hope to solve.
•At this stage, you’ll take a clearly defined problem and come up with
a relevant question or hypothesis you can test. You’ll then need to
identify what kinds of data you’ll need and where it will come from.
•For example: A potential business problem might be that customers
aren’t subscribing to a paid membership after their free trial ends.
Your research question could then be “What strategies can we use to
boost customer retention?”

Collect the data
•Withaclearquestioninmind,you’rereadytostartcollectingyour
data.Dataanalystswillusuallygatherstructureddatafromprimaryor
internalsources,suchasCRMsoftwareoremailmarketingtools.
•Theymayalsoturntosecondaryorexternalsources,suchasopendata
sources.Theseincludegovernmentportals,toolslikeGoogleTrends,
anddatapublishedbymajororganizationssuchasUNICEFandthe
WorldHealthOrganization.

Clean the data
•Onceyou’vecollectedyourdata,youneedtogetitreadyforanalysis—
andthismeansthoroughlycleaningyourdataset.Youroriginaldataset
maycontainduplicates,anomalies,ormissingdatawhichcoulddistort
howthedataisinterpreted,sotheseallneedtoberemoved.Data
cleaningcanbeatime-consumingtask,butit’scrucialforobtaining
accurateresults.

Analyze the data
•Now for the actual analysis! How youanalyze the datawill depend
on the question you’re asking and the kind of data you’re working
with, but some common techniques include regression
analysis,cluster analysis, and time-series analysis (to name just a
few).
•We’ll go over some of these techniques in the next section. This step
in the process also ties in with the four different types of analysis we
looked at in section three (descriptive, diagnostic, predictive, and
prescriptive).

Visualize and share your findings
•Thisfinalstepintheprocessiswheredataistransformedintovaluable
businessinsights.Dependingonthetypeofanalysisconducted,you’ll
presentyourfindingsinawaythatotherscanunderstand—intheformofa
chartorgraph,forexample.
•Atthisstage,you’lldemonstratewhatthedataanalysistellsyouinregards
toyourinitialquestionorbusinesschallenge,andcollaboratewithkey
stakeholdersonhowtomoveforwards.Thisisalsoagoodtimeto
highlightanylimitationstoyourdataanalysisandtoconsiderwhatfurther
analysismightbeconducted.

What skills do you need to become a data
analyst?
Hardskills
•Mathematicalandstatisticalability
•KnowledgeofprogramminglanguagessuchasSQL,R,orPython
•Ananalyticalmindset
Softskills
Keenproblem-solvingskills
Excellentcommunicationskills
Adaptability

•Dataanalyticsisanimportantfieldthatinvolvestheprocessofcollecting,
processing,andinterpretingdatatouncoverinsightsandhelpinmaking
decisions.Dataanalyticsisthepracticeofexaminingrawdatatoidentify
trends,drawconclusions,andextractmeaningfulinformation.Thisinvolves
varioustechniquesandtoolstoprocessandtransformdataintovaluable
insightsthatcanbeusedfordecision-making.
•wewilllearnaboutDataanalytics,datawhichwillhelpbusinessesand
individualsthatcanhelpthemtoenhanceandsolvecomplexproblems,Types
ofDataAnalytics,Techniques,Tools,andtheImportanceofDataAnalytics

What is Data Analytics?
Inthisnewdigitalworld,dataisbeinggeneratedinanenormousamount
whichopensnewparadigms.Aswehavehighcomputingpoweranda
largeamountofdatawecanusethisdatatohelpusmakedata-driven
decisionmaking.Themainbenefitsofdata-drivendecisionsarethatthey
aremadeupbyobservingpasttrendswhichhaveresultedinbeneficial
results.

Understanding Data Analytics
•Dataanalyticsencompassesawidearrayoftechniquesforanalyzingdatatogainvaluable
insightsthatcanenhancevariousaspectsofoperations.Byscrutinizinginformation,
businessescanuncoverpatternsandmetricsthatmightotherwisegounnoticed,enabling
themtooptimizeprocessesandimproveoverallefficiency.
•Forinstance,inmanufacturing,companiescollectdataonmachineruntime,downtime,and
workqueuestoanalyzeandimproveworkloadplanning,ensuringmachinesoperateat
optimallevels.
•Beyondproductionoptimization,dataanalyticsisutilizedindiversesectors.Gamingfirms
utilizeittodesignrewardsystemsthatengageplayerseffectively,whilecontentproviders
leverageanalyticstooptimizecontentplacementandpresentation,ultimatelydrivinguser
engagement.

Types of Data Analytics
•There are four major types of data analytics:
•Predictive (forecasting)
•Descriptive (business intelligence and data mining)
•Prescriptive (optimization and simulation)
•Diagnostic analytics

What is Business Intelligence?
•BusinessIntelligenceisoneofthemostpowerfultoolsmany
organizationsusetoknowtheircustomerbaseandmarketbetter.It
describesthebusinessmethodologyinwhichtherawdatais
transformedintousefulinformationwhichhelpsindecisionmaking.
BenefitsofBusinessIntelligence
•Businessintelligencehasbroadapplications,andiftalkingaboutthe
benefitsofbusinessintelligenceintheretailsector,nowadaysbusiness
intelligencetoolsenableorganizationstotakebenefitofdatanotonly
toassumecurrentsalesbutalsotoestimatefuturepotential,patterns,
trendsandknowthedemandofthecustomeronadeeperlevel.

Pattern
•Patterniseverythingaroundinthisdigitalworld.Apatterncaneither
beseenphysicallyoritcanbeobservedmathematicallybyapplying
algorithms.
•Example:Thecolorsontheclothes,speechpattern,etc.Incomputer
science,apatternisrepresentedusingvectorfeaturevalues.

What is Pattern Recognition?
Patternrecognitionistheprocessofrecognizingpatternsbyusinga
machinelearningalgorithm.Patternrecognitioncanbedefinedasthe
classificationofdatabasedonknowledgealreadygainedoronstatistical
informationextractedfrompatternsand/ortheirrepresentation.Oneofthe
importantaspectsofpatternrecognitionisitsapplicationpotential.
Examples:Speechrecognition,speakeridentification,multimedia
documentrecognition(MDR),automaticmedicaldiagnosis.

TemporalPattern
Itissomethingthatregularlyoccursovertime.
Ex:
Atemporalrulewouldbethat"somepeoplearealwayslate,"nomatter
whattheoccasionortime.Somepeoplemaybeawareofthispattern
andsomemaynot.Understandingapatternlikethiswouldhelp
dissipatealotofunnecessaryfrustrationandanger.Onecanjustjoke
thatsomepeopleareborn"10minuteslate,"andlaughitaway.
Similarly,Parkinson'sLawstatesthatworkexpandstofillupallthe
timeavailabletodoit.

SpatialPattern
Patternscanalsobespatialsuchasthingsbeingorganizedincertainway.
FunctionalPatterns
Patternscanbefunctionalwhichmeansdoingcertainthingsleadsto
certaineffects
Afunctionalpatternmayinvolvetest-takingskills.Somestudents
performwellonessay-typequestions.Othersdowellinmultiple-choice
questions.Yetotherstudentsexcelindoinghands-onprojects,orinoral
presentations.Anawarenessofsuchapatterninaclassofstudentscan
helptheteacherdesignabalancedtestingmechanismthatisfairtoall.

Finding a Pattern
•Diamondminingistheactofdiggingintolargeamountsofunrefinedoretodis-cover
preciousgemsornuggets.Similarly,dataminingistheactofdiggingintolargeamountsof
rawdatatodiscoveruniquenontrivialusefulpatterns.Dataiscleanedup,andthenspecial
toolsandtechniquescanbeappliedtosearchforpatterns.Divingintocleanandnicely
organizeddatafromtherightperspectivescanincreasethechancesofmakingtheright
discoveries.
•Askilleddiamondminerknowswhatadiamondlookslike.Similarly,askilleddataminer
shouldknowwhatkindsofpatternstolookfor.Thepatternsareessentiallyaboutwhathangs
togetherandwhatisseparate.Therefore,know-ingthebusinessdomainwellisvery
important.Ittakesknowledgeandskilltodiscoverthepatterns.Itislikefindinganeedleina
haystack.Sometimesthepatternmaybehidinginaplainsight.Atothertimes,itmaytakea
lotofworkandlookingfarandwide,tofindsurprisingusefulpatterns.Thus,asystematic
approachtominingdataisnecessarytoefficientlyrevealvaluableinsights.

Use of Pattern
Organizationscanfindoutanemployee'sarrivaltimeattheoffice
bywhentheircellphoneshowsupintheparkinglot.Observingthe
recordoftheswipeoftheparkingpermitcardinthecompany
parkinggarage,caninformtheorganizationwhetheranemployee
SȚintheofficebuildingoroutoftheofficeatanymomentintime.

Data Processing Chain

DATA:
•Anything that is recorded is data. Observations and facts are data: anecdotes Data and
opinions are also data, of a different kind.
•Data can be numbers. like the record of daily weather, or daily sales. Data can be
alphanumeric, such as the names of employees and customers O can come from any
number of sources from operational records 1n-Data side an organization, or from records
compiled by the industrial bodies and government agencies.
•Data can come from individuals telling stories from memory and from people's interaction in
social contexts, or from machines reporting their own status or from logs of web usage Data
can come in many ways it may come as paper reports, or as a file stored on a computer.
•It may be words spoken over the phone. It may be e-mail or chat on the Internet or may
come as movies and songs in DVDs, and so on. There is also data about data that is called
metadata. For example, people regularly upload videos on YouTube.
•The format of the video file (whether high resolution or lower resolution) is metadata. The
information about the time of uploading is metadata. The account from which it was
uploaded is also metadata. The record of downloads of the video is also metadata

Database
•Database A database is a modeled collection of data that is accessible in
many ways.
•A data model can be designed to integrate the operational data of the
organızation. 'The data model abstracts the key entities involved in an
action and their relation-ships.
•Most databases today follow the relational data model and its variants.
Each data modeling technique imposes rigorous rules and constraints to
ensure the integrity and consistency of data over time.

Data Warehouse
•Data Warehouse A data warehouse is an organized store of data from all over
the organization specially designed to help make management decisions.
Data can be extracted from operational database to answer a particular set of
queries.
•This data, com binedwith other data, can be rolled up to a consistent
granularity and uploaded to a separate data store called the data warehouse.
Therefore. the data warehouse is a simpler version of the operational
database, with the purpose of addressing reporting and decision-making
needs only.
•The data in the warehouse cumula-tivelygrows as more operational data
becomes available and is extracted and appended to the data warehouse.
Unlike in the operational database, the data values in the warehouse are not
updated

Data Mining
•Data Mining is the art and science of discovering useful and innovative
patterns from data. There is a wide variety of patterns that can be found in
the data. There are many techniques, simple or complex, that help with
finding patterns.

Data Visualization
•Data Visualization As data and insights grow in number, a new
requirement is the ability of the executives and decision makers to absorb
this information in real time. There BSI limit to human comprehension and
visualization capacity. That is a good reason to prioritize and manage with
fewer but key variables that relate directly to the Key Result Areas (KRAs) of
a role.
•Here are a few considerations when presenting using data: = Present the
conclusions and not just report the data. Choose wisely from a palette of
graphs to suit the data. Organize the results to make the central point stand
out Ensure that the visuals accurately reflect the numbers. Inappropriate
visu-als can create misinterpretations and misunderstandings = Make the
presentation unique, imaginative and memorable
Tags