DS-Visualization-Unit-4 COMPUTER SCIENCE.pdf

coreyanderson7866 12 views 14 slides Aug 28, 2024
Slide 1
Slide 1 of 14
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14

About This Presentation

D DATA SCIENCE VISUALISATION UNIT-4 (COMPUTER SCIENCE)


Slide Content

BIG DATA
ANALYTICS
DR. OWAIS BHAT

DATA-SCIENCE APPLICATIONS
Data science is a rapidly growing field that is being used in a wide variety of
applications.

Fraud detection:Data science can be used to identify and prevent fraudulent
transactions. For example, banks use data science to identify suspicious
activity in credit card transactions.


Customer segmentation:Data science can be used to segment customers into
groups based on their characteristics. This information can be used to target
customers with specific marketing messages.

Product recommendations:Data science can be used to recommend products
to customers based on their past purchases or browsing history. This
information can be used to increase sales and improve customer satisfaction.

Risk assessment:Data science can be used to assess the risk of an event
happening. For example, insurance companies use data science to assess the
risk of a customer filing a claim.


Personalized medicine:Data science can be used to personalize medical
treatment for patients. For example, doctors can use data science to identify
the best treatment for a patient based on their individual characteristics.

RECENT TRENDS IN DATA COLLECTION & ANALYSIS

The rise of real-time data collection and analysis.

The increasing use of artificial intelligence (AI) and machine learning (ML) for
data analysis.

The growing popularity of cloud-based data collection and analysis tools.

The increasing focus on data privacy and security

Some of the most popular technologies for data visualization in data science:

Matplotlib:Matplotlib is a Python library for creating static, animated, and
interactive visualizations. It is a popular choice for data scientists because it is
easy to use and versatile.

Seaborn:Seaborn is a Python library that builds on Matplotlib to provide a
high-level interface for creating attractive and informative visualizations. It is a
good choice for data scientists who want to create visualizations that are both
visually appealing and easy to understand.

Plotly:Plotlyis a Python library for creating interactive visualizations that can
be embedded in web pages or documents. It is a good choice for data scientists
who want to create visualizations that can be shared and explored online.
Tableau:Tableau is a commercial data visualization software that is known for
its ease of use and interactive capabilities. It is a good choice for businesses
and organizations that need to create data-driven visualizations for non-
technical audiences.
QlikSense:QlikSense is another commercial data visualization software that is
known for its speed and scalability. It is a good choice for businesses that need
to process large amounts of data quickly and create interactive visualizations.

DATA-SCIENCE –R LANGUAGE
R is a powerful programming language for data science, and it can be used to
develop a variety of applications. Some of the most common application
development methods in data science using R include:
Shiny:

Shiny is an R package that facilitates the creation of interactive web
applications directly from R scripts.

It allows data scientists to build dynamic dashboards, visualizations, and data-
driven web interfaces without extensive web development knowledge.

Shiny apps can be hosted online or deployed on local servers.

R Markdown:

R Markdown is a versatile tool for creating reproducible reports, documents,
and presentations that integrate R code, visualizations, and narrative text.

It enables data scientists to weave code, output, and text into a single
document, making it easy to share insights and analysis.

Plumber:

Plumber is an R package for building APIs (Application Programming
Interfaces) using R code.

Data scientists can create RESTfulAPIs to expose R models, functions, or data
processing pipelines for integration with other applications.
RStudioConnect:

RStudioConnect is a platform that allows you to publish and share Shiny apps,
R Markdown documents, and Plumber APIs securely within your organization.

It simplifies the deployment and management of R-based applications.

R Packages:

R allows you to develop custom R packages that encapsulate functions, data,
and documentation for specific data science tasks.

Packages can be shared and reused across projects, enhancing code
modularity and reusability.
R with SQL Databases:

R can be integrated with SQL databases using packages like RMySQLor DBI,
enabling data retrieval, analysis, and visualization directly from databases.

This is useful for building data-driven applications that fetch and process data
from relational databases.

THANK YOU
[email protected]
Tags