Advanced Data Analytics using Knime and Excel

ApoorvaNaik12 84 views 27 slides Jun 08, 2024
Slide 1
Slide 1 of 27
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27

About This Presentation

knime


Slide Content

Doing Data Science with Excel and Knime

MAKING LIFE EASIER WITH EXCEL The benefit of using excel in a data science : Offers a fast and easy way to get up close and personal with your data. To browse every data point in your dataset Well-compartmentalized Advanced functionality Simplify your data cleanup and analysis tasks

Main features Filters : Filters are useful for sorting out all records that are irrelevant to the analysis at hand . » Conditional formatting : By using conditional formatting, you can easily detect outliers and trends in your tabular datasets. » Charts : Charts have long been used to visually detect outliers and trends in data, so charting is an integral part of almost all data science analysis

Filtering in Excel Data filtering is the process of examining a dataset to exclude, rearrange, or apportion data according to certain criteria . For example, data filtering may involve finding out the total number of sales per quarter and excluding records from last month.

Filter a range of data Select any cell within the range. Select Data > Filter. Select the column header arrow . Select Text Filters or Number Filters, and then select a comparison, like Between. Enter the filter criteria and select OK.

Conditional formatting Conditional formatting makes it easy to highlight certain values or make particular cells easy to identify. This changes the appearance of a cell range based on a condition (or criteria). You can use conditional formatting to highlight cells that contain values which meet a certain condition.

Excel charting Charts are visual representations of data used to make it more understandable. Commonly used charts are: Pie chart Column chart Line chart Different charts are used for different types of data.

Excel charting

Reformatting and summarizing with pivot tables A Pivot Table is a powerful tool to calculate, summarize, and analyze data that lets you see comparisons, patterns, and trends in your data.  It is  an interactive way to quickly summarize large amounts of data. It is used to analyze numerical data in detail, and answer unanticipated questions about your data. It is especially designed for: Querying large amounts of data in many user-friendly ways.

pivot table

Automating Excel tasks with macros An Excel macro is a recorded sequence of Excel commands and actions that you can play back as many times as you want.  Macros can be used to automate just about any sequence of tasks in Excel, from something as simple as entering your company’s name and address into a spreadsheet to something as complex as creating a custom report. If you can do it in Excel, you can probably automate it with a macro.

Using KNIME for Advanced Data Analytics KNIME ( K o n stanz I nformation M in e r), is a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining "Building Blocks of Analytics" concept . The free and open-source KNIME Analytics Platform ensures the ETL process is powerful, scalable, repeatable, and reusable.

Using KNIME for Advanced Data Analytics Beginners and advanced users alike can use KNIME predictive analytics to » Upsell and cross-sell : To enable you to increase sales. » Churn reduction : Mine customer data and identify which customers you’re most likely to lose and why. » Sentiment and network analysis : Analyze the sentiment of people. » Energy usage prediction and auditing : Perform time series analyses.

Using KNIME for Advanced Data Analytics KNIME Software covers all kinds of data analytics functionality. For example classification, regression , dimension reduction, or clustering, using advanced algorithms including deep learning, tree-based methods, and logistic regression . This powerful and versatile open source platform offers a visual interface and a wide range of built-in algorithms to unlock the full potential of your data.  KNIME is a game-changer for anyone working with data .

Applying Domain Expertise to Solve Real-World Problems Using Data Science

Data Science for Driving Growth in E-Commerce

The work of data scientists in e-commerce involves: Data analysis: Simple statistical and mathematical inference. Segmentation analysis gets rather complicated when trying to make sense of e-commerce data . Data wrangling : Data wrangling involves using processes and procedures to clean and convert data from one format and structure to another s Data visualization design : Expect to use a lot of line charts, bar charts, scatter charts, and map-based data visualizations. Data visualizations should be simple and to the point.

The work of data scientists in e-commerce involves… Communication: After you make sense of the data, you have to communicate its meaning in clear, direct, and concise ways that decision makers can easily understand. Custom development work: In some cases, you may need to design custom scripts for automated custom data analysis and visualization

Optimizing e-commerce business systems by: Acquisition : Your brand acquires new users in the form of website visitors Activation : Acquired users activate, either through email subscription, RSS subscription, or social followings. Retention : Activated users take some sort of action — such as accepting an offer or responding to a call to action within your email marketing. Referral : Retained users refer new users to your brand’s acquisition layer. Revenue : Users make revenue-generating purchase
Tags