Graph theory in Search engines and web connectivity.pptx

PreethiSatsangi 52 views 6 slides Apr 30, 2024
Slide 1
Slide 1 of 6
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6

About This Presentation

Search Engine Algorithms (PageRank algorithm)
Search engines such as Google let us navigate through the World Wide Web without a problem. Once a query is made to search a specific set of words, the engine looks for websites that match the query. After finding millions of matches, how does the engine...


Slide Content

Graph theory in Search engines Preethi Satsangi 2102818 BSc.CS

Understanding the web begins with visualising it as a directed graph, a concept where pages are nodes connected by links that have direction. Vertices and Edges Vertices (Nodes):  Each webpage is a vertex in the graph. The size of the vertex set can be used to gauge the scale of a web graph. Edges (Links):  Hyperlinks that direct from one page to another form the edges. The direction of an edge indicates the flow of navigation from one page to the next. Directed Graphs Unidirectional Flow:  Unlike undirected graphs, where edges have no orientation, the web graph's edges are directed, reflecting the one-way nature of hyperlinks. Cycles:  Cycles occur when a path of directed edges forms a loop, which is significant in understanding web navigation patterns.

Web Crawling This is the process by which search engines visit and index webpages, interpreting the web graph to build a database of information. Crawlers (Spiders):  These bots systematically visit webpages, tracing the graph's edges to discover new vertices. Indexing Strategies:  Different strategies determine the order in which pages are visited and indexed, impacting the freshness and comprehensiveness of the search results.

PageRank Algorithm PageRank is foundational to Google's search technology, applying graph theory to rank web pages. Here is an overview of how the PageRank algorithm works: Link Analysis:  The algorithm treats links as votes, with the idea that pages linked by important pages are themselves likely to be important. Iterative Calculation:  PageRank iteratively calculates the importance of each page based on the importance of the pages that link to it. Damping Factor:  Typically set around 0.85, this factor accounts for the probability that a user may stop following links and start a new search.

Conclusion In sum, graph theory is integral to understanding and navigating the web. By applying graph theory to the web,we gain valuable insights into the structure and dynamics of online information. It is a powerful tool that continues to shape the way we access and analyse data on the internet, making it an indispensable subject for those looking to delve deeper into the field of computer science.  

THANK YOU
Tags