«مقدمهای بر آپاچی سولار» ارائه شده توسط حسن نصر در اسکیلآپ هشتم
Size: 1.79 MB
Language: en
Added: Apr 20, 2018
Slides: 18 pages
Slide Content
Introduction to
Apache Solr
Hassan Nasr Esfahani
Topics
–What we need from a text search engine
–What is Solr?
–Why Solr?
–Concepts And Architecture
–Usage
–Special Features
–Competitors
Text Retrieval vs Database
Retrieval
–Information and Query
–Unstructured vs Structured
–Ambiguous vs Well defined
–Answers
–Relevant documents (ambiguous) vs matched
documents
What we want from text search
engine
Basic Search Features:
–Store some documents with some fields
–Query for documents
Text Search Features
–Find most relevant docs
–Handle Natural language Complications (stop words, stem, tokenizing … )
–Highlight text
–…
Problems with Text Search
SampleProblem
موریمقداصدمحم،شباتک،Tokenization
يویDifferent Letter representation
دوریم،یوریم،موریمSimilar words
راگزومآوملعمSynonymous words
ریشWord ambiguity
،تفر،هب،تسا،اب...Stop words
شراذگSpell errors
نونSpoken language
What is Solr?
–An Open Search Engine
–Written in Java
–Wrapping Apache Lucene
–With REST API
–Fault tolerant
–Scalable
–Distributable