IR, LSI,Reordering of results
If you want to know something about the LSI Indexing, maybe it will be useful if I will explain here about the 3 most important Google Algorithm.
IR (Information Retrieval)
Information retrieval is applied to find any type of information (libraries, research, government, and ... the web). Let's not forget that databases in the past were called "libraries" In time identified two major components of the IR:
Relevance> degree to which the document found meets the intent of search. Relevance increase if search terms are found several times in the content and are possible in the main titles or descriptions.
Popularity> A document is more important than how it is quoted in other works. The popularity of a document is the date and the quality of the works that are reerinta him.
LSI (Latent Semantic Indexing)
This is the science of language processing. LSI analyzes the relationships between words, and assume that this is done naturally and that it establish relationships between words synonymous (replacement). As you discuss on a topic first extract the words "heavy" are around the main word meaning and secondly extract the words used in context of the main word.
eg Level 1 motorcycle (Moto, Motor, motorcycle, Scooter) and Level 2 (Helmets, Racing, Adventure, track, etc.)
The challenge for LSI is to distinguish between human writing and writing automated intentional to manipulate search results. Applied Semantics is probably the company that developed the most that science and was acquired by Google.
Hence we conclude that rather write naturally trying to imagine the use of phrases in different occasions rather than always analyze keyword density per page.
(Re) ordering of results
Patent in 2003 says that "a search engine for searching a corpus improves the relevancy of the results by refining a standard relevancy score based on the inter-connectivity of the initially returned set of documents."
(All patents Google study)
Here is the two algorithms namely Google PageRank and TrustRank, the searching page, then comes the results against the reordering of the correlation between them.
eg could even if you have many links to sites ranked well not be able to climb in search results. This means you may need to have sites that are included in a given class that contains your key words, that the sites are on top for those words or sites that are ranked as of your competitors. At present the highest-rated sites are those which derive their links in the community. (thematic forums, discussion groups, etc.)
__________________
I am working on the To view links or images in signatures your post count must be 1 or greater. You currently have 0 posts. and mantaining To view links or images in signatures your post count must be 1 or greater. You currently have 0 posts. . Stay tuned for the To view links or images in signatures your post count must be 1 or greater. You currently have 0 posts.
|