The Importance of Indexing Links and Factors Affecting It

 

Introduction

(https://en.wikipedia.org/wiki/Search_engine_indexing)

 

Search engine indexing is the process of gathering, analyzing and storing data so that information can be retrieved quickly and accurately. The design of the index draws on concepts from linguistics, cognitive psychology, mathematics, informatics, physics and computer science. In the context of search engines designed to find web pages, the process is also called web indexing. Popular search engines focus on full-text indexing of online, natural-language documents, but they can also search video, audio and graphics.

Purpose

The purpose of storing an index is to optimize the speed and performance of finding the documents relevant to a search query. Without an index, the search engine would have to scan every document in the corpus for each query, which takes considerable time and computing power. Imagine searching for a single keyword and waiting hours for the computer to find it; that would be frustrating and practically useless. The trade-off is that the index needs additional storage and takes noticeably longer to update, but this cost is repaid by the time saved on every search. That is why an index matters: it makes searching faster and easier.
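
To make the difference concrete, here is a minimal sketch in Python that compares scanning every document with looking a word up in an inverted index (a map from each word to the documents containing it). The sample documents, the naive tokenizer and the function names are assumptions made for illustration; a real search engine does far more work on parsing and ranking.

```python
from collections import defaultdict

PUNCTUATION = ".,!?;:()\"'"

def tokenize(text):
    # Naive tokenizer (an assumption): split on whitespace, strip punctuation, lowercase.
    words = (w.strip(PUNCTUATION).lower() for w in text.split())
    return [w for w in words if w]

def linear_scan(docs, word):
    # Without an index: every document is re-read for every query.
    return sorted(doc_id for doc_id, text in docs.items()
                  if word.lower() in tokenize(text))

def build_inverted_index(docs):
    # One pass over the corpus; costs extra storage and build time up front.
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in tokenize(text):
            index[word].add(doc_id)
    return index

def lookup(index, word):
    # With an index: a single dictionary lookup, independent of corpus size.
    return sorted(index.get(word.lower(), set()))

# Hypothetical sample corpus.
docs = {
    1: "Search engines index web pages.",
    2: "An index speeds up search.",
    3: "Video and audio can also be searched.",
}
index = build_inverted_index(docs)
print(lookup(index, "search"))
```

Both functions return the same document ids for a given word. The difference is that the scan repeats the full pass over the corpus on every query, while the index pays that cost once when it is built and again only when it is updated.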

 

Factors Affecting Index Design

These are the main factors that affect the design of a search engine's index architecture:

  • Merge Factors - how data (words or subject features) is added to the index as the text corpus is traversed, and whether multiple indexers can work asynchronously. The indexer must first check whether it is updating existing content or adding new content.

  • Storage Techniques - how the index data is stored, that is, whether the information is filtered or compressed.

  • Index Size - how much computer storage is needed to hold the index.

  • Lookup Speed - how quickly a word can be found in the inverted index. The speed of finding an entry in a data structure has to be weighed against how quickly that entry can be updated or removed.

  • Maintenance - how the index is maintained over time.

  • Fault Tolerance - how important it is for the service to be dependable. Issues include dealing with index corruption, identifying bad data, coping with bad hardware, partitioning schemes such as composite partitioning, and replication.
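
The merge step described in the first factor, where the indexer checks whether it is updating existing content or adding new content, can be sketched roughly as follows. This is an illustrative assumption rather than how any particular search engine works: the `TinyIndex` class, its `merge` and `lookup` methods, and the whitespace tokenizer are hypothetical names and simplifications.

```python
from collections import defaultdict

class TinyIndex:
    """Hypothetical in-memory index used only to illustrate the merge check."""

    def __init__(self):
        self.postings = defaultdict(set)  # word -> ids of documents containing it
        self.doc_words = {}               # doc id -> words currently indexed for it

    def merge(self, doc_id, text):
        new_words = set(text.lower().split())  # naive tokenizer (assumption)
        old_words = self.doc_words.get(doc_id)
        is_update = old_words is not None
        if is_update:
            # Updating old content: drop postings for words the document lost.
            for word in old_words - new_words:
                self.postings[word].discard(doc_id)
                if not self.postings[word]:
                    del self.postings[word]
        # In both cases, record the words the document now contains.
        for word in new_words:
            self.postings[word].add(doc_id)
        self.doc_words[doc_id] = new_words
        return "updated" if is_update else "added"

    def lookup(self, word):
        return sorted(self.postings.get(word.lower(), set()))

idx = TinyIndex()
print(idx.merge(1, "fast search index"))  # first time this id is seen
print(idx.merge(1, "fast index"))         # same id again, so an update
print(idx.lookup("search"))               # the stale posting has been removed
```

Keeping the per-document word set makes updates cheap at the cost of extra storage, which shows how the merge, index size and maintenance factors pull against each other.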