A search engine is a program for searching documents that are stored on a computer or in a computer network such as the World Wide Web. After a search term is entered, the search engine returns a list of references to possibly relevant documents, usually with the title and a short excerpt of each document. Different search methods can be applied.
The essential components or task areas of a search engine are
Data acquisition usually takes place automatically: on the WWW by means of a web crawler, and on an individual computer by regularly reading all files in the directories of the local file system that the user has specified.
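The local case can be illustrated with a short sketch. It assumes the user-specified directories are given as a list of paths; the function name `collect_files` is hypothetical and chosen only for this example. A real desktop search program would also read each file's contents and repeat the scan at regular intervals.

```python
import os

def collect_files(directories):
    """Return a sorted list of all file paths below the given directories."""
    found = []
    for root_dir in directories:
        # os.walk visits root_dir and every subdirectory beneath it
        for current_dir, _subdirs, filenames in os.walk(root_dir):
            for name in filenames:
                found.append(os.path.join(current_dir, name))
    return sorted(found)

if __name__ == "__main__":
    # Hypothetical choice: scan the current working directory
    for path in collect_files(["."]):
        print(path)
```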
Search engines can be categorized by a set of characteristics. The following three characteristics are orthogonal to each other: when designing a search engine, one can choose an option from each of the three groups independently of the other two. The most common combination is an index-based (implementation) web search engine (data source) for HTML text documents (type of data), as offered, among others, by the three large search engine providers Google, Yahoo! Search, and MSN Search.
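What "index-based" can mean in practice is sketched below. The text does not say how the index is built; an inverted index, which maps each word to the documents containing it, is assumed here as one common approach. The function names and the two sample documents are made up for illustration.

```python
from collections import defaultdict

def build_index(documents):
    """Map each lowercase word to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index

def search(index, query):
    """Return the ids of documents that contain every word of the query."""
    words = query.lower().split()
    if not words:
        return set()
    # Start with the matches for the first word, then intersect with the rest
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result

# Hypothetical sample data
docs = {
    "a.html": "search engines crawl the web",
    "b.html": "a web crawler collects documents",
}
index = build_index(docs)
print(sorted(search(index, "web")))  # ['a.html', 'b.html']
```

Answering a query then only requires looking up the query words in the index instead of reading every document again.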
Different search engines can search different kinds of data. These can first be divided roughly into "document types" such as text, image, audio, video, and others. Result pages are laid out according to this type. For a search of text documents, a text fragment that contains the search terms is usually shown. Image search engines show a thumbnail view of the matching images.
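The text fragment shown for a text hit could be produced roughly as follows. This is a minimal sketch that assumes a single search term and a fixed number of context characters; the function name `make_snippet` and the `width` parameter are hypothetical.

```python
def make_snippet(text, term, width=30):
    """Return a fragment of text around the first occurrence of term, or None if absent."""
    # Case-insensitive search; assumes lowercasing does not change string length
    pos = text.lower().find(term.lower())
    if pos == -1:
        return None
    start = max(0, pos - width)
    end = min(len(text), pos + len(term) + width)
    # Mark the places where the fragment was cut out of the full text
    prefix = "..." if start > 0 else ""
    suffix = "..." if end < len(text) else ""
    return prefix + text[start:end] + suffix

print(make_snippet("hello world", "world", width=3))  # ...lo world
```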
A further, finer breakdown concerns data-specific properties that not all documents of one type share. Staying with the example of text, Usenet posts can be searched for particular authors, and web pages in HTML format for the document title.
Depending on the type of data, a further function is the restriction to a subset of all data of that type. This is generally implemented through additional search parameters that exclude part of the collected data. Alternatively, a search engine can restrict itself from the outset to collecting only suitable documents. Examples are a search engine for weblogs (instead of the whole Web), or search engines that process only documents from universities, or only documents from a certain country, in a certain language, or in a certain file format.
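The restriction through additional search parameters can be sketched as a filter over the collected documents. The record fields `language` and `format` and the function name `restrict` are assumptions made for this example, not part of any particular search engine.

```python
def restrict(documents, language=None, file_format=None):
    """Keep only documents matching every restriction that was given."""
    selected = []
    for doc in documents:
        if language is not None and doc["language"] != language:
            continue
        if file_format is not None and doc["format"] != file_format:
            continue
        selected.append(doc)
    return selected

# Hypothetical collected documents
collected = [
    {"url": "a.html", "language": "de", "format": "html"},
    {"url": "b.pdf", "language": "en", "format": "pdf"},
    {"url": "c.html", "language": "en", "format": "html"},
]
print([d["url"] for d in restrict(collected, language="en", file_format="html")])  # ['c.html']
```

A search engine that collects only suitable documents from the outset would apply the same kind of test before storing a document rather than at query time.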
A further characteristic for categorization is the source from which the data collected by the search engine originate. The name of the type of search engine usually already describes the source.
Web search engines collect documents from the World Wide Web; Usenet search engines collect posts from Usenet, the discussion medium distributed worldwide. Intranet search engines are limited to the computers of a company's intranet. Desktop search engines is the recent name for programs that make the local data of an individual computer searchable.
If the data acquisition is done manually, by submission or by editors, one speaks of a catalog or a directory. In such directories, like the Open Directory Project, the documents are organized hierarchically by topic in a table of contents.
This section describes differences in how the operation of the search engine is implemented.
Search results are presented sorted by relevance (ranking or search rank), for which each search engine applies its own criteria, most of which are kept secret. These include:
Some search engines sort search results not only by relevance to the query but also allow their output to be influenced in return for payment. In recent years, however, a separation between search results and displayed advertising marked as "paid hits", which is tailored to the query, has become established among the large providers.
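The separation between search results and paid hits can be pictured with a small sketch. It assumes each hit already carries a numeric relevance score and a flag saying whether it was paid for; how real search engines compute relevance is, as noted, mostly secret. All field and function names are hypothetical.

```python
def arrange_results(hits):
    """Split hits into organic results sorted by relevance and marked paid hits."""
    organic = [h for h in hits if not h["paid"]]
    paid = [h for h in hits if h["paid"]]
    # Only the organic results are ordered by relevance, highest first
    organic.sort(key=lambda h: h["relevance"], reverse=True)
    return {"results": organic, "paid_hits": paid}

# Hypothetical hits for one query
hits = [
    {"url": "shop.example", "relevance": 0.2, "paid": True},
    {"url": "wiki.example", "relevance": 0.9, "paid": False},
    {"url": "blog.example", "relevance": 0.5, "paid": False},
]
page = arrange_results(hits)
print([h["url"] for h in page["results"]])    # ['wiki.example', 'blog.example']
print([h["url"] for h in page["paid_hits"]])  # ['shop.example']
```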