Enhancement in Web Crawler using Weighted Page Rank Algorithm based on VOL: Extended Architecture of Web Crawler
Sachin Gupta

Name: Enhancement in Web Crawler using Weighted Page Rank Algorithm based on VOL: Extended Architecture of Web Crawler
Price: 46.49 USD
Availability: OutOfStock
Author: Sachin Gupta

Master's thesis from the year 2014 in the subject Computer Science - Miscellaneous, course: M. Tech, language: English, comment: Excellent.

Abstract: As the World Wide Web grows rapidly day by day, the number of web pages around the world is increasing into the millions and trillions. Search engines came into existence to make searching easier for users; they are used to find specific information on the WWW. Without search engines, it would be almost impossible to locate anything on the Web unless we already knew a specific URL. Every search engine maintains a central repository, or database, of HTML documents in indexed form. Whenever a user query arrives, the search is performed within that database of indexed web pages. No search engine's repository can accommodate every page available on the WWW, so it is desirable to store only the most relevant and important pages in the database to increase the efficiency of the search engine. This database of HTML documents is maintained by special software called a "crawler". A crawler is software that traverses the web and downloads web pages. Broad search engines, as well as many more specialized search tools, rely on web crawlers to acquire large collections of pages for indexing and analysis. Since the Web is a distributed, dynamic and rapidly growing information resource, a crawler cannot download all pages; it crawls only a fraction of the pages on the World Wide Web. A crawler should therefore ensure that the fraction it crawls consists of the most relevant and most important pages, not just random ones. In our work, we propose an extended architecture of a search engine's web crawler that crawls only relevant and important pages from the WWW, which will lead to reduced server overheads. With our proposed architecture we will also be optimizing the crawled data by removing leas

Media	Books Paperback Book (book with soft cover and glued spine)
Published	July 25, 2014
ISBN13	9783656700043
Publisher	Grin Verlag
Pages	98
Dimensions	148 × 210 × 6 mm · 157 g
Language	German