Friday, March 28, 2014
Week11——Muddiest Points
Unit10 Web Information Retrieval
1. About inverted indexes for web search engines: for the web search engines, they will be very large inverted indexes, how to storage and storage space are serious problems.
2. About web crawler: what is the exact processing of parsing the pages? How should we deal with the web crawl challenges, such as the hardware problems?
1. About inverted indexes for web search engines: for the web search engines, they will be very large inverted indexes, how to storage and storage space are serious problems.
2. About web crawler: what is the exact processing of parsing the pages? How should we deal with the web crawl challenges, such as the hardware problems?
Friday, March 21, 2014
Week10——Reading notes(Unit 10)
Chapter 19 Web search basics
1. Web characteristics: web page authors created content in dozens of languages and thousands of dialects, thus demanding many different forms of stemming and other linguistic operations.
The web graph: static web consisting of static HTML pages together with the hyperlinks between them as a directed graph in which each web page is a node and each hyperlink a directed edge.
Spam: the manipulation of web page content for the purpose of appearing high up in search results for selected keywords.
2. Advertising as the economic model: convey to the viewer a positive feeling about the brand of the company placing the advertisement. (sponsored search or search advertising)
3. Search user experience:
1. Web characteristics: web page authors created content in dozens of languages and thousands of dialects, thus demanding many different forms of stemming and other linguistic operations.
The web graph: static web consisting of static HTML pages together with the hyperlinks between them as a directed graph in which each web page is a node and each hyperlink a directed edge.
Spam: the manipulation of web page content for the purpose of appearing high up in search results for selected keywords.
2. Advertising as the economic model: convey to the viewer a positive feeling about the brand of the company placing the advertisement. (sponsored search or search advertising)
3. Search user experience:
Subscribe to:
Comments (Atom)