Identifying Informative Web Content Blocks using Web

@inproceedings{Gadge2014IdentifyingIW,
  title={Identifying Informative Web Content Blocks using Web},
  author={Jayant Gadge},
  year={2014}
}
Information Extraction has become an important task for discovering useful knowledge or information from the Web. A crawler system, which gathers information from the Web, is one of the fundamental necessities of Information Extraction. A search engine uses a crawler to crawl and index web pages. A search engine takes into account only the informative content for indexing. In addition to informative content, web pages commonly have blocks that are not the main content blocks and are called…

Citations

Publications citing this paper.

Methods for Removing Noise from Web Pages : A Review

Maya John, Dr. Jayasudha
2016
