Web Robot Detection-Preprocessing Web Logfiles for Robot Detection

@inproceedings{Bomhardtl2006WebRD,
  title={Web Robot Detection-Preprocessing Web Logfiles for Robot Detection},
  author={Christian Bomhardtl and Wolfgang Gaul and Lars Schmidt-Thieme},
  year={2006}
}
Web usage mining has to face the problem that parts of the underlying logfiles are created by robots. While cooperative robots identify themselves and obey to the instructions of server owners not to access parts or all of the pages on the server, malignant robots may camouflage themselves and have to be detected by web robot scanning devices. We describe the methodology of robot detection and show that highly accurate tools can be applied to decide whether session data was generated by a robot… CONTINUE READING
Highly Cited
This paper has 57 citations. REVIEW CITATIONS
23 Citations
9 References
Similar Papers

Citations

Publications citing this paper.
Showing 1-10 of 23 extracted citations

57 Citations

0510'06'09'12'15'18
Citations per Year
Semantic Scholar estimates that this publication has 57 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.
Showing 1-9 of 9 references

Frequent generalized subsequences - a problem from webmining

  • W. GAUL, L. SCHMIDT-THIEME
  • 2000
1 Excerpt

In search of invariants for e-business workloads

  • W. MEIRA
  • 2000

Spambot beware detection. h t tp : //www. turns tep

  • G. MULLANE
  • 1998
2 Excerpts

A standard for robot exclusion

  • M. ipaopao.comfesweb. KOSTER
  • 1994
3 Excerpts

Characterizing and modeling robot workload on e - business sites

  • V. ALMEIDA, R. RIEDI, D. MENASCE, W. MEIRA, F. RIBEIRO, R. FON-SECA

Similar Papers

Loading similar papers…