The Unified Logging Infrastructure for Data Analytics at Twitter

  title={The Unified Logging Infrastructure for Data Analytics at Twitter},
  author={George Lee and Jimmy J. Lin and Chuang Liu and Andrew Lorek and Dmitriy V. Ryaboy},
In recent years, there has been a substantial amount of work on large-scale data analytics using Hadoop-based platforms running on large clusters of commodity machines. A lessexplored topic is how those data, dominated by application logs, are collected and structured to begin with. In this paper, we present Twitter’s production logging infrastructure and its evolution from application-specific logging to a unified “client events” log format, where messages are captured in common, well… CONTINUE READING
Highly Cited
This paper has 66 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.
49 Citations
30 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 49 extracted citations

66 Citations

Citations per Year
Semantic Scholar estimates that this publication has 66 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 30 references

Similar Papers

Loading similar papers…