Webzeitgeist: design mining the web

@article{Kumar2013WebzeitgeistDM,
  title={Webzeitgeist: design mining the web},
  author={Ranjitha Kumar and Arvind Satyanarayan and C{\'e}sar Torres and Maxine Lim and Salman Ahmad and Scott R. Klemmer and Jerry O. Talton},
  journal={Proceedings of the SIGCHI Conference on Human Factors in Computing Systems},
  year={2013}
}
Advances in data mining and knowledge discovery have transformed the way Web sites are designed. However, while visual presentation is an intrinsic part of the Web, traditional data mining techniques ignore render-time page structures and their attributes. This paper introduces design mining for the Web: using knowledge discovery techniques to understand design demographics, automate design curation, and support data-driven design tools. This idea is manifest in Webzeitgeist, a platform for… 
Mining Visual Evolution in 21 Years of Web Design
TLDR
This paper presents a curated dataset of 21 years of web design, scraped from the Internet Archive, and demonstrates how the data can be modeled with deep neural networks to enable novel design applications, such as predicting the apparent year of a web design.
Mining Web pages for obtaining design examples
TLDR
This work proposes an approach that populates a database with design examples, specifically layout examples: it crawls the Internet to retrieve Web pages, which are then mined by the frequent subtree mining algorithm.
Web Intelligence Linked Open Data for Website Design Reuse
TLDR
This paper proposes extracting website-relevant data from online global services, treated as linked open data sources, using a specially developed web intelligence data miner, and performs pilot feature engineering to find similar solutions within Domain, Task, and User UI models supplemented by Quality aspects.
Automaticly Generating Web Page From A
TLDR
A method is proposed to automate the transformation of a mockup into a web page, together with a bottom-up tag generation method based on Random Forest that selects tags for elements to meet the basic requirements of developers.
Searching the Visual Style and Structure of D3 Visualizations
TLDR
A style- and structure-based search engine for D3 visualizations that supports queries on visual style and underlying structure; it is found to be significantly more useful and satisfying for finding different designs of D3 charts than a baseline search engine that only allows keyword search over the webpage containing a chart.
Ply: A Visual Web Inspector for Learning from Professional Webpages
TLDR
Ply is presented, a CSS inspection tool that helps novices use their visual intuition to make sense of professional webpages, and a new visual relevance testing technique to identify properties that have visual effects on the page is introduced.
Rico: A Mobile App Dataset for Building Data-Driven Design Applications
TLDR
Rico is presented, the largest repository of mobile app designs to date, created to support five classes of data-driven applications: design search, UI layout generation, UI code generation, user interaction modeling, and user perception prediction.
Component-based Engineering of Web User Interface Designs for Evolutionary Optimization
  • Maxim Bakaev, V. Khvorostov
  • Computer Science
    2018 19th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)
  • 2018
TLDR
This paper designates the structure of the components, justifies the use of Drupal as the organizational framework, and specifies the algorithm for generating new solutions along the three WUI dimensions: functionality, layout, and visual appearance.
Interactive Exploration of Large-Scale UI Datasets with Design Maps
TLDR
Overall, designers find that Design Maps support their creativity and indicate that the maps producing consistent whitespacing within cloud points are the most informative ones.
Automaticly Generating Web Page From A Mockup
TLDR
This paper proposes a method to automate the transformation of a mockup into a web page; it extracts elements based on the color features of their edges and uses Random Forest to select the tags for the elements.
