Yixiang Fang

Learn More
Given a graph G and a vertex q ∈ G, the community search query returns a subgraph ofG that contains vertices related to q. Communities, which are prevalent in attributed graphs such as social networks and knowledge bases, can be used in emerging applications such as product advertisement and setting up of social events. In this paper, we investigate the(More)
Despite its popularity, SimRank is computationally costly, in both time and space. In particular, its recursive nature poses a great challenge in using modern distributed computing power, and also prevents querying similarities individually. Existing solutions suffer greatly from these practical issues. In this paper, we break such dependency for maximum(More)
Twitter is widely used all over the world, and a huge number of hot topics are generated by Twitter users in real time. These topics are able to reflect almost every aspect of people’s daily lives. Therefore, the detection of topics in Twitter can be used in many real applications, such as monitoring public opinion, hot product recommendation and incidence(More)
Both nonparametric and parametric approaches were used to construct SSDs for use in ecological risk assessments. Based on toxicity to representative aquatic species and typical water contaminants of metals and metalloids in China, nonparametric methods based on the bootstrap were statistically superior to the parametric curve-fitting approaches. Knowing(More)
There are many automatic methods that can extract lists of objects from the Web, but they often fail to handle multi-type pages automatically. This paper introduces a new method for record extraction using suffix tree which can find the repeated sub-string. Our method transfers a distinct group of tag paths appearing repeatedly in the DOM tree of the Web(More)
Communities are prevalent in social networks, knowledge graphs, and biological networks. Recently, the topic of community search (CS) has received plenty of attention. Given a query vertex, CS looks for a dense subgraph that contains it. Existing CS solutions do not consider the spatial extent of a community. They can yield communities whose locations of(More)
Trajectory data are abundant and prevalent in systems that monitor the locations of moving objects. In a vehicle location-based service, the positions of vehicles are continuously monitored through GPS; each vehicle is associated with a trajectory that describes its movement history. In species monitoring, animals are attached with sensors, whose positions(More)
Trajectory data are prevalent in systems that monitor the locations of moving objects. In a location-based service, for instance, the positions of vehicles are continuously monitored through GPS; the trajectory of each vehicle describes its movement history. We study joins on two sets of trajectories, generated by two sets M and R of moving objects. For(More)
Given a graph G and a set Q of query nodes, we examine the Steiner Maximum-Connected Subgraph (SMCS). The SMCS, or G's induced subgraph that contains Q with the largest connectivity, can be useful for customer prediction, product promotion, and team assembling. Despite its importance, the SMCS problem has only been recently studied. Existing solutions(More)
Contamination of the aquatic environment by EDCs has received considerable attention from scientists, government officials, and the public. E2, one of the EDCs with high estrogenic effect, has the potential to cause multiple endocrine-disrupting effects, even at small concentrations. In the present review, the toxicity of E2 to aquatic organisms was(More)