- Full text PDF available (13)
We present a detailed description of an algorithm tailored to detect external plagiarism in PAN-09 competition. The algorithm is divided into three steps: a first reduction of the size of the problem by a selection of ten suspicious plagiarists using a n-gram distance on properly recoded texts. A search for matches after T9-like recoding. A " joining… (More)
The complexity of human interactions with social and natural phenomena is mirrored in the way we describe our experiences through natural language. In order to retain and convey such a high dimensional information, the statistical properties of our linguistic output has to be highly correlated in time. An example are the robust observations, still largely… (More)
Hamiltonian systems with a mixed phase space typically exhibit an algebraic decay of correlations and of Poincaré recurrences, with numerical experiments over finite times showing system-dependent power-law exponents. We conjecture the existence of a universal asymptotic decay based on results for a Markov tree model with random scaling factors for the… (More)
We establish a deterministic technique to investigate transport moments of arbitrary order. The theory is applied to the analysis of different kinds of intermittent one-dimensional maps and the Lorentz gas with infinite horizon: the typical appearance of phase transitions in the spectrum of transport exponents is explained. Periodic orbit theory of strongly… (More)
We show that the nontwist phenomena previously observed in Hamiltonian systems exist also in time-reversible non-Hamiltonian systems. In particular, we study the two standard collision-reconnection scenarios and we compute the parameter space breakup diagram of the shearless torus. Besides the Hamiltonian routes, the breakup may occur due to the onset of… (More)
We perform a statistical study of the distances between successive occurrences of a given dinucleotide in the DNA sequence for a number of organisms of different complexity. Our analysis highlights peculiar features of the CG dinucleotide distribution in mammalian DNA, pointing towards a connection with the role of such dinucleotide in DNA methylation.… (More)
The entropy of an ergodic source is the limit of properly rescaled 1-block entropies of sources obtained applying successive non-sequential recursive pairs substitutions ,. In this paper we prove that the cross entropy and the Kullback-Leibler divergence can be obtained in a similar way.
We present a series of results on deterministic transport in chaotic system, obtained in the framework of periodic orbits theory. The emphasis is on intermittent systems, where deviations from complete chaos may induce anomalies on the asymptotic moments' growth.
We perform numerical measurements of the moments of the position of a tracer particle in a two-dimensional periodic billiard model (Lorentz gas) with infinite corridors. This model is known to exhibit a weak form of superdiffusion, in the sense that there is a logarithmic correction to the linear growth in time of the mean-squared displacement. We show… (More)
We consider the billiard dynamics in a striplike set that is tessellated by countably many translated copies of the same polygon. A random configuration of semidispersing scatterers is placed in each copy. The ensemble of dynamical systems thus defined, one for each global choice of scatterers, is called quenched random Lorentz tube. We prove that under… (More)