Satyabrata Sahoo

Learn More
We present an expression measure of a gene, devised to predict the level of gene expression from relative codon bias (RCB). There are a number of measures currently in use that quantify codon usage in genes. Based on the hypothesis that gene expressivity and codon composition is strongly correlated, RCB has been defined to provide an intuitively meaningful(More)
Based on the hypothesis that highly expressed genes are often characterized by strong compositional bias in terms of codon usage, there are a number of measures currently in use that quantify codon usage bias in genes, and hence provide numerical indices to predict the expression levels of genes. With the recent advent of expression measure from the score(More)
Features unique to a transfer-RNA are recognized by the corresponding tRNA-synthetase. Keeping this in view we isolate the discriminating features of all archaeal tRNA. These are our identity elements. Further, we investigate tRNA-characteristics that delineate the different orders of Archaea.
The expression of functional proteins plays a crucial role in modern biotechnology. The free-living cynobacterium Synechocystis PCC 6803 is an interesting model organism to study oxygenic photosynthesis as well as other metabolic processes. Here we analyze a gene expression profiling methodology, RCBS (the scores of relative codon usage bias) to elucidate(More)
Our studies on the bases of codons from 11 completely sequenced archaeal genomes show that, as we move from GC-rich to AT-rich protein-coding gene-containing species, the differences between G and C and between A and T, the purine load (AG content), and also the overall persistence (i.e. the tendency of a base to be followed by the same base) within codons,(More)
Transfer-RNA genes in archaea often have introns intervening between exon sequences. The structural motif at the boundary between exon and intron is the bulge-helix-bulge. Computational investigations of these boundary structures in Haloarcula marismortui lead us to propose that tRNA-isoleucine and tRNA-tryptophan genes are co-located. Precise in silico(More)
This article deals with the relationship between vocabulary (total number of distinct oligomers or “words”) and text-length (total number of oligomers or “words”) for a coding DNA sequence (CDS). For natural human languages, Heaps established a mathematical formula known as Heaps' law, which relates vocabulary to text-length. Our analysis shows that Heaps'(More)
We analyse forty-seven chloroplast genes of the large subunit of RuBisCO, from the algal order Ectocarpales, sourced from GenBank. Codon-usage weighted by the nucleotide base-bias defines our score called the codon-impact-parameter. This score is used to obtain phylogenetic relations amongst the 47 Ectocarpales. We compare our classification with the ones(More)
Research into new methods for identifying highly expressed genes in anonymous genome sequences has been going on for more than 15 years. We presented here an alternative approach based on modified score of relative codon usage bias to identify highly expressed genes in crenarchaeal genomes. The proposed algorithm relies exclusively on sequence features for(More)
The enzyme Glyceraldehyde-3-Phosphate Dehydrogenase (GAPDH) catalyses the decomposition of glucose. The gene that produces the GAPDH is therefore present in a wide class of organisms. We show that for this gene the average value of the fluctuations in nucleotide distribution in the codons, normalized to strand bias, provides a reasonable measure of how the(More)