Hatice Gulcin Ozer

Learn More
—Throughput from sequencing instruments has been increasing in an unprecedented speed, leading to an explosion of the next-generation sequencing (NGS) data, and challenges in storing, managing, and analyzing these datasets. Parallelism is the key in handling large-scale data, and some progress has been made in parallelizing important steps, like sequence(More)
BACKGROUND Chronic lymphocytic leukemia (CLL) is the most common adult leukemia. It is a highly heterogeneous disease, and can be divided roughly into indolent and progressive stages based on classic clinical markers. Immunoglobin heavy chain variable region (IgVH) mutational status was found to be associated with patient survival outcome, and biomarkers(More)
SUMOylation of transcription factors and chromatin proteins is in many cases a negative mark that recruits factors that repress gene expression. In this study, we determined the occupancy of Small Ubiquitin-like MOdifier (SUMO)-1 on chromatin in HeLa cells by use of chromatin affinity purification coupled with next-generation sequencing. We found SUMO-1(More)
Next Generation Sequencing is highly resource intensive. NGS Tasks related to data processing, management and analysis require high-end computing servers or even clusters. Additionally, processing NGS experiments requires suitable storage space and significant manual interaction. At The Ohio State University's Biomedical Informatics Shared Resource, we(More)
Differential expression detection for RNA-seq experiments is often biased by normalization algorithms due to their sensitivity to parametric assumptions on the gene count distributions, extreme values of gene expression, gene length and total number of sequence reads. To overcome limitations of current methodologies, we developed Differential Feature Index(More)
Many biologically significant processes, such as cell differentiation and cell cycle progression, gene transcription and DNA replication, chromosome stability and epigenetic silencing etc. depend on the crucial interactions between cellular proteins and DNA. Chromatin immunoprecipitation (ChIP) is an important experimental technique for studying(More)
New high-throughput sequencing technologies can generate millions of short sequences in a single experiment. As the size of the data increases, comparison of multiple experiments on different cell lines under different experimental conditions becomes a big challenge. In this paper, we investigate ways to compare multiple ChIP-sequencing experiments. We(More)