I was trained as a computational biologist with expertise in biology and computer science. At the University of California, Davis, I was first exposed to 454 sequencing for identifying genetic variants, and to the revolution in next-generation sequencing (NGS) technologies and large-scale data analysis. I quickly became aware of the amount of data that would be generated and the need for robust and reproducible analysis pipelines. As I developed software and algorithms, I also began to see the power of data integration for elucidating biological mechanisms, as well as the need for public data resources. Before re-entering bioinformatics, I spent two years as a software developer with Infosys, where I developed and maintained a suite of large-scale customer relationship management tools. This experience gave me insight into industry standards of software design and implementation as well as data management. It also inspired me to apply these same standards to biological databases in research settings. Large quantitative datasets from global studies extend our knowledge of genes, their products and their interactions. Integrating these quantitative datasets with curated, focused experimental and clinical data creates unique, comprehensive databases. I have been involved in the design and implementation of databases for the ENCODE (Encyclopedia of DNA Elements) and UCSC Genome Browser projects, integrating scientific information into encyclopedic databases essential for investigation. While on these projects, I also implemented genomic analysis pipelines to facilitate reproducible data analysis in the Amazon cloud (AWS). Building on this work, I have focused on implementing, optimizing and distributing such pipelines.
In 2014, I moved to UT Southwestern Medical Center (UTSW), where my positions, first as a computational biologist in the Green Center for Reproductive Biological Sciences and then in the Bioinformatics Core Facility, gave me the opportunity to further the understanding of the human genome by integrating large-scale functional and comparative genomics datasets in cancer. Specifically, in the Lonestar Oncology Network for Epigenetics Therapy and Research (LONESTAR) Consortium, I developed a multi-omics integration pipeline that identifies breast cancer subtype-specific transcription factors (TFs) bound at active enhancers that regulate the gene expression patterns determining growth and clinical outcomes. In collaboration with Dr. Ping Mu (prostate cancer cell biology), I applied these approaches to identify the enhancer landscape and key TFs driving prostate cancer resistance, leading to new clinical targets. Currently, I am working on integrating multiple -omics assays to understand the transcription factors driving gene regulatory networks in human cancers.
As the co-lead of the Data Analytics Core of the UT Southwestern Kidney Cancer SPORE (Dr. James Brugarolas), I developed the Kidney Cancer Explorer (KCE) to facilitate hypothesis generation from clinical and genomic data. Using the KCE framework, we aim to expand this project into a pan-cancer data commons (PC-DC), which will allow researchers to build patient cohorts based on clinical, pathological and genomic information and to identify molecular trends associated with clinical attributes, or vice versa.
- (2008), Biology
- Graduate School
- Johns Hopkins University (2011), Biology
- Chromatin structure and gene regulation
- Comprehensive Scientific Databases
- Data integration
- Reproducibility and Open Science
- Detecting signatures of inter-regional and inter-specific hybridization among the Chinese rhesus macaque specific pathogen-free (SPF) population using single nucleotide polymorphic (SNP) markers.
- Kanthaswamy S, Satkoski J, Kou A, Malladi V, Glenn Smith D, J. Med. Primatol. 2010 Aug 39 4 252-65
- Canine population data generated from a multiplex STR kit for use in forensic casework.
- Kanthaswamy S, Tom BK, Mattila AM, Johnston E, Dayton M, Kinaga J, Erickson BJ, Halverson J, Fantin D, DeNise S, Kou A, Malladi V, Satkoski J, Budowle B, Smith DG, Koskinen MT, J. Forensic Sci. 2009 Jul 54 4 829-40
- Development of a Chinese-Indian hybrid (Chindian) rhesus macaque colony at the California National Primate Research Center by introgression.
- Kanthaswamy S, Gill L, Satkoski J, Goyal V, Malladi V, Kou A, Basuta K, Sarkisyan L, George D, Smith DG, J. Med. Primatol. 2009 Apr 38 2 86-96
- Analysis of forensic SNPs in the canine mtDNA HV1 mutational hotspot region.
- Baute DT, Satkoski JA, Spear TF, Smith DG, Dayton MR, Malladi VS, Goyal V, Kou A, Kinaga JL, Kanthaswamy S, J. Forensic Sci. 2008 Nov 53 6 1325-33
- Pyrosequencing as a method for SNP identification in the rhesus macaque (Macaca mulatta).
- Satkoski JA, Malhi R, Kanthaswamy S, Tito R, Malladi V, Smith D, BMC Genomics 2008 May 9 256
- Forensic utility of the mitochondrial hypervariable region 1 of domestic dogs, in conjunction with breed and geographic information.
- Himmelberger AL, Spear TF, Satkoski JA, George DA, Garnica WT, Malladi VS, Smith DG, Webb KM, Allard MW, Kanthaswamy S, J. Forensic Sci. 2008 Jan 53 1 81-9