Career Profile
Senior research data expert and bioinformatician with 18+ years of experience in open science, both as a computational biologist and a service provider for scientific computing infrastructure. Passionate about accellerating research through open science practices and robust data management, with experience in software architecture and programming, large-scale computing, big data, leadership, and strategic planning. Strong communicator, able to translate the needs of researchers into technical solutions.
Competencies
Project Management
Project coordination, agile development, kanban, Jira, distributed teams
Top Programming Languages
Python, Java, Go, Javascript, Typescript, HTML, CSS, C++, Bash, R, SQL
Frameworks & Technologies
Numpy/Pandas, NodeJS, Angular 2, OpenAPI, LinkML, JSON Schema, JSON-LD, REST APIs, Jekyll, MongoDB, Jupyter
Devops
Docker, Kubernetes, Ansible, Helm, CI/CD
Scientific expertise
Electron microscopy, structural biology, bioinformatics, statistics, machine learning, AI, phylogenetics, semantic ontologies
Education
Advisor: Prof. Philip Bourne. Thesis: Structure-Preserving Rearrangements: Algorithms for Structural Comparison and Protein Analysis
Advisor: Prof. David Baker. Thesis: Efficient Modeling of Disulfide Bonds with Rosetta
Experiences
Advisor: Prof. Maria Anisimova. Focused on phylogenetic methods for tandem repeat proteins, including software development and benchmarking. Developed novel methods for the Tandem Repeat and applied these systematically to the Armadillo Repeat protein family. (Python, slurm, Apache spark)
Advisors: Prof. Philip Bourne (NIH) and Dr. Guido Capitani (PSI).
- Analysis of protein-protein interactions in crystal structures as part of the EPPIC, the Evolutionary Protein Protein Interface Classifier. (Java, Javascript, NGL, vis.js)
- Assessor for the Critical Assessment of Structure-based Prediction (CASP12) community-wide experiment. (Python, web development)
Advisor: Prof. Philip Bourne. Thesis: Structure-Preserving Rearrangements: Algorithms for Structural Comparison and Protein Analysis
- Structural comparison algorithms for proteins, including CE-Symm for detecting internal pseudosymmetry.
- Deployment of structural analysis tools on the RCSB Protein Data Bank, as well as large-scale systematic analyses. (Java, Open Science Cloud, BioJava core developer, Open Bioinformatics Foundation member)
Certifications
Publications
https://orcid.org/0000-0002-1200-1698
-
OSC-EM: an open community standard for cryo-EM experimental metadata
In preparation.
-
OpenEM: An open data repository for electron microscopy
In preparation.
-
Cryo-EM structure of a single-chain β1-adrenoceptor – AmpC β-lactamase fusion protein
J Struct Biol. (2026) Under review.
-
Structural Dynamics. (2023) 10, 034101. DOI: 10.1063/4.0000178
-
Bioarxiv. (2021) DOI: 10.1101/2021.09.25.461805
-
Frontiers in Bioinformatics. (2021) 1:691865. DOI: 10.3389/fbinf.2021.691865
-
PLoS Computational Biology. (2019) 15(4):e1006842. PMID: 31009453
-
PLOS Computational Biology. (2019) 15(2):e1006791. PMID: 30735498
-
PLOS Computational Biology. (2018) 14(4):e1006104. PMID: 29708963
-
Proteins: Structure, Function, and Bioinformatics. (2017) 29(11):618. PMID: 29071742
-
Proteins: Structure, Function, and Bioinformatics. (2017) 84(suppl 1):370. PMID: 28975675
-
Journal of Medicinal Chemistry. (2017) 60(7):2879–2889. PMID: 28326775
-
Structure. (2017) 25(2):341–352. PMID: 28111021. PDB: 5T89
-
BMC Systems Biology. (2015) 10:26. PMID: 26969117
-
Bioinformatics. (2015) 32(4):481–489. PMID: 26508758
-
Bioinformatics. (2015) 31(8):1316-1318. PMID: 25505094
-
BMC Structural Biology. (2014) 14:22. PMID: 25326082
-
Journal of Molecular Biology. (2013) 426(11):2255–2268. PMID: 24681267
-
Bioinformatics. (2012) 28(20):2693–2695. PMID: 22877863
-
PLoS Computational Biology. (2012) 8(3):e1002445. PMID: 22496628
-
Bioinformatics. (2010) 26(23):2983-2985. PMID: 20937596
-
OSC-EM: an open community standard for cryo-EM experimental metadataIn preparation.
-
OpenEM: An open data repository for electron microscopyIn preparation.
-
Cryo-EM structure of a single-chain β1-adrenoceptor – AmpC β-lactamase fusion proteinJ Struct Biol. (2026) Under review.
-
Structural Dynamics. (2023) 10, 034101. DOI: 10.1063/4.0000178
-
Bioarxiv. (2021) DOI: 10.1101/2021.09.25.461805
-
Frontiers in Bioinformatics. (2021) 1:691865. DOI: 10.3389/fbinf.2021.691865
-
PLoS Computational Biology. (2019) 15(4):e1006842. PMID: 31009453
-
PLOS Computational Biology. (2019) 15(2):e1006791. PMID: 30735498
-
PLOS Computational Biology. (2018) 14(4):e1006104. PMID: 29708963
-
Proteins: Structure, Function, and Bioinformatics. (2017) 29(11):618. PMID: 29071742
-
Proteins: Structure, Function, and Bioinformatics. (2017) 84(suppl 1):370. PMID: 28975675
-
Journal of Medicinal Chemistry. (2017) 60(7):2879–2889. PMID: 28326775
-
Structure. (2017) 25(2):341–352. PMID: 28111021. PDB: 5T89
-
BMC Systems Biology. (2015) 10:26. PMID: 26969117
-
Bioinformatics. (2015) 32(4):481–489. PMID: 26508758
-
Bioinformatics. (2015) 31(8):1316-1318. PMID: 25505094
-
BMC Structural Biology. (2014) 14:22. PMID: 25326082
-
Journal of Molecular Biology. (2013) 426(11):2255–2268. PMID: 24681267
-
Bioinformatics. (2012) 28(20):2693–2695. PMID: 22877863
-
PLoS Computational Biology. (2012) 8(3):e1002445. PMID: 22496628
-
Bioinformatics. (2010) 26(23):2983-2985. PMID: 20937596