I was a core contributor to multiple CACHE Challenge campaigns, where I designed and executed large-scale virtual screening workflows under realistic drug discovery constraints. I collected and curated bioactivity data from ChEMBL to build QSAR and IC50 prediction models, which were used to support target-specific prioritization. I ran docking and post-docking analyses and developed graph-based and transformer-based models to distinguish active compounds from decoys. These models were applied to ultra-large chemical libraries, including multi-billion compound spaces, to reduce false positives. Our integrated IC50 prediction and interaction-based triage significantly improved hit selection compared to docking alone in blinded CACHE benchmarks.
Contribution to Project (2)
I was a lead contributor to the HitTPT project, developing a graph transformer model to distinguish active compounds from decoys in structure-based virtual screening. I introduced a new interaction fingerprint that explicitly encodes protein–ligand contacts, geometric relationships, and chemical context directly from docked complexes. This interaction representation was used across multiple stages of the virtual screening pipeline, including post-docking triage and hit re-ranking. I curated and prepared interaction-level datasets across multiple targets to train and validate the model. The resulting approach consistently achieved top performance, significantly reducing false positives compared to docking and standard ML methods.
Contribution to Project (3)
I contributed to Project by developing a deep learning graph transformer model for binding free energy prediction from protein–ligand complexes. I designed graph representations that capture atomic interactions, geometry, and chemical context from 3D structures. The model was trained and evaluated on curated benchmark datasets of protein–ligand complexes. I conducted systematic benchmarking against seven established methods, including both machine learning and physics-based approaches. The model demonstrated competitive or improved performance, highlighting the strengths of interaction-aware graph transformers for binding affinity estimation.
What I'm looking for
Sector(s) of Interest
IndustryAcademia
Role(s) of Interest
BioinformaticianPostdocScientistStaff Scientist
Research Fields & Keywords
ADMEAIai for scienceAi in pharmacy and biotechnologyai machine learningalgorithmsalphafoldartificial intelligenceBasic drug DiscoveryBioinformatics and Computational BiologyBioinformatics Software DevelopmentC++cheminformaticscomputational biologycomputational chemistryComputational Drug Discoverycomputational sciencescomputer-aided drug designdeep learningDeeping learning and protein structuremachine learningMolecular Dockingmolecular dynamicsmolecular modellingnatural language processingneural networkspostdoctoral research fellowprecision medicineprotein-protein interactionsPythonreinforcement learningRNA biochemistryRNA-protein interactionssmall molecule drug discoveryTeaching
Career Goals
Industry ResearchScience CommunicationsStart-up
Countries of Interest
GermanyUnited Arab EmiratesUnited Kingdom (UK)United States (US)