PapayaDB

PapayaDB

Repository of cathepsin-glycosaminoglycan in silico interactions

A curated computational resource for exploring structural models, simulation-derived descriptors and interaction patterns of cathepsin–glycosaminoglycan complexes.

v0.1 In development · Public record upload ongoing
Try searching for: Cathepsin K Heparin CS4 MM-GBSA

What is PapayaDB?

A molecular-dynamics derived PapayaDB organizes cathepsin–GAG systems into approach-specific records linking proteins, glycosaminoglycans, chain lengths, simulation protocols and downloadable analysis outputs. The database is designed to support comparative interpretation of GAG recognition across the human cysteine cathepsin family.

Computational structural biology model Cathepsin K - CS4 dp6

Database coverage

11
Human cysteine cathepsin proteases
6
Glycosaminoglycan classes
300+
Generated complexes
5000+
Generated simulations

Dataset availability notice

Coverage numbers refer to generated PapayaDB simulation datasets. Public record upload and validation are ongoing, and the complete dataset is expected to be available by autumn 2026.

Three complementary simulation datasets

PapayaDB combines multiscale simulation approaches to capture interaction details and conformational behaviour of cathepsin–GAG complexes.

dp2 · dp4 · dp6

All-atom MD

High-resolution simulations of short cathepsin–GAG oligosaccharide complexes, providing detailed structural and interaction descriptors.

dp16

RS-REMD

Enhanced-sampling simulations designed to explore broader conformational landscapes of longer GAG chains for each cathepsin–GAG pair.

heparin dp6

Coarse-grained MD

Extended-timescale simulations of cathepsin–heparin systems using a SUGRES-compatible heparin representation.

What is inside a PapayaDB record?

PapayaDB records are organized at complex, cluster and representative simulation levels.

1 Complex

Cathepsin, GAG class, chain length and dataset type.

2 Clusters

Docking-derived binding-pose groups describing alternative interaction regions.

3 Representatives

Selected structures used for molecular dynamics and downstream analysis.

4 Descriptors

RMSD, MM-GBSA, LIE, hydrogen bonds, contact maps, plots and CSV files.

From structures to interpretable records

The all-atom dataset follows a multi-stage computational workflow combining electrostatic potential analysis, molecular docking, clustering, representative pose selection and molecular dynamics simulations.

structures → binding region prediction → docking → clustering → representative selection → molecular dynamics → descriptors → database records

Explore PapayaDB

Cathepsins

Protein targets, structural templates, electrostatic potential isosurfaces and available complexes.

Browse Cathepsins

GAGs

Glycosaminoglycan classes, chain lengths and dataset coverage.

Browse GAGs

Complex records

Complex-level records, clusters, representatives and simulation-derived outputs.

Explore Complexes

Development status

PapayaDB is currently under development. The associated manuscript is planned to be submitted in October 2026. For citation, collaboration or early-use questions, contact contact@papayadb.org.