Commit Graph

31 Commits

Author SHA1 Message Date
Simon Mathis
efa7aae8c2 remove .DS_Store files (#167) 2026-01-15 14:10:44 -08:00
jbutch
a9196592ca Fix installation 2025-12-04 19:25:17 -08:00
jbutch
3aa634158d Minor updates to readme 2025-12-03 17:12:19 -08:00
Jasper Butcher
94bb987998 Refactor/checkpoint defaults (#723)
* Registry rework

* Update README 12

* Mc

* Add all option correction
2025-12-03 00:34:45 -08:00
Nathaniel Corley
f3c01c2a2d refactor: rf3 and rfd3 lab training (#709)
* Add current_epoch arg to validation_batch_end, minor corrections to rfd3 validation

* fix: callbacks, readmes

* chore: rf3 checkpoints, quiet mode

* fix: fabric trainer model imports

* chore: commit .env

---------

Co-authored-by: jbutch <jbutch@uw.edu>
2025-12-01 18:14:07 -08:00
jbutch
d603dee7c4 Make format mpnn & with unsafe fixes 2025-11-29 23:39:25 -08:00
Jasper Butcher
7c1a77ff0c Add enzyme example, refactor io (#688)
* Add enzyme example, refactor io

* udpate
2025-11-25 16:45:25 -08:00
Nathaniel Corley
2ee287a1d5 feat: script to predict and score (#635)
* feat: enable jupyter notebooks; cleanup

* fix: apptainer

* fix: more apptainer shenanigans

* feat: add script to predict AND score structures in-memory
2025-11-10 09:03:30 -08:00
Nathaniel Corley
d10cc62c69 test: additional regression tests (#581)
* docs: README warning update

Added notices regarding bug fixes and upcoming changes.

* fix: inference for RF3

* chore: reomve submodules

* feat: update ligand templating, bump atomworks version

* Fix iptm bug, add min ipae and ipde calc and ruff formatting

* refactor: rf3 lab

* wip: save before letting Claude rip

* fix: working rf3

* chore: apptainers, ipd, rf3 runnable

* fix: README, environment, conftest

* add laproteina training and inference with new rf3 structure

* fix: updates to fix RF3

* feat: updates from proteina pipeline, squashfs manager

* fix: updates to support latest Run10 RF3

* fix: element type bug, utils, conformers

* update: ckpt path

* fix: add symlinked apptainer

* fix: MSAs for inference

* fix: apptainers, submodule bump for atomworks

* fix: apptainers

* Wip/attention improvements (#554)

* feat: inference sharding; folders

* refactor: attention module reorganization and cleanup

* fix: regression tests

* feature: RF3 inference with atom arrays

* refactor: move datasets

* tests: additional regression tests

---------

Co-authored-by: Kieran Didi <kieran.didi@gmail.com>
Co-authored-by: Kieran Didi <kdidi@nvidia.com>
2025-10-24 14:57:12 -07:00
ncorley
2abd193d47 fix: apptainers, submodule bump for atomworks 2025-10-08 13:57:07 -07:00
ncorley
423ff10a89 fix: element type bug, utils, conformers 2025-10-07 16:22:28 -07:00
ncorley
bcd3c527cd fix: updates to support latest Run10 RF3 2025-10-07 13:49:42 -07:00
Kieran Didi
77c995cc11 add laproteina training and inference with new rf3 structure 2025-10-02 17:47:14 -07:00
ncorley
6891de148b chore: apptainers, ipd, rf3 runnable 2025-10-02 13:31:34 -07:00
ncorley
e7d26d294a wip: for release 2025-08-10 09:16:08 -07:00
Nathaniel Corley
26ed40fb07 feat: freeze weights, update resolvers (#199)
* feat: freeze weights, update resolvers

* chore: ruff
2025-05-12 14:15:15 -07:00
Nathaniel Corley
03a6dbb2b0 feat: RFScore arbitrary templating project (#179)
feat: updates to arbitrary templating, model loading support
2025-05-01 11:56:57 -07:00
Nathaniel Corley
5eb870cac2 feat: submodules (#117)
* wip: chirals

* feat: updated chiral metrics, comparative distogram metrics

* feat: submodules

* fix: update code for submodules, misc chirals updates
2025-04-11 13:48:48 -07:00
ncorley
51308df97f Revert "Merge main onto aa_design"
This reverts commit 3d3192ee5e.
2025-04-08 21:50:41 -07:00
jbutch
3d3192ee5e Merge main onto aa_design 2025-04-08 19:00:26 -07:00
Nathaniel Corley
22e251716f refactor/remove old files (#110)
* refactor: delete files

* fix: add back files that included imported functions in chiral code

* chore: add back files to archive, update pyproject.yaml and gitignore
2025-04-08 15:09:38 -07:00
Nathaniel Corley
5a492032d5 refactor: new modelhub (#109)
* Initial commit of chiral changes

Initial checkin of chiral feature code

Add chiral metric

* Update the way chiral features are incorporated into the model

Move initialization to new func

use default pytorch reset parameters

fix initialization for chirals

config

rename argument of confidence head

fix initialization for chirals

* refactor: src nest, rename rf2aa to modelhub

* refactor: initial commit without projects

* Initial commit of chiral changes

* Initial checkin of chiral feature code

* Add chiral metric

* Remove option for double residual connection.  Add kq_norm oiptions to base (20250125) config.

* Restoring flag

* config

* rename argument of confidence head

* Update the way chiral features are incorporated into the model

* config

* rename argument of confidence head

* Update the way chiral features are incorporated into the model

* Initial commit of chiral changes

Initial checkin of chiral feature code

Add chiral metric

* Update the way chiral features are incorporated into the model

Move initialization to new func

use default pytorch reset parameters

fix initialization for chirals

config

rename argument of confidence head

fix initialization for chirals

* refactor: new modelhub

---------

Co-authored-by: fdimaio <dimaio@uw.edu>
Co-authored-by: HaotianZhangAI4Science <haotianzhang@zju.edu.cn>
2025-04-08 13:33:17 -07:00
Pascal Sturmfels
9b45c3910a Recurate data 2024-04-27 16:34:10 +00:00
Nate Corley
bb06653ab5 Fix missing random module; Implement logging to W&B with... 2024-03-26 22:16:05 +00:00
Frank DiMaio
237593624f Fixes (from Nate Corley) for AMP and flash attention 2024-03-12 19:10:08 +00:00
Rohith Krishna
faa99bedf9 added ability to regression test datasets 2024-01-31 01:39:46 +00:00
Rohith Krishna
fd53fffe50 Refactor network 2024-01-30 03:16:52 +00:00
Pascal Sturmfels
2b0403e841 Changes to negative item loading 2023-04-14 20:47:45 +00:00
Rohith Krishna
0aee8e0f6a Rebased Frank's branch wrt main 2023-02-28 21:19:08 +00:00
Rohith Krishna
9ee11f19a0 removing reliance on chid2L
changes

dry_run.py which runs through examples to check for loader errors
fixed conflcits

changed dataloaders to use assembly with num_chain=1
fixed fape and lddt

assembly loader for trainer
conflcit

merge conflcits

added covalent capabilities to the dataloader

added in modified residues to training

tweaked sm compl assembly cropping to be more permissive with keeping protein stubs

bug fix to reindex_protein_features_after_atomize, added FD3 phase 3 curation code, training now starts on digs w/o errors

training tweaks

limit number of ligands & protein chains

added additional curation scripts

added column to sm compl dataset csvs with LIGATOMS_RESOLVED

added code to get_train_valid_set to filter out partially resolved ligands

added a boolean flag `diffusion_training` to get_train_valid_set to use different loading logic and datapickles in diffusion training

refactored 'find_residues_to_atomize_covale'

fixed bug with truncating lig/prot partners at a set value, needs to be done before adding atomized residues

removed DATAPKL_DIFFUSION from params variable in data_loader.py because this is being handled upstream in diffusion training scripts

fixed bug in find_residues_to_atomize_covale

minor changes

fixed cropping of partially masked or partially cropped ligands

added argument `use_partial_ligands` to `crop_sm_compl_assembly` to control whether partially masked ligands are included in crop

added p_modres

added in p_atomize_modres into dstilled and validation sets

changed numpy arrays to tensors in get_assembly_msas to avoid type errors

moved p_atomize_modres into data loader params / arguments.py, apply it only in loader_sm_compl_assembly

limit total templates to MAXTPLT after combining templates across different protein chains

minor bug fix in loader_rna

allatom lddt calculation can be "striped" to save GPU memory

bug fix to striped lddt calculation

bug fix lddt

only compute lj between close residues to save GPU mem

bug fix atomize residue

moved try up

bug fix

added checkpointing on call to structure refinement layer

added MAXMASKEDLIGATOMS parameter and code to remove masked ligand atoms if they exceed a certain number within a given ligand

modified assembly croppping to only choose query atoms that are not masked

added new multi-MSA loading method

fixed small bug with removing masked atoms from ligands

minor refactor to handling of chiral index offsets in featurize_single_chain

bug fixes

bug fixes

training tweaks

bug fix

expand_multi_msa bug fix

tweak calc_BB_bond_geom to work with all-atom codebase

minor bug fix

bug fix

training changes

added renamed load_multi_msa() to load_minimal_multi_msa() and added load_multi_msa() to take some code out of loader_sm_compl_assembly()

recurated data to get rid of some buggy examples

tweaked .gitignore

training tweaks

consolidated sm_compl csvs into same file, labeled different datasets in the SUBSET column, updated get_train_valid_sets() to parse this

tweak to get_train_valid_set

Adds negative datasets to data_loader.py

Also some .gitignore improvements, __init__.py files for importing from
non-relative paths.

Added Binder Network to RF model

Added binder loss calculation, weighting

Added dataloading of negatives, new main

The new main script can submit things via submitit, and makes
it slightly more convenient to submit running jobs over multiple
gpus and nodes.

Bug fixes

Fixes to same chain calculations, cropping, arguments, and realignmenT

added smiles loading

Bug fixes, dude validation, intra chain FAPE fixes

minor fixes

changed multi-MSA pairing algorithm.

instead of adding back "filler" sequences, unpaired sequences are stacked on the bottom of the MSA during pairing. the "overlap" region between the 2 input MSAs is kept track of, so that this region is included in any unpaired sequences. this method retains more pairing information than the previous method.

better comments for multi-MSA stuff

training fixes

bug fix

tweaks

tweak

bug fixes to multi MSAs

fixed frame bug

tweak DistributedWeightedSampler to correct rounding error on num examples in last dataset, which can lead to DDP hangs at end of epoch

minor bug fix

minor tweak

bug fix to sampling negative ligand examples

updated datasets to exclude covalent bonds to H

updated curation to remove covalent bonds to H's

curation tweaks

curation tweaks

removed all bad examples at curation time, got rid of filters in get_train_valid_set

minor bug fix after merge
2023-02-28 09:34:09 -08:00
Jue Wang
9098f12b03 moved from RF2_allatom to base folder 2022-09-22 18:09:14 -07:00