foundry

mirror of https://github.com/RosettaCommons/foundry.git synced 2026-06-04 13:24:22 +08:00

Go to file

Ryan McHugh 1fd848a861 feat: expose p_skip of PadDNA to the pipeline so it can be changed in config (#95 )

Co-authored-by: Ryan McHugh <rpmchugh@localhost>

2025-03-27 11:28:17 -07:00

CI automated testing

2024-05-28 01:36:55 +00:00

rf2aa

feat: expose p_skip of PadDNA to the pipeline so it can be changed in config (#95 )

2025-03-27 11:28:17 -07:00

__init__.py

removing reliance on chid2L

2023-02-28 09:34:09 -08:00

.env

refactor: delete data, add makefile, environment yaml, apptainer, env

2025-02-04 21:07:15 -08:00

.gitignore

Recurate data

2024-04-27 16:34:10 +00:00

.gitlab-ci.yml

CI automated testing

2024-05-28 01:36:55 +00:00

apptainer.spec

refactor: delete data, add makefile, environment yaml, apptainer, env

2025-02-04 21:07:15 -08:00

environment.yaml

refactor: delete data, add makefile, environment yaml, apptainer, env

2025-02-04 21:07:15 -08:00

Makefile

refactor: delete data, add makefile, environment yaml, apptainer, env

2025-02-04 21:07:15 -08:00

pyproject.toml

chore: ruff

2025-02-04 21:44:04 -08:00

README.md

fix typo

2024-05-15 19:52:41 +00:00

README.md

RoseTTAFold All-Atom

This repository contains the code to training and running inference on RoseTTAFold All-Atom (RFAA), a neural network that can predict the structures of proteins in complex with DNA, RNA, and/or small molecule ligands.

rf2aa/ contains the model and training code. data/ contains code used to curate the training data from the PDB.

Contributing to RFAA

Set Up

git clone https://git.ipd.uw.edu/jue/RF2-allatom.git
cd RF2-allatom

If you are on digs, the S3nv.sif apptainer has all the relevant packages. To get started coding:

export PYTHONPATH="../RF2-allatom"

First, run the test suite:

apptainer exec --nv /software/containers/versions/SE3nv/SE3nv-20240415.sif pytest tests/

If all the tests pass, you have a stable version of the code.

Running model training

We use a package called hydra to configure different training runs of the model. Config files for different training runs can be found in rf2aa/config/train. The base trainable version is rf2aa/config/train/rf2aa.yaml, to run training with this version, run:

/software/containers/versions/SE3nv/SE3nv-20240415.sif trainer_new.py --config-name rf2aa

These tests are most often run on a4000s on digs. If you have a separate installation of cifutils in your home directory, this can potentially break the tests.

If you make changes in the code, they should NOT break backwards compatibility, e.g. there should be a flag in the yaml files that would make it as if your changes were never committed.

Contributing to model code

Generally, we follow software engineering practices of:

Not duplicating functionality that is already in the code
Keeping functions as short as possible, and splitting complicated functions into multiple functions
Using object oriented programming, which means subclassing already existing classes when possible.
Writing tests for our code and sending small functional PRs for review.
Maintaining code stability and not breaking backwards compatibility for users using the package.

To write new blocks in RF, you can go to the rf2aa/model directory and add the new block into the simulator_blocks.py file (and be sure to add a relevant name in the blocks_factory dictionary). These names can be referenced in hydra configs: see rf2aa.yaml for an example with any keyword arguments necessary to initialize the block.