Commit Graph

126 Commits

Author SHA1 Message Date
Jennifer Wei
c715b13830 Merge remote-tracking branch 'refs/remotes/jnwei/pl_upgrades' into pl_upgrades 2024-05-06 08:42:37 +00:00
Jennifer
0a8ae6a72c Updates low_precision check to use current precision settings. 2024-05-06 08:32:04 +00:00
Jennifer
19c815888d adds reload_dataloaders_every_n_epochs flag 2024-05-06 08:30:34 +00:00
Jennifer
3eed6cb8be more logging changes 2024-05-06 08:29:30 +00:00
Jennifer
53cdb247ab add metric logging to progress bar. 2024-05-06 08:28:40 +00:00
Jennifer
7de0ab00db first pass changes to run with pl 2.1 2024-05-06 08:16:08 +00:00
Jennifer
142c5e65f1 Updates organization of command line flags for pl.Trainer 2024-05-02 03:44:03 -04:00
Jennifer
3a4ed7534d Removes OF copy of zero_to_fp32.py favoring deepspeed.util version 2024-05-02 03:22:55 -04:00
Jennifer
e4f9af23e7 Merge branch 'setup-improvements' of https://github.com/aqlaboratory/openfold into setup-improvements 2024-05-02 03:11:15 -04:00
Jennifer Wei
1952582615 changes required for pytorch2 2024-05-02 02:54:34 -04:00
Jennifer
226c7bac08 DNS: Local profiler changes 2024-05-02 02:54:34 -04:00
Jennifer
eba00d85f7 adds config recording to wandb 2024-05-02 02:54:06 -04:00
Jennifer
870e0db32b Adds experiment_config_json for setting custom configurations with a json. 2024-05-02 02:54:06 -04:00
Jennifer
67454c3982 bugfixes and adds a section to convert optim files 2024-05-02 02:52:04 -04:00
Jennifer Wei
60fbe53990 changes required for pytorch2 2024-05-02 02:36:13 -04:00
Jennifer
ec6058cd56 DNS: Local profiler changes 2024-05-02 02:36:12 -04:00
Jennifer
6b77062dd9 adds config recording to wandb 2024-05-02 02:36:12 -04:00
Jennifer
0f2b292913 Adds experiment_config_json for setting custom configurations with a json. 2024-05-02 02:36:12 -04:00
Jennifer
5bfad07412 Changes resume_model_weights_only flag in train_openfold to reload weights only, without parsing a time step. 2024-05-02 02:36:12 -04:00
Jennifer
a7216c73cf changes reloading weights to account for possible state_dict headers. 2024-05-02 02:36:12 -04:00
Jennifer
711ba0abe2 Add log statement to weight conversion script 2024-05-02 02:36:12 -04:00
Jennifer
dd1dd64137 bugfixes and adds a section to convert optim files 2024-05-02 02:36:12 -04:00
Jennifer
ea142a0a68 fixes deepspeed function definition. 2024-04-19 06:40:37 -04:00
Jennifer
1ae833bfc8 Updates low_precision check to use current precision settings. 2024-04-19 06:14:08 -04:00
Jennifer
bbf989a7bf psivant local pl2 upgrades without mpi configuration 2024-04-16 20:48:33 -04:00
Jennifer Wei
ce000c6061 changes required for pytorch2 2024-04-12 10:14:57 +00:00
Jennifer
523adaf448 adds reload_dataloaders_every_n_epochs flag 2024-04-11 02:57:49 -04:00
Jennifer
577219c112 Removes OF copy of zero_to_fp32.py favoring deepspeed.util version 2024-04-02 05:01:33 -04:00
Jennifer
862635834a add paren to save_hyperparameters 2024-04-02 02:49:16 -04:00
Jennifer
5ff5177bc6 more logging changes 2024-04-01 04:36:34 -04:00
Jennifer
d8418293cd DNS: Local profiler changes 2024-03-29 04:28:27 -04:00
Jennifer
0c3435cc75 add metric logging to progress bar. 2024-03-27 06:06:05 -04:00
Jennifer
cfd2e71981 seed workers fix and validation_epoch_end extra argument 2024-03-25 04:39:39 -04:00
Lukas Jarosch
a56ea9b564 Fix distributed seeding behavior
This adds workers=True to the Lightning seed_everything function, which guarantees different random states across all processes in distributed training. Before this change, some processes on different GPUs with the same worker ID could share the same random state.

Note that this will break reproducibility between runs prior to and after this change.

Also removes the seed and supress_output modules, which were no longer used in OpenFold.
2024-03-19 10:05:22 -07:00
Jennifer
6dc34d71f5 first pass changes to run with pl 2.1 2024-03-19 05:30:07 -04:00
Jennifer
6ba0a594c9 adds config recording to wandb 2024-03-13 01:53:37 -04:00
Jennifer
d1fe024b20 Adds experiment_config_json for setting custom configurations with a json. 2024-03-13 01:22:46 -04:00
Jennifer
f4df72173d Changes resume_model_weights_only flag in train_openfold to reload weights only, without parsing a time step. 2024-02-22 04:03:06 -05:00
Jennifer
ddfccd56d7 changes reloading weights to account for possible state_dict headers. 2024-02-20 04:40:40 -05:00
Jennifer
e31e03781f Add log statement to weight conversion script 2024-02-14 00:48:00 -05:00
Jennifer
775f77dd12 bugfixes and adds a section to convert optim files 2024-02-12 05:14:53 -05:00
Christina Floristean
6275091c96 Fixed learning rate scheduler issue, returned to original msa file parsing 2024-02-07 15:47:24 -05:00
Christina Floristean
f65b75fe48 Fix for loading old OF weights into refactored model 2023-11-03 14:26:18 -04:00
Christina Floristean
d7c115371e Merge branch 'main' into multimer 2023-10-27 12:09:39 -04:00
Christina Floristean
7c7dffd081 Refactoring multimer perm alignment, temporary fixes before removing recycling dimension 2023-10-16 21:20:38 -04:00
Christina Floristean
0cf1541ce1 Refactoring multimer data pipeline and permutation alignment. 2023-10-16 15:40:40 -04:00
Sachin Kadyan
6012b9e1c1 Default value for --use_single_seq_mode arg 2023-10-10 16:33:07 -04:00
Sachin Kadyan
395a9f1ba8 Added flag in training script for using sequence embeddings 2023-10-10 16:33:07 -04:00
Geoffrey Yu
3aaf0ca89f update training code with new input from new multimer pipeline 2023-09-25 15:02:12 +02:00
Dingquan Yu
da92663d80 Update train_openfold.py 2023-09-10 20:10:16 +02:00
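
Note on commit a56ea9b564 (distributed seeding): the message describes data-loader workers on different GPUs ending up with the same random state when they share a worker ID. The sketch below illustrates that collision and one way to avoid it, using only the Python standard library. The helper names (naive_worker_seed, rank_aware_worker_seed, first_draw) and the seed formula are hypothetical and chosen for illustration. They are not Lightning's or OpenFold's actual implementation, which the commit log does not show.

```python
import random

# Hypothetical helpers for illustration only; not Lightning or OpenFold code.


def naive_worker_seed(base_seed: int, worker_id: int) -> int:
    """Seed that ignores the process rank, so it repeats on every GPU."""
    return base_seed + worker_id


def rank_aware_worker_seed(base_seed: int, worker_id: int,
                           global_rank: int, num_workers: int) -> int:
    """Seed that differs for every (rank, worker) pair.

    Assumes 0 <= worker_id < num_workers, which makes the offset unique.
    """
    return base_seed + global_rank * num_workers + worker_id


def first_draw(seed: int) -> float:
    """First number produced by a generator initialised with this seed."""
    return random.Random(seed).random()


if __name__ == "__main__":
    base_seed, num_workers, world_size = 42, 2, 4
    for rank in range(world_size):
        for worker in range(num_workers):
            naive = naive_worker_seed(base_seed, worker)
            aware = rank_aware_worker_seed(base_seed, worker, rank, num_workers)
            print(f"rank={rank} worker={worker} naive={naive} rank_aware={aware}")
```

With the naive scheme, worker 0 gets the same seed on every rank, so each GPU would draw the same random crops or masks. Mixing the rank into the seed removes the overlap, which is also why runs before and after such a change are not reproducible against each other.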