jannisborn
committed on
update
- model_cards/article.md +3 -67
- model_cards/description.md +1 -1
model_cards/article.md
CHANGED
@@ -1,76 +1,12 @@
|
|
1 |
-
# Model documentation
|
2 |
|
3 |
-
**Algorithm Version**: Which model version to use.
|
4 |
|
5 |
-
**
|
6 |
|
7 |
-
**Protein target**: An AAS of a protein target used for conditioning. Leave blank unless you use `affinity` as a `property goal`.
|
8 |
|
9 |
-
|
10 |
|
11 |
-
**Maximal sequence length**: The maximal number of SMILES tokens in the generated molecule.
|
12 |
|
13 |
-
**Number of samples**: How many samples should be generated (between 1 and 50).
|
14 |
-
|
15 |
-
**Limit**: Hypercube limits in the latent space.
|
16 |
-
|
17 |
-
**Number of steps**: Number of steps for a GP optimization round. The more steps, the slower the run. Has to be at least `Number of initial points`.
|
18 |
-
|
19 |
-
**Number of initial points**: Number of initial points evaluated. The more points, the slower the run.
|
20 |
-
|
21 |
-
**Number of optimization rounds**: Maximum number of optimization rounds.
|
22 |
-
|
23 |
-
**Sampling variance**: Variance of the Gaussian noise applied during sampling from the optimal point.
|
24 |
-
|
25 |
-
**Samples for evaluation**: Number of samples averaged for each minimization function evaluation.
|
26 |
-
|
27 |
-
**Max. sampling steps**: Maximum number of sampling steps in an optimization round.
|
28 |
-
|
29 |
-
**Seed**: The random seed used for initialization.
|
30 |
-
|
31 |
-
|
32 |
-
|
33 |
-
# Model card -- PaccMannGP
|
34 |
-
|
35 |
-
**Model Details**: [PaccMann<sup>GP</sup>](https://github.com/PaccMann/paccmann_gp) is a language-based Variational Autoencoder that is coupled with a Gaussian process for controlled sampling. This model systematically explores the latent space of a trained molecular VAE.
|
36 |
-
|
37 |
-
**Developers**: Jannis Born, Matteo Manica and colleagues from IBM Research.
|
38 |
-
|
39 |
-
**Distributors**: Original authors' code wrapped and distributed by GT4SD Team (2023) from IBM Research.
|
40 |
-
|
41 |
-
**Model date**: Published in 2022.
|
42 |
-
|
43 |
-
**Model version**: A molecular VAE trained on 1.5M molecules from ChEMBL.
|
44 |
-
|
45 |
-
**Model type**: A language-based molecular generative model that can be explored with Gaussian Processes to generate molecules with desired properties.
|
46 |
-
|
47 |
-
**Information about training algorithms, parameters, fairness constraints or other applied approaches, and features**:
|
48 |
-
Described in the [original paper](https://pubs.acs.org/doi/10.1021/acs.jcim.1c00889).
|
49 |
-
|
50 |
-
**Paper or other resource for more information**:
|
51 |
-
[Active Site Sequence Representations of Human Kinases Outperform Full Sequence Representations for Affinity Prediction and Inhibitor Generation: 3D Effects in a 1D Model (2022; *Journal of Chemical Information & Modeling*)](https://pubs.acs.org/doi/10.1021/acs.jcim.1c00889).
|
52 |
-
|
53 |
-
**License**: MIT
|
54 |
-
|
55 |
-
**Where to send questions or comments about the model**: Open an issue on [GT4SD repository](https://github.com/GT4SD/gt4sd-core).
|
56 |
-
|
57 |
-
**Intended Use. Use cases that were envisioned during development**: Chemical research, in particular drug discovery.
|
58 |
-
|
59 |
-
**Primary intended uses/users**: Researchers and computational chemists using the model for model comparison or research exploration purposes.
|
60 |
-
|
61 |
-
**Out-of-scope use cases**: Production-level inference, producing molecules with harmful properties.
|
62 |
-
|
63 |
-
**Factors**: Not applicable.
|
64 |
-
|
65 |
-
**Metrics**: High reward on generating molecules with desired properties.
|
66 |
-
|
67 |
-
**Datasets**: ChEMBL.
|
68 |
-
|
69 |
-
**Ethical Considerations**: Unclear, please consult with original authors in case of questions.
|
70 |
-
|
71 |
-
**Caveats and Recommendations**: Unclear, please consult with original authors in case of questions.
|
72 |
-
|
73 |
-
Model card prototype inspired by [Mitchell et al. (2019)](https://dl.acm.org/doi/abs/10.1145/3287560.3287596?casa_token=XD4eHiE2cRUAAAAA:NL11gMa1hGPOUKTAbtXnbVQBDBbjxwcjGECF_i-WC_3g1aBgU1Hbz_f2b4kI_m1in-w__1ztGeHnwHs)
|
74 |
|
75 |
## Citation
|
76 |
If you use this webservice, please cite:
|
|
|
1 |
+
# Model documentation
|
2 |
|
|
|
3 |
|
4 |
+
**SMILES**:
|
5 |
|
|
|
6 |
|
7 |
+
cell lines in rows and genes in columns
|
8 |
|
|
|
9 |
10 |
|
11 |
## Citation
|
12 |
If you use this webservice, please cite:
|
model_cards/description.md
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
<img align="right" src="https://repository-images.githubusercontent.com/219031433/3729c600-fcdc-11e9-9cdf-60c4a2b41700" alt="logo" width="120" >
|
2 |
|
3 |
-
PaccMann is a
|
|
|
1 |
<img align="right" src="https://repository-images.githubusercontent.com/219031433/3729c600-fcdc-11e9-9cdf-60c4a2b41700" alt="logo" width="120" >
|
2 |
|
3 |
+
PaccMann is a webservice for anticancer compound sensitivity prediction. For details on usage, please see the [PaccMann paper](https://academic.oup.com/nar/article/48/W1/W502/5836770) in *Nucleic Acids Research*.
|