seq_check {DeepPINCS} | R Documentation |
In real-world cases, most of the data are not complete and contains incorrect values, missing values, and so on. Thus, there may be invalid sequences in the data. This function can find such sequences and remove them from the data. For SMILES strings, the function "webchem::is.smiles" is used. A valid amino acid sequence means a string that only contains capital letters of an alphabet.
seq_check(smiles = NULL, AAseq = NULL, outcome = NULL)
smiles |
SMILES strings (default: NULL) |
AAseq |
amino acid sequences (default: NULL) |
outcome |
a variable that indicates how strong two molecules interact with each other or whether there is an interaction between them (default: NULL) |
valid sequences
Dongmin Jung
Dey, N., Wagh, S., Mahalle, P. N., & Pathan, M. S. (Eds.). (2019). Applied machine learning for smart data analysis. CRC Press.
webchem::is.smiles
seq_check(smiles = example_cpi[1, 1], outcome = example_cpi[1, 3])