MaxQtoMSstatsPTMFormat {MSstatsPTM}R Documentation

Convert output of TMT labeled MaxQuant experiment into MSstatsPTM format

Description

Takes as input TMT experiments from MaxQ and converts the data into the format needed for MSstatsPTM. Requires only the modified file from MaxQ (for example Phospho(STY)Sites) and an annotation file for PTM data. To adjust modified peptides for changes in global protein level, unmodified TMT experimental data must also be returned.

Usage

MaxQtoMSstatsPTMFormat(
  sites.data,
  annotation.ptm,
  evidence = NULL,
  proteinGroups = NULL,
  annotation.prot = NULL,
  mod.num = "Single",
  TMT.keyword = "TMT",
  ptm.keyword = "phos",
  which.proteinid.ptm = "Protein",
  which.proteinid.protein = "Leading.razor.protein",
  removeMpeptides = FALSE
)

Arguments

sites.data

modified peptide output from MaxQuant. For example, a phosphorylation experiment would require the Phospho(STY)Sites.txt file

annotation.ptm

data frame annotation file for the ptm level data. Contains column Run, Fraction, TechRepMixture, Mixture, Channel, BioReplicate, Condition.

evidence

for global protein dataset. name of 'evidence.txt' data, which includes feature-level data.

proteinGroups

for global protein dataset, name of 'proteinGroups.txt' data.

annotation.prot

data frame annotation file for the protein level data. Contains column Run, Fraction, TechRepMixture, Mixture, Channel, BioReplicate, Condition.

mod.num

For modified peptide dataset. The number modifications per peptide to be used. If "Single", only peptides with one modification will be used. Otherwise "Total" can be selected which does not cap the number of modifications per peptide. "Single" is the default. Selecting "Total" may confound the effect of different modifications.

TMT.keyword

the sub-name of columns in sites.data file. Default is TMT. This corresponds to the columns in the format Reporter.intensity.corrected.1.TMT1phos___1. Specifically, this parameter indicates the first section of the string TMT1phos (Before the mixture number). If TMT is present in the string, set this value to TMT. Else if TMT is not there (ie string is in the format 1phos) leave this parameter as an empty string (”).

ptm.keyword

the sub-name of columns in the sites.data file. Default is phos. This corresponds to the columns in the format Reporter.intensity.corrected.1.TMT1phos___1. Specifically, this parameter indicates the second section of the string TMT1phos (After the mixture number). If the string is present, set this parameter. Else if this part of the string is empty (ie string is in the format TMT1) leave this parameter as an empty string (”).

which.proteinid.ptm

For PTM dataset, which column to use for protein name. Use 'Proteins'(default) column for protein name. 'Leading.proteins' or 'Leading.razor.protein' or 'Gene.names' can be used instead to get the protein ID with single protein. However, those can potentially have the shared peptides.

which.proteinid.protein

For Protein dataset, which column to use for protein name. Same options as above.

removeMpeptides

If Oxidation (M) modifications should be removed. Default is TRUE.

Value

a list of two data.tables named 'PTM' and 'PROTEIN' in the format required by MSstatsPTM.

Examples


head(raw.input.tmt$PTM)
head(raw.input.tmt$PROTEIN)


[Package MSstatsPTM version 1.4.2 Index]