Talking with my Chemistry PHD friend about this issue of validating that the amino acid order in the peptide is correct, (and how to test for it) - he guided me towards these approaches, and suggested I’d need to work with the analytical chemistry lab (and get some support form them… these are not entirely standardized tests it seems).
1. Edman Degradation
Mechanism: Edman degradation sequentially removes and identifies one amino acid at a time from the N-terminus of a purified peptide. Under mildly alkaline conditions, the N-terminal amino group reacts with phenylisothiocyanate (PITC) to form a phenylthiocarbamyl (PTC) derivative. Subsequent treatment with an anhydrous acid, typically trifluoroacetic acid (TFA), cleaves the peptide bond of the N-terminal residue, yielding a thiazolinone derivative and leaving the rest of the peptide chain intact. The thiazolinone is extracted and converted into a more stable phenylthiohydantoin (PTH)-amino acid in aqueous acid, which is then identified via High-Performance Liquid Chromatography (HPLC) against known standards.
Strengths:
- Provides absolute sequence confirmation, which is highly precise for short peptides (up to 30–50 residues).
- Essential for verifying the exact N-terminal identity of recombinantly expressed proteins in biotech quality control.
Limitations:
- Inefficiency for large proteins: Yield drops with each cycle (typically 98% efficiency per cycle), leading to background noise that obscures the signal after ~50 cycles.
- N-terminal blockage: Completely fails if the N-terminus is chemically modified (e.g., formylation, acetylation, or pyroglutamate formation).
- Cannot identify positionally complex Post-Translational Modifications (PTMs) or cross-links efficiently.
2. Mass Spectrometry (MS)
Mechanism: Mass spectrometry has superseded Edman degradation for high-throughput proteomics. It determines sequences by measuring the mass-to-charge ratio () of ionized peptides and their fragments in a gas phase.
Dominance in Biotech:
- High Sensitivity: Requires only femtomolar to attomolar amounts of sample.
- Throughput & Complexity: Capable of analyzing complex mixtures (e.g., whole-cell lysates) without prior extensive purification.
- PTM Identification: The gold standard for mapping PTMs (phosphorylation, acetylation, advanced glycation end-products), which are critical variables in aging and longevity research.
Standard “Bottom-Up” Workflow:
- Proteolytic Digestion: Cleave the target protein into manageable peptides using a specific protease (e.g., Trypsin, which cleaves strictly C-terminal to Arg and Lys).
- Chromatographic Separation: Separate the complex peptide mixture via liquid chromatography (LC) directly coupled to the mass spectrometer.
- Ionization: Volatilize and ionize peptides using Electrospray Ionization (ESI) or Matrix-Assisted Laser Desorption/Ionization (MALDI).
- Tandem MS (MS/MS) Fragmentation: Isolate specific peptide precursor ions and fragment them using Collision-Induced Dissociation (CID) or Higher-energy Collisional Dissociation (HCD). This primarily breaks the peptide bonds, generating sequence-specific -ions (N-terminal fragments) and -ions (C-terminal fragments).
- Bioinformatic Reconstruction: Match the experimental MS/MS spectra against theoretical spectra generated from protein databases, or use computationally demanding de novo sequencing algorithms when a reference genome is unavailable.
3. Indirect Sequencing via DNA/RNA Sequencing
Mechanism: Deriving the primary structure of a protein by sequencing its corresponding gene or mRNA transcript and utilizing the standard genetic code to computationally translate the nucleotide sequence.
Caveats in Applied Biology:
- The “Static” Genome Problem: Fails to capture the dynamic reality of the proteome. It cannot detect alternative splicing variants unless RNA-seq is specifically utilized and properly mapped.
- Blind to Maturation: Cannot detect vital proteolytic processing events (e.g., cleavage of signal peptides or pro-hormones).
- Blind to PTMs: Offers no data on the ~300 known biological modifications that regulate protein function, stability, and half-life—data essential for developing structural analogs in biopharmaceutical engineering.
4. Hybrid Approaches & The Research Frontier
Modern proteomics relies on multi-omic integration. Combining genetic database searching (genomics/transcriptomics) with high-resolution MS/MS allows for rapid mapping of massive biological systems.
Scholarly Debates & Emerging Frontiers:
A current limitation in the field is the reliance on “bottom-up” MS, which destroys macroscopic protein context. The exact combination of PTMs on a single unbroken protein molecule (its “proteoform”) is lost.
- Top-Down Proteomics: Involves ionizing and fragmenting intact proteins. It maps the true proteoform but is severely limited by decreasing ionization efficiency and complex spectral overlaps as protein mass increases.
- Single-Molecule Protein Sequencing: Analogous to nanopore DNA sequencing, research is currently active in designing nanopores or using fluorophore-labeled Edman-like degradation to read individual protein molecules. This remains a significant technical challenge requiring highly engineered biological or solid-state pores.
Summary Table: Analytical Profiling of Peptides
| Method | Primary Utility | Major Limitations |
|---|---|---|
| Edman Degradation | Absolute N-terminal sequence confirmation of purified short peptides. | Blocked by N-terminal modifications; severe yield drop-off for long chains (>50 residues). |
| Mass Spectrometry (MS) | High-throughput, highly sensitive sequencing; PTM mapping; complex mixture analysis. | Relies heavily on computational reconstruction and databases; susceptible to ion suppression artifacts. |
| Amino Acid Analysis | Accurate quantitation of total amino acid ratios; determining absolute protein concentration. | Provides no sequence order; specific residues (Trp, Asn, Gln, Cys) are damaged or altered during standard hydrolysis. |
| Genetic Sequencing | Rapid, high-accuracy prediction of theoretical primary structure. | Completely ignores translational processing, proteolysis, and all post-translational modifications. |
Would you like me to detail the specific bioinformatic algorithms and statistical scoring models (e.g., SEQUEST, Mascot, or deep learning tools) used for de novo MS/MS sequence reconstruction?
I heard back from Finnrick on my questions about what they test, and don’t test yet:
Thanks for reaching out and for sharing these important concerns from your community!
Here’s how we currently cover the areas you mentioned:
Endotoxins (LPS): Yes, we offer endotoxin testing as a paid add-on.
Heavy Metals: Yes, we provide heavy metals testing, including a basic FDA 4-panel and more extensive panels on request.
Trifluoroacetic Acid (TFA) and other solvents: We offer solvents testing as a paid add-on.
Dimerization and Aggregation: Our standard peptide testing focuses on identity, purity, and quantity via HPLC, which may not fully detect subtle peptide aggregation or dimerization. We’re exploring expanding capabilities here.
Amino Acid Sequence Accuracy: Our identity testing confirms the peptide’s presence but cannot distinguish scrambled sequences that elute similarly in LC. This is a known challenge industry-wide.
Regarding the “reagent” test you mentioned, that typically refers to our standard panel covering identity, purity, and quantity.
We’re actively working to expand testing options based on feedback like yours, so please keep the suggestions coming!
You can find more details and submit samples here: Free Peptide Testing + Optional Endotoxin and Heavy Metals Analysis | Finnrick
Let me know if you’d like to discuss further or have any other questions!
Best,
Caanan
Note that finnrick is not a lab, more like a lab broker. You need to ask them which labs perform the tests you’re interested in. Lab quality in general varies widely.
Yes - but it’s really hard to tell lab quality. It would seem that Finnrick is the source (purchaser) for a large number of tests, so they would likely have the best idea of quality of the tests (but do they do comparisons - using same sample between multiple labs to check reliability)?
Generally the companies (analytical chemistry labs) who do the most of a given type of test, are going to be the ones that can do it best, and in the most reliable manner. Like anything, the more you do something, the more you’ve standardized the process, and the better the consistency is.
Given this, it would seem that the labs that Finnrick uses, are going to be one of the better labs for doing this, just due to the volume. But perhaps there are other labs that do most of the industry testing in the USA? Does anyone know the top Peptide analytical chemistry labs? I’ll try Google Gemini.
Finnrick wouldn’t know lab quality unless they did comparisons between them; they just assume all labs are created equal. Anyway, this is one of the many reasons most peptide communities don’t trust this company and its ratings.
Here is the list of labs they use :
Mz biolabs
Chromate
Janoshik
BT Labs
TrustPointe
Krause Analytical
The only recommendation I can make is for Janoshik. By volume, they probably do 5-10x of the tests that the rest of the labs in this list do combined. By reputation, they are considered the gold standard.
TrustPointe is also solid, although there are some peptides they cannot test, check with them.
I recommend you stay away from BT and Krause.
Agree that Janoshik is the gold standard. Trustpointe also a solid option for the services they offer, and they’re in the USA.