Protein Sequencing Sample Preparation and General Information

Sample Amount
Sample Quantitation
Sample Preparation: Download electro-blotting procedure
Blocked N-terminal
Chemistry
Laboratory Procedures
Terms


The major difficulty in protein sequencing is sample preparation. The major difficulty with sample preparation is that every protein is different and behaves differently when removed from its “environment,” whether this is an in vivo environment or an in vitro environment in which you are used to handling it. If you have a sequencing project, please consult with us from the start of your project if possible. We may be able help you plan your purification strategy to maximize your chance of having enough protein to sequence. At the very least, we can make suggestions to improve your chances.

Sample Amount

Most investigators greatly over-estimate the amount of protein they have. This is understandable in light of the difficulty and the amount of labor involved in protein purification. The MB&P Core has an Applied Biosystems (ABI) Model 494 sequencer which will sequence proteins in the low picomolar range. The model 494 sequencer uses standard Edman degradation, which has been modified to accommodate automated sequencing. The 494 will perform both gas phase and pulsed liquid sequencing. It is ideal for a situation where a mixture of sample types, such as gel blots or samples in solution, is encountered.

The manufacturer of our instrument, Applied BioSystems, guarantees that the instrument will sequence 10 picomoles of standard protein with >93% repetitive yield. That means that with 10 pmol of protein, for example, about 0.5 micrograms of a protein with a molecular weight of 50,000, the instrument is able to sequence twenty amino acids and still remain well within the detection limits of the PTH analyzer (see Chemistry below). The protein standard that is used to demonstrate this is beta-lactoglobulin, which is a readily purified protein and easily sequenced.

Sample Quantitation

The two most common causes of unsuccessful sequencing are insufficient sample and a blocked N-terminus. The best way to distinguish between the two problems is sample quantitation. If there is sufficient protein present and no sequence is obtained, the N-terminal is probably blocked. Therefore, quantitation is an absolute requirement for sample submission. Blotting to PVDF membrane is the method of choice for most unknown proteins. For blotted samples an easy way to estimate the number of pmoles in your protein sample is to run it with known quantities of at least two concentrations of unstained standards in adjacent lanes of the same gel/blot. Try to choose known proteins whose molecular weights closely approximate your protein(s) of interest. We recommend BioRad unstained molecular weight markers; they are available in several molecular weight ranges to suit the needs of your experiment. Bovine serum albumin is not recommended as a standard since it reacts atypically with many stains. Densitometry should be used to compare the staining intensity of your sample to the intensity of the known quantities of protein standards. There is a laser densitometer and appropriate software in the core facility available 24 hours per day, seven days per week. We like Invitrogen’s colloidal Coomassie Blue kit; the sensitivity of staining is about 50ng and it is easy to use.

Blocked N-terminal

Many proteins acquire blocked N-termini during purification. A procedure called internal sequencing is usually used on these samples unless the N-terminal blocking group can be easily removed. Internal sequencing requires that the N-terminally blocked protein be subjected to proteolytic digestion. The fragments are then separated by micro-bore or capillary HPLC, then collected and sequenced individually. Fractions containing more than one peptide are difficult to interpret. Even if the sequences of all the species are known, sorting out which amino acids belong to which peptide is sometimes not possible due to “preview” and “lag” phenomena which occur as a result of the sequencing process. Sometimes two proteins that occur in two vastly different quantities - e.g., 100 pmol and 10 pmol - can be co-sequenced and provide usable data.

Sample Preparation

Samples are loaded onto the sequencer in two ways: in solution or blotted to PVDF membranes. For proteins or peptides in solution, the sample is dried onto a specially treated glass fiber filter. This method is suitable for most proteins and peptides. Usually, samples presented in solution have been isolated by traditional means such as ion exchange and/or size exclusion chromatography, RP-HPLC, etc. Proteins can also be loaded on the sequencer blotted to a PVDF membrane. These can be electro-blotted samples from PAGE gels or samples adsorbed to PVDF (ABI’s ProSorb cartridge) membrane. This technique is particularly useful for samples containing buffers that may interfere with the sequencing chemistry. We recommend ABI’s specially treated ProBlot PVDF Membrane for electro-blotting samples. You can download the procedure HERE. Other brands of specially treated PVDF membrane work as well. Transferring the sample to PVDF membrane has become the preferred method for sample presentation. This method offers a slightly easier sample preparation and a lower background on the sequencer. The only problem with PVDF blots is that small peptides of <30 amino acids tend to wash off the membrane during the sequencing operation. Samples should be cut out and trimmed as closely as possible. Multiple bands of the same protein can be submitted if conditions prevent having sufficient protein in a single band. Blotted samples should be submitted dry in 1.5mL eppendorf tubes. Please consult the MB&P Staff for help quantitating proteins prepared by chromatographic methods.

Chemistry

The chemistry for automated protein sequencing is a slight variation of the method developed by Pehr Edman in the 1950's. It consists of three main steps: coupling, cleavage, and conversion. The free N-terminus of the first amino acid in the protein is coupled to phenylisothiocyanide. This complex is subsequently cleaved off the protein with a strong acid, trifluoroacetic acid (TFA), leaving a free N-terminus on the protein. The resulting complex, the anilinothiazolinone (ATZ) amino acid, is extracted and removed to a separate vessel and converted in aqueous TFA to the more stable phenylthiohydantoin (PTH) derivative. The chemistry is then cycled again and additional PTH amino acids are made in the order in which they appear in the protein. The PTH amino acids are then analyzed by reverse phase HPLC. In automated sequencing, one cycle takes approximately thirty minutes. The limit of this chemistry is the amount of PTH amino acid you can “see” with the UV detector. This means that every step of the chemistry must be optimized to give a maximum yield of all PTH amino acids. This, unfortunately, is a major source of variability in protein sequencing. Some amino acids do not exhibit average behavior when subjected to the Edman chemistry. Since it can’t be predicted a priori when they will occur in a sequence, changes in the chemistry can’t be made to accommodate them. Typically, the amino acids Cys, Ser, Pro, and Gly are problematic for different reasons. Detection of Cys requires an additional derivatization step. Other amino acids may present problems to a lesser extent. These problems are usually related to the protein’s primary structure.

Laboratory procedures

The optimization of the Model 494 will be checked every twenty sequencing runs. Drying times and reagent and solvent deliveries will be checked before each run. The PTH analyzer will be pre-cycled to ensure adequate separation of all PTH-amino acids at the lowest possible concentration of standards. Biweekly, or once in every twenty runs, a control protein of known sequence, beta-lactoglobulin, will be run after the optimization procedure. The standard will be run at a sufficiently low level to ensure sequencer performance at that level. These test runs will be compared to previous runs to determine if the instrument is performing correctly. If it is not performing to our standards, we will troubleshoot and correct the problem.

Other Terms

Samples that do not meet the criteria for sequencable samples will be put on the instrument at the normal charge of $250. If there is no discernable sequence or there are unexpected multiple sequences, the instrument will be stopped at the discretion of the operator. If a protein is sequencing particularly well and in the opinion of the operator it will sequence beyond the requested number of amino acids, a reasonable attempt will be made to contact the investigator for permission to continue sequencing. If the investigator cannot be contacted, the sample filter or blot will be preserved with the chance that additional sequence can be obtained if the sample is reloaded on the instrument.



   

sponsored by