A Single Platform to Sequence All Monoclonal Antibody Proteins


An interview with Mingjie Xie, CEO of Rapid Novor, conducted by James Ives

Monoclonal antibodies are used throughout life science research. Please give an overview of why the sequencing of antibody proteins is important.

The primary sequence of an antibody protein is an important piece of information that researchers need at an early stage of the antibody drug discovery, research and development process.


With the sequence information, one can re-make the exact same antibody recombinantly, or perform additional engineering such as isotype switching, subtype switching, species switching and reformatting.

How big are the differences between different antibody protein species, subtypes and formats? What are the challenges to having a unified solution?

Antibody proteins are beautiful crafts from mother nature. Each and every antibody clone is unique, and therefore they all have their own unique sequences.

The differences between antibody protein sequences from different species, even in the conserved framework regions, can be quite significant. Sequence motifs that are frequently present in one species may not be found in another.

These differences have a cascading effect on mass spec experiments. For example, an experimental protocol that works well for mouse antibodies may not work as well for hamster antibodies, or a protocol may work for one subtype but not for others, because some enzymes may not work as effectively.

This is one of the main challenges we have to overcome to design a unified solution.

What techniques have been proposed to address the antibody protein sequencing problem?

Over the years, several papers have been published to address the antibody protein sequencing problem. They range from the manual sequencing and assembly approach published 25 years ago, to homology-database-assisted sequencing algorithms that can achieve over 90% accuracy, to the self-described automated full-length sequencing software released in recent years. But none of them has been widely adopted in the real world.

What challenges have these methodologies faced?

There are many challenges facing scientists when sequencing antibody proteins, both expected and unexpected.

One of the expected challenges here is the overfitting problem, given the small and limited training dataset available publicly. All published works in the literature have trained their algorithms with only a few proteins. As a result, the algorithm is overfitted to those few proteins and does not work well on new ones. This is very likely the main reason why an algorithm works well in the original publication but performs poorly in third-party studies.

Other expected challenges include heterogeneity, which increases the complexity of the sample. The experiments may also be suboptimal, especially when the protocols are ‘borrowed’ from general proteomics experiments. And some peptides just don’t fly well in the mass spectrometer and therefore do not generate any signal.

In reality, there are also many unexpected challenges that we have learned about the hard way.

There are “contaminants”. BSA is a common stabilizing/blocking agent. When added to the antibody sample, it becomes a major issue for sequencing if not removed. For example, 1% BSA usually means the amount of BSA is 10 times higher than that of the target antibody protein. The majority of the mass spectrometry time will then be spent on the BSA protein instead of the antibody of interest.

There may be multiple chains. In about 15% of the monoclonal antibodies we have sequenced, we observed the presence of additional light chains. Separating the multiple chains is not always possible, which increases the complexity of the sequencing analysis.

The “monoclonal” antibody may also be buried in a background of polyclonal antibodies. One example is sequencing a monoclonal antibody from ascites fluid. The ascites often contains polyclonal antibodies from the host animal.

Another example is the use of serum-containing medium during cell culture, especially when cheap fetal calf serum or newborn calf serum is used. The supernatant will contain all the bovine serum proteins, including bovine polyclonal antibodies.

Those background polyclonal antibodies, in both examples, will greatly interfere with the signals generated from the target antibody, especially in the CDR regions, and thus make the sequencing work very difficult, if possible at all.
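The tenfold figure is simple concentration arithmetic. The sketch below assumes that "1% BSA" means 1% weight/volume (10 mg/mL) and that the antibody is supplied at 1 mg/mL; neither value is stated in the interview, and the function name is purely illustrative.

```python
def bsa_to_antibody_ratio(bsa_percent_wv, antibody_mg_per_ml):
    """Mass ratio of BSA to antibody in the same sample volume."""
    # Assumption: percent is weight/volume, so 1% = 1 g per 100 mL = 10 mg/mL.
    bsa_mg_per_ml = bsa_percent_wv * 10.0
    return bsa_mg_per_ml / antibody_mg_per_ml


# Assumed antibody concentration of 1 mg/mL (hypothetical, for illustration).
ratio = bsa_to_antibody_ratio(1.0, 1.0)

# Share of the total protein mass that is actually the antibody of interest.
antibody_fraction = 1.0 / (1.0 + ratio)

print(f"BSA : antibody = {ratio:.0f} : 1")
print(f"antibody share of total protein = {antibody_fraction:.1%}")
```

Under these assumptions the antibody makes up less than a tenth of the protein in the sample, which is consistent with most instrument time being spent on BSA peptides.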

Please give an overview of the concept and workflow of REmAb™ sequencing technology from Rapid Novor.

When creating our REmAb™ sequencing technologies, we were very clear about our goal. We are not trying to demonstrate the ability to sequence one antibody protein, or one specific type of antibody protein. Our goal is to create a robust and routine monoclonal antibody protein sequencing solution. The key words here are robust and routine. We want to create a technology platform that can sequence any given antibody protein, from any species, of any isotype and in any format.

With this goal in mind, we ‘borrowed’ the Agile methodology commonly used in software development, where uncertainties and changes are the norm. We recognized the importance of both the experiment and informatics components in the sequencing process.

This is why we are the first team focused on developing antibody protein sequencing technologies to have built both an in-house mass spectrometry lab and proprietary sequencing software together. In fact, this combined expertise has allowed us to rapidly iterate and improve the technologies, and to maintain our position as the world leader in this field.

The general concept and procedure for our REmAb™ antibody protein sequencing consists of four major steps.

  1. Enzymatically digest the antibody protein into shorter peptides.
  2. Generate high mass accuracy spectra data with a Thermo Orbitrap Fusion mass spectrometer.
  3. Sequence each shorter peptide using the Novor de novo peptide sequencing engine.
  4. Assemble the peptide sequences back into the full-length protein sequence, and accurately determine Isoleucine and Leucine using our WILD™ method.
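
The digestion and assembly steps above can be illustrated with a toy example. This is only a sketch under stated assumptions, not Rapid Novor's method: the protein is a made-up sequence, the two cleavage rules ("cut after K or R" and "cut after D or E") are simplified stand-ins for real enzymes, steps 2 and 3 are replaced by the assumption that every peptide is read perfectly, and the assembly is a plain greedy overlap merge. All function names are hypothetical.

```python
# Leucine and isoleucine residues have the same monoisotopic mass (in Da),
# so mass alone cannot tell them apart; step 4 needs a dedicated method.
RESIDUE_MASS_DA = {"L": 113.08406, "I": 113.08406}


def digest(protein, cleave_after):
    """Step 1 (toy): cut the chain after every residue in `cleave_after`."""
    peptides, start = [], 0
    for i, residue in enumerate(protein):
        if residue in cleave_after:
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])  # trailing piece with no cut site
    return peptides


def overlap(a, b):
    """Length of the longest suffix of `a` that is also a prefix of `b`."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def assemble(peptides):
    """Step 4 (toy): greedily merge the pair with the largest overlap."""
    # Drop duplicates and peptides fully contained in a longer peptide.
    contigs = [p for p in set(peptides)
               if not any(p != q and p in q for q in peptides)]
    while len(contigs) > 1:
        k, a, b = max((overlap(a, b), a, b)
                      for a in contigs for b in contigs if a != b)
        if k == 0:
            break  # nothing overlaps any more; keep separate contigs
        contigs.remove(a)
        contigs.remove(b)
        contigs.append(a + b[k:])
    return contigs


protein = "MKTAYDRGLEFKWSDNRPQHE"  # made-up sequence, not a real antibody
# Two digests with different cut sites give peptides that overlap each other.
peptides = digest(protein, "KR") + digest(protein, "DE")
contigs = assemble(peptides)
print(peptides)
print(contigs)
```

Using two digests with different cut sites is what makes the pieces overlap; peptides from a single digest only touch end to end and could not be ordered this way. A real pipeline also has to cope with sequencing errors, missing peptides and short ambiguous overlaps, all of which this sketch ignores.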

How does the REmAb sequencing technology overcome the challenges?

The short answer is through sophisticated machine learning.

One of the key factors in the successful use of machine learning is the quality and the quantity of data.

A common concept in computer science and mathematics is “garbage in, garbage out” (GIGO), which means that the quality of the output is determined by the quality of the input. With our in-house mass spectrometry lab, we were able to quickly iterate and improve experiments and generate high quality data for all situations.

Over the past years, we have successfully sequenced over 400 antibody proteins, covering all common species and formats, and samples of varying quality. This is not only a great achievement for our team, but has also provided great assets to advance the technologies. We have effectively built the world’s largest, and still growing, mass spectrometry dataset for sequencing antibody proteins. With this large dataset, we avoided the overfitting problem.

Does the REmAb sequencing technology have issues with overfitting to trial data?

No. Mass spectrometry data analysis and machine learning are among our core competencies. We invested heavily at a very early stage in building the large dataset to avoid overfitting. We also built an internal system to periodically retrain the algorithms with new data, so that our sequencing platform improves over time and adapts to new challenges.

What advantages does Rapid Novor have over other sequencing service providers?

There are three key advantages of our REmAb™ antibody protein sequencing services:

  1. high accuracy,
  2. high throughput,
  3. our ability to deal with difficult samples.

On the high accuracy aspect, with our WILD™ method, the first commercial service to accurately distinguish Isoleucine and Leucine using mass spectrometry, we can now be certain of each and every amino acid in the antibody protein. We never settle for anything less than 100%. Even 99% accuracy is NOT good enough: it means a 1% error rate, and thus on average about two amino acids will be wrong across the VH and VL regions.

We have the highest throughput in antibody protein sequencing in the world. All major steps in the sequencing workflow have been automated. We have built internal software systems so that our scientists can review and curate sequencing results in an accurate and timely fashion. Even for a large batch of samples, e.g. 30 mAb proteins, we are able to deliver all the sequencing reports in weeks instead of months. This is the kind of throughput we promise to our customers.

We have the ability to deal with difficult-to-sequence samples, particularly samples with multiple chains or a polyclonal antibody background. So long as the target antibody protein is dominant in the sample (>80% of the total protein amount), we are able to derive the full sequences accurately. We recently lowered the amount of sample required for the sequencing work from 200 µg to 100 µg, and our record is deriving the full heavy and light chain sequences from only 12 µg of mAb protein sample.
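
The "two amino acids" figure can be checked with a short calculation. The sketch below assumes typical variable-region lengths of about 120 residues for VH and 110 for VL; these lengths are not given in the interview. It also assumes that errors occur independently at each position, which is a simplification.

```python
VH_LENGTH = 120  # assumed typical heavy-chain variable region length
VL_LENGTH = 110  # assumed typical light-chain variable region length


def expected_errors(per_residue_accuracy, n_residues):
    """Average number of wrong residues at a given per-residue accuracy."""
    return (1.0 - per_residue_accuracy) * n_residues


def prob_all_correct(per_residue_accuracy, n_residues):
    """Chance that every residue is right, assuming independent errors."""
    return per_residue_accuracy ** n_residues


n = VH_LENGTH + VL_LENGTH
errors_at_99 = expected_errors(0.99, n)
perfect_at_99 = prob_all_correct(0.99, n)

print(f"expected wrong residues at 99% accuracy: {errors_at_99:.1f}")
print(f"chance of a fully correct VH+VL pair:    {perfect_at_99:.1%}")
```

Under these assumptions, 99% per-residue accuracy gives a little over two expected errors across both variable regions, and only roughly one antibody in ten would come out with no errors at all.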

Given the issues with other sequencing solutions, how do you know that the REmAb sequencing results are correct?

Well, there is the ultimate blind test performed by our customers in independent labs.

We derive the sequences from the original antibody protein sample. Once customers receive the sequences, they perform downstream confirmations or validations that fit their research purposes. Very often, in one of these steps, the customers make recombinant antibodies from the sequences we provided and test the binding. The recombinant antibody binds exactly as the original antibody does.

What does the future hold for monoclonal antibody protein sequencing and the REmAb technique?

Our REmAb antibody protein sequencing technologies have been advancing rapidly in the past few years, both in terms of sequencing throughput and in the complexity of the samples we can handle. At the same time, the cost has dropped considerably. This trend will continue in the future. What this means for researchers is that the technology will become more and more accessible.

As we continuously improve our ability to sequence monoclonal antibody proteins, we have seen wide application of, and demand for, sequencing polyclonal antibodies directly from blood. That is exactly what we have been developing internally over the past year, in order to make this a reality.

Where can readers find more information?

Our website is a great resource for information related to antibody protein sequencing.

About Mingjie Xie

Mr. Mingjie Xie, MSc, MBA, is the co-founder and CEO of Rapid Novor Inc. A computer scientist by training, he received his MSc degree in bioinformatics from Western University. He later received his MBA degree from the Richard Ivey School of Business to pursue his interests in business. Prior to co-founding Rapid Novor Inc., Mingjie was the COO of a bioinformatics software company.
