Database Commons

a catalog of worldwide biological databases

Database Profile

PAD

General information

URL: http://naturalantibody.com/pad
Full name: Patented Antibody Database
Description: The Patented Antibody Database comprises antibody sequences found in patent documents from primary sources (USPTO, WIPO) and third parties (DDBJ, EBI). The database currently encompasses approximately 267,722 antibody chains (148,774 heavy chains and 118,948 light chains) from 19,037 patent families.
Year founded: 2016
Last update:
Version:
Accessibility: Accessible
Country/Region: Germany

Classification & Tag

Data type:
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: Technical University of Denmark
Address:
City:
Province/State:
Country/Region: Germany
Contact name (PI/Team): Konrad Krawczyk
Contact email (PI/Helpdesk): konrad@naturalantibody.com

Publications

Data mining patented antibody sequences. [PMID: 33722161]
Konrad Krawczyk, Andrew Buchanan, Paolo Marcatili

The patent literature should reflect the past 30 years of engineering efforts directed toward developing monoclonal antibody therapeutics. Such information is potentially valuable for rational antibody design. Patents, however, are designed not to convey scientific knowledge, but to provide legal protection. It is not obvious whether antibody information from patent documents, such as antibody sequences, is useful in conveying engineering know-how, rather than as a legal reference only. To assess the utility of patent data for therapeutic antibody engineering, we quantified the amount of antibody sequences in patents destined for medicinal purposes and how well they reflect the primary sequences of therapeutic antibodies in clinical use. We identified 16,526 patent families covering major jurisdictions (e.g., US Patent and Trademark Office (USPTO) and World Intellectual Property Organization) that contained antibody sequences. These families held 245,109 unique antibody chains (135,397 heavy chains and 109,712 light chains) that we compiled in our Patented Antibody Database (PAD, http://naturalantibody.com/pad). We find that antibodies make up a non-trivial proportion of all patent amino acid sequence depositions (e.g., 11% of USPTO Full Text database). Our analysis of the 16,526 families demonstrates that the volume of patent documents with antibody sequences is growing, with the majority of documents classified as containing antibodies for medicinal purposes. We further studied the 245,109 antibody chains from patent literature to reveal that they very well reflect the primary sequences of antibody therapeutics in clinical use. This suggests that the patent literature could serve as a reference for previous engineering efforts to improve rational antibody design.

MAbs. 13(1) | 21 Citations (from Europe PMC, 2025-12-20)

Ranking

All databases: 775/6895 (88.774%)
Health and medicine: 193/1738 (88.953%)
Total rank: 775
Citations: 20
z-index: 20

Community reviews

Not Rated

Record metadata

Created on: 2022-04-22
Curated by:
Lin Liu [2022-06-20]
Sicheng Luo [2022-05-06]
Sicheng Luo [2022-05-03]
Yuxin Qin [2022-04-22]