Saturday 29 January 2022

Correspondence with GPuccio: the size of the set of protein clusters

I asked him where the figure of roughly 2000 protein clusters (superfamilies) comes from.

GP's reply:

My number (2000) for superfamilies was derived from the SCOP classification (probably the oldest classification of protein structure). It is now SCOP2, and the current number of superfamilies is 2783, while the families are now 5840.

Of course there are now many different categorizations, and the numbers depend critically on how one defines a superfamily or a family (or a fold, which should be the highest level of categorization). The SCOP definitions are rather clear, and I think they are rather sound. However, 15000 seems a rather excessive number for superfamilies. Have you any idea of the reference?

Here is a link to the current statistics in SCOP:

https://scop.mrc-lmb.cam.ac.uk/stats

And here is a Wikipedia link for SCOP:

https://en.wikipedia.org/wiki/Structural_Classification_of_Proteins_database

Another important classification is Pfam. There you find families (about 11000), domains (about 6000) and clans (about 600). Here is a link to a recent paper.

https://academic.oup.com/nar/article/49/D1/D412/5943818

Here, too, GP is true to himself, consistently giving his opponents a head start. I would have rounded it to 3000 :)
