![](/user_photo/_userpic.png)
Огурцов А. Н. Методы бииоинформационного анализа
..pdfS2 (BLOSSUM80) = s(G,G) + s(S,S) + s(A,G) + + s(H , H ) + s(K,Q) = =9 +7 −0 −3 +1−4 +9 −2 −1−4 −1−2 −1+10 −3 +6 +
+[−14](пропуск) −1−1−2 +12 +0 −3 +10 +3 −5 −3 +7 −6 + +2 +7 +6 −4 +10 +[−18](пропуск) +0 −1+7 +12 + 2 = 41.
2.6. МОДЕЛЬНЫЕ ВЫРАВНИВАНИЯ
Ниже приведены результаты парного выравнивания, выполненные программой BLAST, при поиске гомологов для заданной аминокислотной последовательности альфа-субъединицы гемоглобина человека
HBA_HUMAN
>sp|P69905|HBA_HUMAN Hemoglobin subunit alpha OS=Homo sapiens GN=HBA1 PE=1 SV=2 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPN ALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP
AVHASLDKFLASVSTVLTSKYR
Фрагменты каждого выравнивания, вместе с матрицами замен аминокислот 2–9, будут использованы в задачах по вычислению счёта выравнивания и составления таблицы динамического программирования глобального выравнивания двух последовательностей.
Выравнивание 1 с бэта-2 субъединицей гемоглобина Rattus norvegicus (Серая крыса).
hemoglobin subunit beta-2 [Rattus norvegicus] Length=147
GENE ID: 100134871 LOC100134871 | beta globin minor gene [Rattus norvegicus] (10 or fewer PubMed links)
Score = 115 bits (288), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 63/145 (44%), Positives = 87/145 (60%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSHGSA-----QV |
56 |
|||||
Sbjct |
4 |
L+ A+K |
V |
WGKV +A |
GAEAL R+ + +P T+ YF F DLS SA |
QV |
61 |
|
LTDAEKATVSGLWGKV--NADNVGAEALGRLLVVYPWTQRYFSKFGDLSSASAIMGNPQV |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
|||||
Sbjct |
62 |
K HGKKV +A |
+ + H+D++ |
|
+ LS+LH KL VDP NF+LL + +++ L |
HL |
121 |
|
KAHGKKVINAFNDGLKHLDNLKGTFAHLSELHCDKLHVDPENFRLLGNMIVIVLGHHLGK |
||||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|||
Sbjct |
122 |
EFTP |
A+ |
K +A V++ L |
KY |
146 |
|
|
EFTPCAQAAFQKVVAGVASALAHKY |
|
|
40
Выравнивание 2 с бэта-субъединицей гемоглобина Papio anubis (Павиан догеровский, Анубис).
hemoglobin subunit beta [Papio anubis] Length=147
GENE ID: 100137310 HBB | hemoglobin, beta [Papio anubis] (10 or fewer PubMed links)
Score = 114 bits (284), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 63/145 (44%), Positives = 86/145 (60%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQV |
56 |
|||||
Sbjct |
4 |
L+P +K |
V A WGKV |
+ |
E G EAL R+ + +P T+ +F F DLS |
G+ +V |
61 |
|
LTPEEKNAVTALWGKV--NVDEVGGEALGRLLVVYPWTQRFFDSFGDLSSPAAVMGNPKV |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
|||||
Sbjct |
62 |
K HGKKV |
A ++ + H+D++ |
|
+ LS+LH KL VDP NFKLL + L+ |
LA H |
121 |
|
KAHGKKVLGAFSDGLNHLDNLKGTFAQLSELHCDKLHVDPENFKLLGNVLVCVLAHHFGK |
||||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|||
Sbjct |
122 |
EFTP V A+ K +A V+ |
L |
KY |
146 |
|
|
|
EFTPQVQAAYQKVVAGVANALAHKY |
|
|
Выравнивание 3 с бэта-субъединицей гемоглобина major chain Rattus norvegicus (Серая крыса).
hemoglobin, beta adult major chain [Rattus norvegicus] Length=147
GENE ID: 361619 Hbb-b1 | hemoglobin, beta adult major chain [Rattus norvegicus]
(10 or fewer PubMed links)
Score = 110 bits (275), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 58/145 (40%), Positives = 88/145 (61%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQV |
56 |
|||||
Sbjct |
4 |
L+ A+K |
V |
WGKV + |
E GAE+L + + +P T+ YF F DLS |
G+ QV |
61 |
|
LTDAEKATVNGLWGKV--NPVEIGAESLASLLIVYPWTQRYFSKFGDLSSVSAIMGNPQV |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
|||||
Sbjct |
62 |
K HG+KV +A |
+ + H+D++ |
|
++LS+LH KL VDP NF+LL + +++ + |
HL |
121 |
|
KAHGEKVINAFDDGLKHLDNLKGTFASLSELHCDKLHVDPENFRLLGNMIVIMMGHHLGK |
||||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|||
Sbjct |
122 |
EFTP+ |
A+ |
K +A V++ L |
KY |
146 |
|
|
EFTPSAQAAFQKVVAGVASALAHKY |
|
|
Выравнивание 4 с гемоглобином эпсилон Bos taurus (Дикий бык).
hemoglobin, epsilon 1 [Bos taurus]
41
Length=147
GENE ID: 539994 HBE1 | hemoglobin, epsilon 1 [Bos taurus] (10 or fewer PubMed links)
Score = 106 bits (265), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 57/145 (40%), Positives = 84/145 (58%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSHGSA-----QV |
56 |
|||||
Sbjct |
4 |
+ +K |
+ |
WGKV |
E G EAL R+ + +P T+ +F F +LS SA |
+V |
61 |
|
FTAEEKAAITGLWGKVNVE--EAGGEALGRLLVVYPWTQRFFDSFGNLSSASAIMGNPKV |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
|||||
Sbjct |
62 |
K HGKKV |
+ |
A+ ++D++ |
A + LS+LH KL VDP NF+LL + +++ LA H |
|
121 |
|
KAHGKKVLTSFGEAIKNLDNLKGAFAKLSELHCDKLHVDPENFRLLGNVIVIILATHFGR |
||||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|||
Sbjct |
122 |
EFTP V A+ |
K ++ V+T L |
KY |
146 |
|
|
|
EFTPDVQAAWQKLVSGVATALAHKY |
|
|
Выравнивание 5 с субъединицей бэта гемоглобина Ovis aries (Домашняя овца).
hemoglobin subunit beta [Ovis aries] Length=145
GENE ID: 100049064 HBB | hemoglobin, beta [Ovis aries] (10 or fewer PubMed links)
Score = 106 bits (265), Expect = 6e-22, Method: Compositional matrix adjust. Identities = 64/146 (44%), Positives = 88/146 (61%), Gaps = 8/146 (5%)
Query |
2 |
VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQ |
55 |
||||
Sbjct |
1 |
+L+ +K V |
WGKV |
E GAEAL R+ + +P T+ +F HF DLS |
+A+ |
58 |
|
MLTAEEKAAVTGFWGKV--KVDEVGAEALGRLLVVYPWTQRFFEHFGDLSSADAVMNNAK |
|||||||
Query |
56 |
VKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLP |
115 |
||||
Sbjct |
59 |
VK HGKKV D+ +N V H+DD+ |
|
+ LS+LH KL VDP NF+LL + L+V LA H |
118 |
||
VKAHGKKVLDSFSNGVQHLDDLKGTFAQLSELHCDKLHVDPENFRLLGNVLVVVLARHHG |
|||||||
Query |
116 |
AEFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
||
Sbjct |
119 |
+EFTP + A |
K +A V+ L |
+Y |
144 |
|
|
SEFTPVLQAEFQKVVAGVANALAHRY |
|
|
Выравнивание 6 с субъединицей бэта-1 гемоглобина Mus musculus (Мышь домовая).
hemoglobin subunit beta-1 [Mus musculus] Length=147
GENE ID: 15129 Hbb-b1 | hemoglobin, beta adult major chain [Mus musculus] (Over 100 PubMed links)
42
Score = 106 bits (264), Expect = 7e-22, Method: Compositional matrix adjust. Identities = 62/145 (43%), Positives = 87/145 (60%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQV |
56 |
|||||
Sbjct |
4 |
L+ A+K |
V |
WGKV |
+A E G EAL R+ + +P T+ YF F DLS |
G+A+V |
61 |
|
LTDAEKAAVSGLWGKV--NADEVGGEALGRLLVVYPWTQRYFDSFGDLSSASAIMGNAKV |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
|||||
Sbjct |
62 |
K HGKKV |
|
A + + H+D + |
++LS+LH KL VDP NF+LL + +++ L |
HL |
121 |
|
KAHGKKVITAFNDGLNHLDSLKGTFASLSELHCDKLHVDPENFRLLGNMIVIVLGHHLGK |
||||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|||
Sbjct |
122 |
+FTPA |
A+ |
K +A V+ |
L KY |
146 |
|
|
DFTPAAQAAFQKVVAGVAAALAHKY |
|
|
Выравнивание 7 с субъединицей бэта гемоглобина Gallus gallus (Банкивский петух).
hemoglobin subunit beta [Gallus gallus] Length=147
GENE ID: 396485 HBG2 | hemoglobin, gamma G [Gallus gallus] (Over 10 PubMed links)
Score = 105 bits (263), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 57/141 (41%), Positives = 82/141 (59%), Gaps = 8/141 (5%)
Query |
7 |
DKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQVKGHG |
60 |
|||||
Sbjct |
8 |
+K |
+ |
WGKV + E GAEAL R+ + +P T+ +F F +LS |
G+ |
V+ HG |
65 |
|
EKQLITGLWGKV--NVAECGAEALARLLIVYPWTQRFFASFGNLSSPTAILGNPMVRAHG |
||||||||
Query |
61 |
KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP |
120 |
|||||
Sbjct |
66 |
KKV |
+ |
+AV ++D++ N |
S LS+LH KL VDP NF+LL |
L++ LAAH |
+FTP |
125 |
KKVLTSFGDAVKNLDNIKNTFSQLSELHCDKLHVDPENFRLLGDILIIVLAAHFSKDFTP |
||||||||
Query |
121 |
AVHASLDKFLASVSTVLTSKY |
141 |
|
|
|
||
Sbjct |
126 |
|
A+ K + V+ L KY |
146 |
|
|
|
|
ECQAAWQKLVRVVAHALARKY |
|
|
|
Выравнивание 8 с субъединицей бэта-1 гемоглобина зародыша Xenopus laevis (Обыкновенная шпорцевая лягушка).
hemoglobin larval subunit beta-1 [Xenopus laevis] Length=147
GENE ID: 379431 hbd | hemoglobin, delta [Xenopus laevis] (10 or fewer PubMed links)
Score = 105 bits (261), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 61/145 (43%), Positives = 86/145 (60%), Gaps = 8/145 (5%)
43
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQV |
56 |
||||||||
Sbjct |
4 |
LS +K+ + A W KV |
G +AL R+ + FP T+ YF F +LS+ |
G+A+V |
61 |
||||||
LSADEKSAINAVWSKVNIEND--GHDALTRLLVVFPWTQRYFSSFGNLSNVAAISGNAKV |
|||||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
||||||||
Sbjct |
62 |
+ HGKKV |
A+ |
++ H+DD+ N LS LS |
HA +L VDP NFK L+ |
L++ LA |
L A |
121 |
|||
RAHGKKVLSAVDESIHHLDDIKNFLSVLSTKHAEELHVDPENFKRLADVLVIVLAGKLGA |
|||||||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|
|
|
|||
Sbjct |
122 |
FTP V A+ +KF A + |
L+ Y |
146 |
|
|
|
|
|
||
AFTPQVQAAWEKFSAGLVAALSHGY |
|
|
|
|
|
||||||
Выравнивание |
9 |
с субъединицей |
ро гемоглобина |
|
Gallus |
gallus |
(Банкивский петух).
hemoglobin subunit rho [Gallus gallus] Length=147
GENE ID: 419079 HBG1 | hemoglobin, gamma A [Gallus gallus] (10 or fewer PubMed links)
Score = 104 bits (260), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 56/144 (39%), Positives = 84/144 (59%), Gaps = 8/144 (5%)
Query |
4 |
SPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQVK |
57 |
|||
Sbjct |
5 |
S +K |
+ + W KV |
E GAEAL R+ + +P T+ +F +F +LS |
G+ +V+ |
62 |
SAEEKQLITSVWSKVNVE--ECGAEALARLLIVYPWTQRFFDNFGNLSSPTAIIGNPKVR |
||||||
Query |
58 |
GHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAE |
117 |
|||
|
|
HGKKV |
+ AV ++D++ N + LS+LH KL VDP NF+LL + L++ LAAH + |
|
Sbjct |
63 |
AHGKKVLSSFGEAVKNLDNIKNTYAKLSELHCEKLHVDPENFRLLGNILIIVLAAHFTKD 122 |
|
Query |
118 |
FTPAVHASLDKFLASVSTVLTSKY |
141 |
Sbjct |
123 |
FTP A K ++ V+ L KY |
146 |
FTPTCQAVWQKLVSVVAHALAYKY |
Выравнивание 10 с субъединицей гамма гемоглобина Oryctolagus cuniculus (Дикий кролик).
hemoglobin subunit gamma [Oryctolagus cuniculus] Length=147
GENE ID: 100328858 HBG2 | G-gamma globin [Oryctolagus cuniculus] (10 or fewer PubMed links)
Score = 104 bits (260), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 59/145 (41%), Positives = 85/145 (59%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSHGSA-----QV 56 |
|||
Sbjct |
4 |
+ +K + + W V |
+ GAEAL R+ + +P T+ +F F +LS SA |
+V |
61 |
FTAEEKAAITSTWKLVDVE--DAGAEALGRLLVVYPWTQRFFDSFGNLSSSSAIMGNPKV |
44
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
||
Sbjct |
62 |
K HGKKV |
A +AV +VDD+ N |
+ LS+LH +L VDP NFKLL + L++ LA + |
121 |
KAHGKKVLTAFGDAVKNVDDLKNTFAHLSELHCDRLHVDPENFKLLGNVLVIVLAKYFGK |
|||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
Sbjct |
122 |
EFTP V ++ |
K +A V+T L KY |
146 |
|
EFTPQVQSAWQKLVAGVATALAHKY |
|
Выравнивание 11 с субъединицей эпсилон-4 гемоглобина Bos taurus (Дикий бык).
hemoglobin subunit epsilon-4 [Bos taurus] Length=147
GENE ID: 513108 HBE4 | hemoglobin, beta, epsilon 4 [Bos taurus] (10 or fewer PubMed links)
Score = 104 bits (259), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 53/145 (37%), Positives = 80/145 (56%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSH------GSAQV |
56 |
||||
Sbjct |
4 |
+ +K V + W KV |
G E+L R+ + +P T+ +F F + |
G+ +V |
61 |
||
FTTEEKAAVASLWAKVNVEV--VGGESLARLLIVYPWTQRFFDSFGNLYSESAIMGNPKV |
|||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
||||
Sbjct |
62 |
K HG+KV ++ |
NA+ H+DD+ |
|
+ LS+LH KL VDP NF+LL + +L+ LA H |
121 |
|
KAHGRKVLNSFGNAIEHMDDLKGTFADLSELHCDKLHVDPENFRLLGNMILIVLATHFSK |
|||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
||
Sbjct |
122 |
EFTP + AS K |
+V+ L |
KY |
146 |
|
|
EFTPQMQASWQKLTNAVANALAHKY |
|
|
Выравнивание 12 с белком LOC445037 Danio rerio (Данио рерио, аквариумная рыбка).
hypothetical protein LOC445037 [Danio rerio] Length=147
GENE ID: 445037 zgc:92880 | zgc:92880 [Danio rerio] (10 or fewer PubMed links)
Score = 103 bits (258), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 57/144 (40%), Positives = 81/144 (57%), Gaps = 8/144 (5%)
Query |
4 |
SPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSHGSA-----QVK 57 |
|||||
Sbjct |
5 |
S +++ + + W K+ |
+ E G + L R+ + +P T+ YF F DLS SA |
+V |
62 |
||
SDSERKTIASVWSKI--NVDEIGPQTLARVLVVYPWTQRYFGAFGDLSCASAIMGNPKVS |
|||||||
Query |
58 |
GHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAE |
117 |
||||
Sbjct |
63 |
HGK V |
AL AV +VDD+ |
+ LS LH KL VDP NFKLL+ CL + +A + |
|
122 |
|
EHGKTVLKALEKAVKNVDDIKTTYAQLSQLHCEKLNVDPDNFKLLADCLSIVIATNFGPA |
|||||||
Query |
118 |
FTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
||
Sbjct |
123 |
F PAV ++ |
K L+ V |
LTS+Y |
146 |
|
|
FNPAVQSTWQKLLSVVVAALTSRY |
|
|
45
Выравнивание 13 с субъединицей бэта-С гемоглобина Ovis aries (Домашняя овца).
hemoglobin subunit beta-C [Ovis aries] Length=142
GENE ID: 100134870 LOC100134870 | beta-C globin [Ovis aries] (10 or fewer PubMed links)
Score = 103 bits (258), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 62/141 (44%), Positives = 85/141 (61%), Gaps = 8/141 (5%)
Query |
7 |
DKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQVKGHG |
60 |
|||
Sbjct |
3 |
+K + W KV |
E GAEAL R+ + +P T+ +F HF DLS |
G+A+VK HG |
60 |
|
NKALITGFWSKV--KVDEVGAEALGRLLVVYPWTQRFFEHFGDLSTADAVLGNAKVKAHG |
||||||
Query |
61 |
KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP |
120 |
|||
Sbjct |
61 |
KKV D+ +N V H+DD+ |
|
+ LS+LH KL VDP NF+LL + L+V LA H EFTP |
120 |
|
KKVLDSFSNGVQHLDDLKGTFAQLSELHCDKLHVDPENFRLLGNVLVVVLARHFGKEFTP |
||||||
Query |
121 |
AVHASLDKFLASVSTVLTSKY |
141 |
|
|
|
Sbjct |
121 |
+ A K +A V++ L |
+Y |
141 |
|
|
ELQAEFQKVVAGVASALAHRY |
|
|
Выравнивание 14 с субъединицей гамма-1 гемоглобина Papio anubis (Павиан догеровский, Анубис).
hemoglobin subunit gamma-1 [Papio anubis] Length=147
GENE ID: 100137314 HBG1 | hemoglobin, gamma A [Papio anubis]
Score = 103 bits (257), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 55/145 (38%), Positives = 85/145 (59%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQV |
56 |
|||||
Sbjct |
4 |
+ +K |
V + W K+ |
E G EAL R+ + +P T+ +F F +LS |
G+ +V |
61 |
||
FTAEEKAAVTSLWSKINVE--ETGGEALGRLLVVYPWTQRFFDSFGNLSSPSAILGNPKV |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
|||||
Sbjct |
62 |
K HGKKV |
+ |
+A+ ++D++ |
A + LS+LH KL VDP NFKLL + +++ LA H |
121 |
||
KAHGKKVLTSFGDAIKNMDNLKTAFAKLSELHCDKLHVDPENFKLLGNVMVIILATHFGR |
||||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|||
Sbjct |
122 |
EFTP V A+ |
K +++V+ L |
KY |
146 |
|
|
|
EFTPEVQAAWQKLVSAVAIALAHKY |
|
|
Выравнивание 15 с гемоглобином дэльта Xenopus (Silurana) tropicalis
(Водная лягушка силураны).
hemoglobin, delta [Xenopus (Silurana) tropicalis] Length=147
46
GENE ID: 448665 hbd | hemoglobin, delta [Xenopus (Silurana) tropicalis] (10 or fewer PubMed links)
Score = 103 bits (257), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 61/145 (43%), Positives = 85/145 (59%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQV |
56 |
|||
Sbjct |
4 |
LS +K+ + A W KV |
G EAL R+ + +P T+ YF F +LS+ |
G+A+V |
61 |
|
LSADEKSAINAVWQKVDIEKD--GHEALTRLLVVYPWTQRYFSSFGNLSNVTAISGNAKV |
||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
|||
Sbjct |
62 |
GHGKKV A+ A+ H+DD+ + LSALS HA +L VDP NFK L+ L + +A |
L |
121 |
||
HGHGKKVLSAVDEAIHHIDDIKHFLSALSKKHAQELHVDPENFKRLADVLAIVVAGKLGT |
||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|
Sbjct |
122 |
FTP V A+ +KF A + |
L+ Y |
146 |
|
|
AFTPQVQAAWEKFSAGLVAALSHGY |
|
|
Выравнивание 16 с субъединицей бэта гемоглобина Bos taurus (Дикий бык).
hemoglobin subunit beta [Bos taurus] Length=145
GENE ID: 280813 HBB | hemoglobin, beta [Bos taurus] (10 or fewer PubMed links)
Score = 101 bits (252), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 63/146 (44%), Positives = 86/146 (59%), Gaps = 8/146 (5%)
Query |
2 |
VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSHGSA-----Q |
55 |
||||
Sbjct |
1 |
+L+ +K |
V A WGKV |
E G EAL R+ + +P T+ +F F DLS A |
+ |
58 |
|
MLTAEEKAAVTAFWGKV--KVDEVGGEALGRLLVVYPWTQRFFESFGDLSTADAVMNNPK |
|||||||
Query |
56 |
VKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLP |
115 |
||||
Sbjct |
59 |
VK HGKKV D+ +N + H+DD+ |
|
+ALS+LH KL VDP NFKLL + L+V LA + |
|
118 |
|
VKAHGKKVLDSFSNGMKHLDDLKGTFAALSELHCDKLHVDPENFKLLGNVLVVVLARNFG |
|||||||
Query |
116 |
AEFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
||
Sbjct |
119 |
EFTP + A |
K +A V+ L |
+Y |
144 |
|
|
KEFTPVLQADFQKVVAGVANALAHRY |
|
|
Выравнивание 17 с субъединицей эпсилон гемоглобина Pan troglodytes (Обыкновенный шимпанзе).
hemoglobin subunit epsilon [Pan troglodytes] Length=147
GENE ID: 100190972 HBE1 | hemoglobin, epsilon 1 [Pan troglodytes] (10 or fewer PubMed links)
Score = 101 bits (252), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 55/141 (40%), Positives = 84/141 (60%), Gaps = 8/141 (5%)
47
Query |
7 |
DKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQVKGHG |
60 |
||||
Sbjct |
8 |
+K V + W K+ |
E G EAL R+ + +P T+ +F F +LS |
G+ +VK HG |
65 |
||
EKAAVTSLWSKMNVE--EAGGEALGRLLVVYPWTQRFFDSFGNLSSPSAILGNPKVKAHG |
|||||||
Query |
61 |
KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP |
120 |
||||
Sbjct |
66 |
KKV |
+ +A+ ++D++ |
A + LS+LH KL VDP NFKLL + +++ LA H EFTP |
125 |
||
KKVLTSFGDAIKNMDNLKPAFAKLSELHCDKLHVDPENFKLLGNVMVIILATHFGKEFTP |
|||||||
Query |
121 |
AVHASLDKFLASVSTVLTSKY |
141 |
|
|
||
Sbjct |
126 |
V A+ |
K +++V+ L |
KY |
146 |
|
|
EVQAAWQKLVSAVAIALAHKY |
|
|
Выравнивание 18 с субъединицей эпсилон гемоглобина Gallus gallus (Банкивский петух).
hemoglobin subunit epsilon [Gallus gallus] Length=147
GENE ID: 428115 HBE | hemoglobin, beta [Gallus gallus] (10 or fewer PubMed links)
Score = 101 bits (251), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 55/144 (39%), Positives = 82/144 (57%), Gaps = 8/144 (5%)
Query |
4 |
SPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQVK |
57 |
||||
Sbjct |
5 |
S +K |
+ + W KV |
E GAEAL R+ + +P T+ +F F +LS |
G+ +V+ |
62 |
|
SAEEKQLITSVWSKVNVE--ECGAEALARLLIVYPWTQRFFASFGNLSSPTAIMGNPRVR |
|||||||
Query |
58 |
GHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAE |
117 |
||||
Sbjct |
63 |
HGKKV |
+ AV ++D++ N |
+ LS+LH KL VDP NF+LL |
L++ LA+H + |
122 |
|
AHGKKVLSSFGEAVKNLDNIKNTYAKLSELHCDKLHVDPENFRLLGDILIIVLASHFARD |
|||||||
Query |
118 |
FTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
||
Sbjct |
123 |
FTPA |
+ K + V+ L KY |
146 |
|
|
|
FTPACQFAWQKLVNVVAHALARKY |
|
|
Выравнивание 19 с белком LOC100216007 Xenopus (Silurana) tropicalis
(Водная лягушка силураны).
hypothetical protein LOC100216007 [Xenopus (Silurana) tropicalis] Length=147
GENE ID: 100216007 LOC100216007 | hypothetical protein LOC100216007 [Xenopus (Silurana) tropicalis] (10 or fewer PubMed links)
Score = 101 bits (251), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 55/142 (39%), Positives = 83/142 (59%), Gaps = 8/142 (5%)
Query |
7 |
DKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQVKGHG 60 |
|||
Sbjct |
8 |
+K + + W KV |
+ G EAL R+ + +P T+ YF F +LS+ |
G+A+V+ HG |
65 |
EKAAITSVWQKVNLE--QDGHEALTRLLVVYPWTQRYFSSFGNLSNVTAVSGNAKVRAHG |
48
Query |
61 |
KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP |
120 |
|||
Sbjct |
66 |
KV |
A+ NA+AH+D++ |
L |
LS+ HA KL VDP NFK L L++ LA+ L FTP |
125 |
NKVLSAVGNAIAHLDNVKGTLHDLSETHAFKLHVDPENFKRLGEVLVIVLASKLGTAFTP |
||||||
Query |
121 |
AVHASLDKFLASVSTVLTSKYR |
142 |
|
||
Sbjct |
126 |
+ |
+ +KF+A + L+ |
Y |
147 |
|
QIQGAWEKFVAVLVDALSQGYN |
|
Выравнивание 20 с субъединицей эпсилон гемоглобина Macaca mulatta (Макак-резус).
hemoglobin subunit epsilon [Macaca mulatta] Length=147
GENE ID: 715414 HBE1 | hemoglobin, epsilon 1 [Macaca mulatta] (10 or fewer PubMed links)
Score = 100 bits (248), Expect = 5e-20, Method: Compositional matrix adjust. Identities = 54/145 (38%), Positives = 84/145 (58%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQV |
56 |
|||||
Sbjct |
4 |
+ +K |
V + W K+ |
E G EAL R+ + +P T+ +F F +LS |
G+ +V |
61 |
||
FTAEEKAAVTSLWSKMNVE--ETGGEALGRLLVVYPWTQRFFDSFGNLSSPSAILGNPKV |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
|||||
Sbjct |
62 |
K HGKKV |
+ |
+A+ ++D++ |
|
+ LS+LH KL VDP NFKLL + +++ LA H |
121 |
|
KAHGKKVLTSFGDAIKNMDNLKITFAKLSELHCDKLHVDPENFKLLGNVMVIILATHFGK |
||||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|||
Sbjct |
122 |
EFTP V A+ |
K +++V+ L |
KY |
146 |
|
|
|
EFTPEVQAAWQKLVSAVAIALAHKY |
|
|
Выравнивание 21 с субъединицей бэта-2 гемоглобина Xenopus laevis (Обыкновенная шпорцевая лягушка).
hemoglobin subunit beta-2 [Xenopus laevis] Length=147
GENE ID: 397871 hbg2-b | hemoglobin, gamma G [Xenopus laevis] (10 or fewer PubMed links)
Score = 99.8 bits (247), Expect = 7e-20, Method: Compositional matrix adjust. Identities = 54/142 (39%), Positives = 84/142 (60%), Gaps = 8/142 (5%)
Query |
7 |
DKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQVKGHG |
60 |
||||
Sbjct |
8 |
+K |
+ + W KV |
G +AL R+ + +P T+ YF +F +LS+ |
G+A+V+ HG |
65 |
|
EKAAITSVWQKVNVE--HDGHDALGRLLIVYPWTQRYFSNFGNLSNSAAVAGNAKVQAHG |
|||||||
Query |
61 |
KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP |
120 |
||||
Sbjct |
66 |
KKV |
A+ NA++H+D + ++L LS +HA +L VDP NFK |
L++ L A L FTP |
125 |
||
KKVLSAVGNAISHIDSVKSSLQQLSKIHATELFVDPENFKRFGGVLVIVLGAKLGTAFTP |
49
Query |
121 |
AVHASLDKFLASVSTVLTSKYR |
142 |
Sbjct |
126 |
V A+ +KF+A + L+ Y |
147 |
KVQAAWEKFIAVLVDGLSQGYN |
Выравнивание 22 с глобином эпсилон Gallus gallus (Банкивский петух).
epsilon globin [Gallus gallus] Length=147
GENE ID: 428114 HBE1 | hemoglobin, epsilon 1 [Gallus gallus] (10 or fewer PubMed links)
Score = 99.0 bits (245), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 54/141 (39%), Positives = 81/141 (58%), Gaps = 8/141 (5%)
Query |
7 |
DKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQVKGHG |
60 |
||||||
Sbjct |
8 |
+K |
+ |
WGKV + |
E GAEAL R+ + +P T+ +F F +LS |
G+ |
V+ HG |
65 |
|
EKQLITGLWGKV--NVAECGAEALARLLIVYPWTQRFFASFGNLSSATAIIGNPMVRAHG |
|||||||||
Query |
61 |
KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP |
120 |
||||||
Sbjct |
66 |
KKV |
+ |
AV ++D++ |
+ + LS LH KL VDP NF+LL |
L++ LA+H |
+FTP |
125 |
|
KKVLSSFGEAVKNLDNIKKSFAQLSKLHCDKLHVDPENFRLLGDILIIVLASHFSKDFTP |
|||||||||
Query |
121 |
AVHASLDKFLASVSTVLTSKY |
141 |
|
|
|
|||
Sbjct |
126 |
A A+ K + V+ L +Y |
146 |
|
|
|
|||
ASQAAWQKMVRVVAHALAHEY |
|
|
|
Выравнивание 23 с субъединицей эпсилон-2 гемоглобина Bos taurus (Дикий бык).
hemoglobin subunit epsilon-2 [Bos taurus] Length=147
GENE ID: 513240 HBE2 | globin, beta, epsilon 2 [Bos taurus] (10 or fewer PubMed links)
Score = 98.6 bits (244), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 51/145 (36%), Positives = 79/145 (55%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSH------GSAQV |
56 |
|||||
Sbjct |
4 |
+ + V + W KV |
G E+L R+ + |
P T+ +F F + |
G+ +V |
61 |
||
FTTEENVAVASLWAKVNVEV--VGGESLARLLIVCPWTQRFFDSFGNLYSESAIMGNPKV |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
|||||
Sbjct |
62 |
K +G+KV ++ |
NA+ H+DD+ |
+ LS+LH |
KL VDP NF+LL + +L+ LA H |
121 |
||
KVYGRKVLNSFGNAIKHMDDLKGTFADLSELHCDKLHVDPENFRLLGNMILIVLATHFSK |
||||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|
||
Sbjct |
122 |
EFTP + A+ K |
+V+ |
LT KY |
146 |
|
|
|
EFTPQMQAAWQKLTNAVANALTHKY |
|
|
|
50
Выравнивание 24 с гемоглобином альфа Rattus norvegicus (Серая крыса).
hemoglobin, gamma A [Rattus norvegicus] Length=147
GENE ID: 94164 Hbg1 | hemoglobin, gamma A [Rattus norvegicus] (10 or fewer PubMed links)
Score = 97.8 bits (242), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 52/145 (36%), Positives = 83/145 (58%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQV |
56 |
||||
Sbjct |
4 |
+ +K |
+ + W KV |
+ G E L R+ + +P T+ +F F +LS |
G+ ++ |
61 |
|
FTAEEKAAIISIWEKVDLE--KIGGETLGRLLIVYPWTQRFFDKFGNLSSALAIMGNPRI |
|||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
||||
Sbjct |
62 |
+ HGKKV |
+L +AV ++D++ |
+ LS+LH KL VDP NFKLL + L++ L+ H |
121 |
||
RAHGKKVLTSLGSAVENMDNLKETFAHLSELHCDKLHVDPQNFKLLGNMLVIVLSTHFAK |
|||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
||
Sbjct |
122 |
EFTP V A+ K + |
V+ L+ KY |
146 |
|
|
|
EFTPEVQAAWQKLVMGVANALSHKY |
|
|
Выравнивание 25 с бэта-глобином Salmo salar (Сёмга).
beta-globin [Salmo salar] Length=147
GENE ID: 100136584 LOC100136584 | beta-globin [Salmo salar] (10 or fewer PubMed links)
Score = 96.3 bits (238), Expect = 7e-19, Method: Compositional matrix adjust. Identities = 54/142 (39%), Positives = 82/142 (58%), Gaps = 8/142 (5%)
Query |
6 |
ADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLS-----HGSAQVKGH |
59 |
||||
Sbjct |
7 |
A+K+ + A WGKV |
+ E G |
AL R+ + +P T+ YF F D+S |
G+ +V H |
64 |
|
AEKSTISAVWGKVDIN--EVGPLALARVLIVYPWTQRYFGSFGDVSTPAAIMGNPKVAAH |
|||||||
Query |
60 |
GKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFT |
119 |
||||
Sbjct |
65 |
GK V |
AL AV ++ ++ |
+LS+ HA+KL VDP NF++L+ |
L + +AA A FT |
124 |
|
GKVVCGALDKAVKNMGNILATYKSLSETHANKLFVDPDNFRVLADVLTIVIAAKFGASFT |
|||||||
Query |
120 |
PAVHASLDKFLASVSTVLTSKY |
141 |
|
|
||
Sbjct |
125 |
P + A+ |
KF+ V |
+ S+Y |
146 |
|
|
PEIQATWQKFMKVVVAAMGSRY |
|
|
Выравнивание 26 с субъединицей бэта-Н1 гемоглобина Mus musculus (Мышь домовая).
hemoglobin subunit beta-H1 [Mus musculus] Length=147
51
GENE ID: 15132 Hbb-bh1 | hemoglobin Z, beta-like embryonic chain [Mus musculus]
(Over 10 PubMed links)
Score = 95.5 bits (236), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 51/145 (36%), Positives = 81/145 (56%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQV |
56 |
|||||
Sbjct |
4 |
+ +K |
+ + W KV |
+ G E L R+ + +P T+ +F F +LS |
G+ ++ |
61 |
||
FTAEEKAAITSIWDKVDLE--KVGGETLGRLLIVYPWTQRFFDKFGNLSSALAIMGNPRI |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
|||||
Sbjct |
62 |
+ HGKKV |
+L |
V ++D++ |
+ LS+LH KL VDP NFKLL + L++ L+ H |
121 |
||
RAHGKKVLTSLGLGVKNMDNLKETFAHLSELHCDKLHVDPENFKLLGNMLVIVLSTHFAK |
||||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|||
Sbjct |
122 |
EFTP V A+ |
K + |
V+ L+ KY |
146 |
|
|
|
EFTPEVQAAWQKLVIGVANALSHKY |
|
|
Выравнивание 27 с субъединицей бэта гемоглобина Equus caballus (Лошадь домашняя).
hemoglobin subunit beta [Equus caballus] Length=147
GENE ID: 100054109 HBB | hemoglobin, beta [Equus caballus] (10 or fewer PubMed links)
Score = 95.1 bits (235), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 62/145 (43%), Positives = 84/145 (58%), Gaps = 8/145 (5%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLSH-----GSAQV |
56 |
|||||
Sbjct |
4 |
LS +K |
V A W KV |
E G EAL R+ + +P T+ +F F DLS+ |
G+ +V |
61 |
||
LSGEEKAAVLALWDKVNEE--EVGGEALGRLLVVYPWTQRFFDSFGDLSNPGAVMGNPKV |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPA |
116 |
|||||
Sbjct |
62 |
K HGKKV |
+ |
V H+D++ |
|
+ALS+LH KL VDP NF+LL + L+V LA H |
121 |
|
KAHGKKVLHSFGEGVHHLDNLKGTFAALSELHCDKLHVDPENFRLLGNVLVVVLARHFGK |
||||||||
Query |
117 |
EFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|||
Sbjct |
122 |
+FTP + AS |
K +A V+ L |
KY |
146 |
|
|
|
DFTPELQASYQKVVAGVANALAHKY |
|
|
Выравнивание 28 с бэта-типом глобина эмбриона Oncorhynchus mykiss (Радужная форель).
embryonic beta-type globin [Oncorhynchus mykiss] Length=147
GENE ID: 100135791 LOC100135791 | embryonic beta-type globin [Oncorhynchus mykiss] (10 or fewer PubMed links)
Score = 94.0 bits (232), Expect = 4e-18, Method: Compositional matrix adjust.
52
Identities = 49/124 (40%), Positives = 72/124 (59%), Gaps = 6/124 (4%)
Query |
24 |
EYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKG------HGKKVADALTNAVAHVDDM |
77 |
|||
Sbjct |
23 |
+ G |
AL R + +P T+ YF +F + +A ++G |
HGK V |
L AV ++DD+ |
82 |
DVGPAALSRCLVVYPWTQRYFGNFGNLYNAAAIQGNPMVAAHGKTVLRGLDRAVKNMDDI |
||||||
Query |
78 |
PNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVL |
137 |
|||
Sbjct |
83 |
|
+ LS LH+ KLRVDP NF+LL+ CL + +AA + A+FT V + |
KFLA V + L |
142 |
|
KATYAELSVLHSEKLRVDPDNFRLLADCLTIVVAARMGADFTADVQGAFQKFLAVVVSSL |
||||||
Query |
138 |
TSKY |
141 |
|
|
|
Sbjct |
143 |
+Y |
146 |
|
|
|
GRQY |
|
|
|
Выравнивание 29 с субъединицей бэта-4 гемоглобина Oncorhynchus mykiss (Радужная форель).
hemoglobin subunit beta-4 [Oncorhynchus mykiss] Length=148
GENE ID: 100136291 LOC100136291 | beta-globin [Oncorhynchus mykiss] (10 or fewer PubMed links)
Score = 90.1 bits (222), Expect = 6e-17, Method: Compositional matrix adjust. Identities = 54/142 (39%), Positives = 80/142 (57%), Gaps = 9/142 (6%)
Query |
7 |
DKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLS-----HGSAQVKGHG |
60 |
|||||
Sbjct |
8 |
+++ + |
WGK+ |
E G +AL R+ + P T+ +F F +LS |
G+ V HG |
65 |
||
ERSAIVGLWGKISVD--EIGPQALARLLIVSPWTQRHFSTFGNLSTPAAIMGNPAVAKHG |
||||||||
Query |
61 |
KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHL-PAEFT |
119 |
|||||
Sbjct |
66 |
K V |
L AV ++DD+ N |
+ALS +H+ KL VDP NF+LL+ C+ V +AA L PA F+ |
125 |
|||
KTVMHGLDRAVQNLDDIKNTYTALSVMHSEKLHVDPDNFRLLADCITVCVAAKLGPAVFS |
||||||||
Query |
120 |
PAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|||
Sbjct |
126 |
+ |
KFLA V + L |
+Y |
147 |
|
|
|
ADTQEAFQKFLAVVVSALGRQY |
|
|
Выравнивание 30 с субъединицей бэта-1 гемоглобина Xenopus (Silurana) tropicalis (Водная лягушка силураны).
hemoglobin subunit beta-1 [Xenopus (Silurana) tropicalis] Length=147
GENE ID: 394453 hbg1 | hemoglobin, gamma A [Xenopus (Silurana) tropicalis] (10 or fewer PubMed links)
Score = 86.7 bits (213), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 47/146 (33%), Positives = 77/146 (53%), Gaps = 10/146 (6%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLS------HGSAQ 55 |
||
Sbjct |
4 |
L+ ++ + W K+ A |
G +AL M ++P T+ YF F +LS |
H +A |
LTAKERQLITGTWSKICAKT--LGKQALGSMLYTYPWTQRYFSSFGNLSSIEAIFHNAA- 60 |
||||
|
|
|
53 |
|
Query |
56 |
VKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLP |
115 |
|||
Sbjct |
61 |
V HG+KV ++ |
A+ H+DD+ |
+ LS H+ L VDP NFK |
C ++++A L |
120 |
VATHGEKVLTSIGEAIKHMDDIKGYYAQLSKYHSETLHVDPYNFKRFCSCTIISMAQTLQ |
||||||
Query |
116 |
AEFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
|
|
Sbjct |
121 |
+FTP + A+ +K |
A+++ L Y |
146 |
|
|
EDFTPELQAAFEKLFAAIADALGKGY |
|
|
Выравнивание 31 с гемоглобином бэта эмбриональным-2 Danio rerio (Данио рерио, аквариумная рыбка).
hemoglobin beta embryonic-2 [Danio rerio] Length=147
GENE ID: 405772 hbbe2 | hemoglobin beta embryonic-2 [Danio rerio] (10 or fewer PubMed links)
Score = 86.3 bits (212), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 48/124 (39%), Positives = 72/124 (59%), Gaps = 6/124 (4%)
Query |
24 |
EYGAEALERMFLSFPTTKTYFPHF-DLSHGSA-----QVKGHGKKVADALTNAVAHVDDM |
77 |
||||
Sbjct |
23 |
E G +AL+R + +P T+ YF F +L + A |
+V HG |
V |
L A+ ++DD+ |
82 |
|
EAGPKALQRALIVYPWTQRYFGSFGNLYNAEAIINNPKVAAHGTVVLHGLDRAMKNMDDI |
|||||||
Query |
78 |
PNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVL |
137 |
||||
Sbjct |
83 |
N |
+ LS LH+ KL VDP NF+LL+ CL + +A+ + A FT |
+ A+ |
KFLA V + L |
142 |
|
KNTYAELSVLHSEKLHVDPDNFRLLADCLTIVIASTMGAAFTADMQAAWQKFLAVVVSAL |
|||||||
Query |
138 |
TSKY |
141 |
|
|
|
|
Sbjct |
143 |
+Y |
146 |
|
|
|
|
QRQY |
|
|
|
|
Выравнивание 32 с белком LOC417972 Gallus gallus (Банкивский петух).
hypothetical protein LOC417972 [Gallus gallus] Length=151
GENE ID: 417972 GBE | eye-globin [Gallus gallus] (10 or fewer PubMed links)
Score = 83.2 bits (204), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 50/151 (34%), Positives = 77/151 (51%), Gaps = 13/151 (8%)
Query |
1 |
MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF------DLSHGSA |
54 |
|||||||
Sbjct |
1 |
M |
S A+ |
+ + AW K+ |
A + G |
L RMF P TK+YF HF |
+ |
S |
60 |
|
MSFSEAEVQSARGAWEKMYVDAEDNGTAVLVRMFTEHPDTKSYFTHFKGMDSAEEMKQSD |
||||||||||
Query |
55 |
QVKGHGKKVADALTNAVAHVDDMPNALSALSDL---HAHKLRVDPVNFKLLSHCLLVTLA |
111 |
|||||||
Sbjct |
61 |
QV+GHGK+V |
A+ + V H+D+ |
L L+ L |
HA +L++DP NF+++ |
|
+L |
118 |
||
QVRGHGKRVFTAINDMVQHLDNTEAFLGILNPLGQKHATQLKIDPKNFRIICDIILQL-- |
||||||||||
Query |
112 |
AHLPAEFTPAVHASLDKFLASVSTVLTSKYR |
142 |
|
|
|
||||
Sbjct |
119 |
+ |
+F |
AS +K |
+ T LT+ Y+ |
147 |
|
|
|
|
--MEEKFGGDCKASFEKVTNEICTHLTNIYK |
|
|
|
54
Выравнивание 33 с цитоглобином-2 Danio rerio (Данио рерио, аквариумная рыбка).
cytoglobin-2 [Danio rerio] Length=179
GENE ID: 554176 cygb2 | cytoglobin 2 [Danio rerio] (10 or fewer PubMed links)
Score = 80.9 bits (198), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 48/149 (33%), Positives = 78/149 (53%), Gaps = 11/149 (7%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLS-----HGSAQV |
56 |
||||
Sbjct |
19 |
L+ ++ +K |
W +V A + G |
L R F++FP+ K YF F D+ |
S+Q+ |
78 |
|
LTDVERGIIKDTWARVYASCEDVGVTILIRFFVNFPSAKQYFSQFQDMEDPEEMEKSSQL |
|||||||
Query |
57 |
KGHGKKVADALTNAVAHVDDMPNALSALSDL----HAHKLRVDPVNFKLLSHCLLVTLAA |
112 |
||||
|
|
+ H ++V +A+ |
V ++ D P |
+S++ L |
HA K +V+PV FK+LS +L |
LA |
|
Sbjct 79 RKHARRVMNAINTVVENLHD-PEKVSSVLVLVGKAHAFKYKVEPVYFKILSGVILEILAE 137
Query 113 HLPAEFTPAVHASLDKFLASVSTVLTSKY 141
FTP V S K +A++ +T Y
Sbjct 138 EFGECFTPEVQTSWSKLMAALYWHITGAY 166
Выравнивание 34 с цитоглобином Xenopus laevis (Обыкновенная шпорцевая лягушка).
cytoglobin [Xenopus laevis] Length=179
GENE ID: 447575 cygb | cytoglobin [Xenopus laevis] (10 or fewer PubMed links)
Score = 77.0 bits (188), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 42/149 (29%), Positives = 77/149 (52%), Gaps = 9/149 (6%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFD------LSHGSAQV |
56 |
|||||
Sbjct |
19 |
++ +++ +K W +V A+ |
+ G |
L R F++FP+ K +F F |
GS Q+ |
78 |
||
ITESERGVIKETWARVYANCEDVGVSILIRFFVNFPSAKQHFSQFKHMEDPLEMEGSVQL |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDD---MPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH |
113 |
|||||
Sbjct |
79 |
+ HG++V |
A+ + V ++ D |
+ |
LS + |
HA K +V+PV FK+L+ |
+L A |
138 |
RKHGRRVMGAVNSVVENLGDPEKVTTVLSIVGKSHALKHKVEPVYFKILTGVMLEVFAEE |
||||||||
Query |
114 |
LPAEFTPAVHASLDKFLASVSTVLTSKYR |
142 |
|
|
|||
Sbjct |
139 |
+FTP V |
+K + + + + S Y+ |
167 |
|
|
||
YAKDFTPDVQLVWNKLRSLIYSHVQSAYK |
|
|
Выравнивание 35 с цитоглобином Gallus gallus (Банкивский петух).
cytoglobin [Gallus gallus] Length=179
GENE ID: 427802 CYGB | cytoglobin [Gallus gallus] (10 or fewer PubMed links)
55
Score = 76.6 bits (187), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 44/149 (30%), Positives = 76/149 (52%), Gaps = 9/149 (6%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFD------LSHGSAQV |
56 |
|||||
Sbjct |
19 |
+S A+K |
++ |
W +V A+ |
+ G L R F++FP+ K YF F |
S Q+ |
78 |
|
ISDAEKKVIQETWSRVYANCEDVGVSILIRFFVNFPSAKQYFSQFKHMDDTLEMERSLQL |
||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDD---MPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH |
113 |
|||||
Sbjct |
79 |
+ H ++V |
A+ |
V ++DD + + L+ + |
HA K +V+PV FK L+ +L |
+A |
138 |
|
RKHAQRVMGAINTVVENLDDPEKVSSVLALVGKAHALKHKVEPVYFKKLTGVMLEVIAEA |
||||||||
Query |
114 |
LPAEFTPAVHASLDKFLASVSTVLTSKYR |
142 |
|
|
|||
Sbjct |
139 |
+FTP |
H + |
K |
+ T +T+ Y+ |
167 |
|
|
YGNDFTPEAHGAWTKMRTLIYTHVTAAYK |
|
|
Выравнивание 36 с цитоглобином Xenopus (Silurana) tropicalis (Водная лягушка силураны).
cytoglobin [Xenopus (Silurana) tropicalis] Length=179
GENE ID: 448640 cygb | cytoglobin [Xenopus (Silurana) tropicalis] (10 or fewer PubMed links)
Score = 76.3 bits (186), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 42/149 (29%), Positives = 78/149 (53%), Gaps = 9/149 (6%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFD------LSHGSAQV |
56 |
||||||
Sbjct |
19 |
++ +++ |
+K |
W +V A+ |
+ G |
L R F++FP+ K +F F |
GS Q+ |
78 |
|
ITESERGVIKETWARVYANCEDVGVSILIRFFVNFPSAKQHFSQFKHMEDPLEMEGSVQL |
|||||||||
Query |
57 |
KGHGKKVADALTNAVAHVDD---MPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH |
113 |
||||||
Sbjct |
79 |
+ H ++V |
A+ + V ++ D |
+ |
LS + |
HA K +VDPV FK+L+ |
+L +A |
138 |
|
RKHARRVMGAVNSVVENLGDPEKITTVLSIVGKSHALKHKVDPVYFKILTGVMLEVIAEE |
|||||||||
Query |
114 |
LPAEFTPAVHASLDKFLASVSTVLTSKYR |
142 |
|
|
||||
Sbjct |
139 |
+FTP V |
+ +K + + + + S Y+ |
167 |
|
|
|||
YAKDFTPDVQLAWNKLRSHLYSHVLSAYK |
|
|
Выравнивание 37 с цитоглобином Oncorhynchus mykiss (Радужная форель).
cytoglobin [Oncorhynchus mykiss] Length=177
GENE ID: 100136084 cygb | cytoglobin [Oncorhynchus mykiss]
Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 42/140 (30%), Positives = 70/140 (50%), Gaps = 9/140 (6%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHG------SAQV |
56 |
|
Sbjct |
19 |
L +++ +K W KV + + G L R+F++FP++K YF F |
SAQ+ |
78 |
LCDSEREMIKDTWAKVYQNCDDVGVAILIRLFVNFPSSKQYFSQFQQVEDPGELERSAQL |
56
Query |
57 |
KGHGKKVADALTNAVAHV---DDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH |
113 |
||
Sbjct |
79 |
+ H ++V +A+ |
V ++ |
D M + L + HA + V+PV FK+L +L L A |
138 |
RKHSRRVMNAINTLVENLHDGDKMVSVLKLVGKAHALRHNVEPVYFKILCGVILEVLVAD |
|||||
Query |
114 |
LPAEFTPAVHASLDKFLASV |
133 |
|
|
Sbjct |
139 |
P TP V + |
K L ++ |
158 |
|
FPDYITPEVAVAWTKLLDAI |
|
Выравнивание 38 с цитоглобином-2 Oryzias latipes (Медака, рыбка семейства оризиевых).
cytoglobin-2 [Oryzias latipes] Length=194
GENE ID: 100049377 cygb-2 | cytoglobin-2 [Oryzias latipes] (10 or fewer PubMed links)
Score = 74.3 bits (181), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 44/148 (30%), Positives = 75/148 (51%), Gaps = 9/148 (6%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLS-----HGSAQV |
56 |
|||
Sbjct |
34 |
LS A+ ++ |
WG V + |
+ G L R F++FP+ K YF F D+ |
S+Q+ |
93 |
LSDAEMEIIQHTWGHVYKNCEDVGVSVLIRFFVNFPSAKQYFSQFQDMQDPEEMEKSSQL |
||||||
Query |
57 |
KGHGKKVADALTNAVAHVDD---MPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH |
113 |
|||
|
|
+ H ++V +A+ |
V ++ D |
+ + L+ + HA K +V+P+ FK+ S +L |
L+ |
|
Sbjct |
94 |
RQHARRVMNAINTVVENLQDPEKVSSVLALVGKAHAVKHKVEPIYFKIXSGVMLSVLSED 153 |
||
Query |
114 |
LPAEFTPAVHASLDKFLASVSTVLTSKY |
141 |
|
Sbjct |
154 |
P FT V |
K +A+V +T Y |
181 |
FPEFFTAEVQLVWTKLMAAVYWHVTGAY |
Выравнивание 39 с цитоглобином-1 Danio rerio (Данио рерио, аквариумная рыбка).
cytoglobin-1 [Danio rerio] Length=174
GENE ID: 246090 cygb1 | cytoglobin 1 [Danio rerio] (10 or fewer PubMed links)
Score = 72.4 bits (176), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 46/149 (31%), Positives = 72/149 (49%), Gaps = 10/149 (6%)
Query |
3 |
LSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF-DLS-----HGSAQV |
56 |
||||
Sbjct |
16 |
L+ D ++ W |
V A |
G L R F +FP+ K YF HF +L |
+AQ+ |
75 |
|
LTEEDVCVIQDTWKPVYAERDNAGVAVLVRFFTNFPSAKQYFEHFRELQDPAEMQQNAQL |
|||||||
Query |
57 |
KGHGKKVADALTNAVAHV---DDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAH |
113 |
||||
Sbjct |
76 |
K HG++V +AL |
V ++ |
D + |
+ + HA + +VDPV FK+L+ +L |
L |
135 |
KKHGQRVLNALNTLVENLRDADKLNTIFNQMGKSHALRHKVDPVYFKILAGVILEVLVEA |
57
Query |
114 |
LPAEFTPA-VHASLDKFLASVSTVLTSKY |
141 |
Sbjct |
136 |
P F+PA V +S K + + + Y |
164 |
FPQCFSPAEVQSSWSKLMGILYWQMNRVY |
КОНТРОЛЬНЫЕ ВОПРОСЫ
1.Какие существуют два метода количественного измерения расстояния между двумя данными строками последовательностей?
2.Какие бывают виды штрафов за делеции?
3.Что такое матрица РАМ?
4.Чему соответствует расстояние между последовательностями в 1 РАМ?
5.Показать, чему равна вероятность мутации замены V↔M,если в матрице РАМ250 соответствующая цена мутации равна +2?
6.В чём отличие принципа построения матриц BLOSUM от РАМ?
7.Для выравнивания №39 альфа-субъединицы гемоглобина человека HBA_HUMAN с цитоглобином-1 Danio rerio вычислить с помощью матрицы BLOSUM80 и аффинного штрафа за
пропуски γ(g) = −d −(g −1)e (при d =8 и e = 4) счёт следующего фрагмента выравнивания
GAEALERMFLSFPTTKTYFPHF-DLS-----HGSAQV
GVAVLVRFFTNFPSAKQYFEHFRELQDPAEMQQNAQL
3.АЛГОРИТМЫ ВЫРАВНИВАНИЯ
3.1.АЛГОРИТМ ДИНАМИЧЕСКОГО ПРОГРАММИРОВАНИЯ
Алгоритм глобального оптимального выравнивания двух последо-
вательностей, дающий максимальное количество баллов (максимальный вес (счёт, score)) основан на математическом методе, называемом динами-
ческим программированием.
58
Основное достоинство этого метода состоит в том, что он гарантирует глобальный оптимум: наилучший результат выравнивания при заданном наборе параметров – матрице замещений и штрафных значениях для пропусков – без каких-либо приближений.
Основной недостаток метода состоит в том, что многие выравнивания двух данных последовательностей могут привести к оптимальному числу баллов, при этом совершенно не обязательно, что хотя бы одно из них имеет отношение к биологически корректному выравниванию (имеет биологический смысл). Например, при сравнении последовательностей α- и β-цепей гемоглобина цыпленка В. Фитч и Т. Смит (W. Fitch и T. Smith) нашли 17 выравниваний, каждое давало одинаковое оптимальное число баллов, из которых корректным, (используя дополнительную информацию о пространственной структуре белков) оказалось только одно. И вообще, оказалось, что для этой задачи существует 1317 выравниваний, которые дают число баллов в пределах 5% от оптимума.
Есть еще один недостаток – время, требуемое для выравнивания двух последовательностей длиной п и т пропорционально размеру редактируемой матрицы, то есть, пропорционально произведению m ×n . Вычислительную сложность алгоритма обозначают O( f ) . Поскольку обычно п и т одного порядка, про алгоритм говорят, что он требует
O(n2 ) времени (или памяти). В области анализа биологических после-
довательностей обычными компьютерами (а не суперкомпьютерами)
алгоритмы O(n2 ) дают удовлетворительные результаты, а вот алгоритмы
O(n3 ) возможно применять только для очень коротких последователь-
ностей.
Таким образом, метод динамического программирования будет слишком медленным при поиске соответствия для пробной последовательности в полной базе данных последовательностей, и ещё меньше он подходит для выравниваний "все-против-всех". Проблема поиска в базе данных – это на самом деле проблема поиска соответствия
59