Sequence Similarity Clusters for the Entities in PDB 4HG4

Entity #1 | Chains: A,B,C,D,E,F,G,H,I
Hemagglutinin HA1 protein, length: 327 (BLAST)
Sequence Similarity Cutoff Rank Chains in Cluster Cluster ID / Name
100 % 2 2 5208
95 % 10 10 2317
90 % 11 11 1983
70 % 83 84 141
50 % 146 147 54
40 % 245 257 15
30 % 251 263 26
Entity #2 | Chains: a,b,c,d,e,f,g,h,i
Hemagglutinin HA2 protein, length: 174 (BLAST)
Sequence Similarity Cutoff Rank Chains in Cluster Cluster ID / Name
100 % 7 7 2557
95 % 11 11 1957
90 % 84 85 122
70 % 145 148 28
50 % 241 255 11
40 % 249 265 14
30 % 249 265 25
Entity #3 | Chains: J,L,N,P,R,T,V,X,Z
Fab 2G1 heavy chain protein, length: 223 (BLAST)
Sequence Similarity Cutoff Rank Chains in Cluster Cluster ID / Name
100 % 1 1 6097
95 % 1 1 6659
90 % 19 19 1540
70 % 1671 1850 1
50 % 3445 3813 1
40 % 3446 3814 1
30 % 4117 4537 1
Entity #4 | Chains: K,M,O,Q,S,U,W,Y,z
Fab 2G1 light chain protein, length: 214 (BLAST)
Sequence Similarity Cutoff Rank Chains in Cluster Cluster ID / Name
100 % 1 1 6129
95 % 1 1 6688
90 % 417 476 4
70 % 1418 1568 3
50 % 3446 3813 1
40 % 3447 3814 1
30 % 4118 4537 1

Instructions

In the table for each entity, view a list of similar sequences by selecting the link associated with the percentage cutoff.



View more detailed documentation on the redundancy reduction and sequence clustering procedure used by RCSB PDB.