Sequence Similarity Clusters for the Entities in PDB 4HG4

Entity #1 | Chains: A,B,C,D,E,F,G,H,I
Hemagglutinin HA1 protein, length: 327 (BLAST)
Sequence Similarity Cutoff    Rank    Chains in Cluster    Cluster ID / Name
100 %                         2       2                    6092
95 %                          10      10                   2736
90 %                          11      11                   2345
70 %                          83      85                   157
50 %                          153     160                  80
40 %                          288     307                  17
30 %                          295     314                  26
Entity #2 | Chains: a,b,c,d,e,f,g,h,i
Hemagglutinin HA2 protein, length: 174 (BLAST)
Sequence Similarity Cutoff    Rank    Chains in Cluster    Cluster ID / Name
100 %                         7       7                    2976
95 %                          11      11                   2306
90 %                          84      86                   138
70 %                          152     161                  45
50 %                          284     305                  11
40 %                          292     315                  15
30 %                          292     315                  27
Entity #3 | Chains: J,L,N,P,R,T,V,X,Z
Fab 2G1 heavy chain protein, length: 223 (BLAST)
Sequence Similarity Cutoff    Rank    Chains in Cluster    Cluster ID / Name
100 %                         1       1                    7112
95 %                          1       1                    7678
90 %                          28      28                   1215
70 %                          1985    2238                 2
50 %                          4099    4619                 1
40 %                          4099    4619                 1
30 %                          4861    5440                 1
Entity #4 | Chains: K,M,O,Q,S,U,W,Y,z
Fab 2G1 light chain protein, length: 214 (BLAST)
Sequence Similarity Cutoff    Rank    Chains in Cluster    Cluster ID / Name
100 %                         1       1                    7143
95 %                          1       1                    7708
90 %                          617     697                  3
70 %                          2021    2280                 1
50 %                          4100    4619                 1
40 %                          4100    4619                 1
30 %                          4862    5440                 1

Instructions

In the table for each entity, view a list of similar sequences by selecting the link associated with the percentage cutoff.
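
When this listing is read as plain text rather than through the links, the table rows can also be read programmatically. The following is a minimal sketch, assuming each data row has the form "<cutoff> % <rank> <chains in cluster> <cluster ID>", in the column order given by the table header. The names ClusterRow and parse_cluster_rows are hypothetical and are not part of any RCSB PDB tool.

```python
from typing import List, NamedTuple


class ClusterRow(NamedTuple):
    """One table row; field order follows the header (assumed layout)."""
    cutoff_percent: int
    rank: int
    chains_in_cluster: int
    cluster_id: int


def parse_cluster_rows(text: str) -> List[ClusterRow]:
    """Collect rows shaped like '100 % 2 2 6092'; skip every other line."""
    rows = []
    for line in text.splitlines():
        tokens = line.split()
        # A data row has five whitespace-separated tokens, the second being '%'.
        if len(tokens) != 5 or tokens[1] != "%":
            continue
        numbers = [tokens[0]] + tokens[2:]
        if all(t.isdigit() for t in numbers):
            rows.append(ClusterRow(*(int(t) for t in numbers)))
    return rows


# Sample copied from the Entity #1 table in this listing.
sample = """Entity #1 | Chains: A,B,C,D,E,F,G,H,I
Sequence Similarity Cutoff Rank Chains in Cluster Cluster ID / Name
100 % 2 2 6092
95 % 10 10 2736
"""
rows = parse_cluster_rows(sample)
```

Because the parser splits on any run of whitespace, it does not depend on how the columns are aligned. Entity headings and column headers are skipped because they do not match the row pattern.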



More detailed documentation on the redundancy reduction and sequence clustering procedure is available from RCSB PDB.