Sequence Similarity Clusters for the Entities in PDB 4HG4

Entity #1 | Chains: A,B,C,D,E,F,G,H,I
Hemagglutinin HA1 protein, length: 327 (BLAST)
Sequence Similarity Cutoff    Rank    Chains in Cluster    Cluster ID / Name
100 %                         2       2                    5065
95 %                          10      10                   2257
90 %                          11      11                   1938
70 %                          81      82                   142
50 %                          135     136                  57
40 %                          237     248                  15
30 %                          243     254                  26
Entity #2 | Chains: a,b,c,d,e,f,g,h,i
Hemagglutinin HA2 protein, length: 174 (BLAST)
Sequence Similarity Cutoff    Rank    Chains in Cluster    Cluster ID / Name
100 %                         7       7                    2495
95 %                          11      11                   1908
90 %                          82      83                   119
70 %                          137     140                  28
50 %                          233     246                  11
40 %                          241     256                  14
30 %                          241     256                  25
Entity #3 | Chains: J,L,N,P,R,T,V,X,Z
Fab 2G1 heavy chain protein, length: 223 (BLAST)
Sequence Similarity Cutoff    Rank    Chains in Cluster    Cluster ID / Name
100 %                         1       1                    5937
95 %                          1       1                    6516
90 %                          19      19                   1502
70 %                          1610    1782                 1
50 %                          3323    3677                 1
40 %                          3324    3678                 1
30 %                          3933    4338                 1
Entity #4 | Chains: K,M,O,Q,S,U,W,Y,z
Fab 2G1 light chain protein, length: 214 (BLAST)
Sequence Similarity Cutoff    Rank    Chains in Cluster    Cluster ID / Name
100 %                         1       1                    5969
95 %                          1       1                    6545
90 %                          406     461                  4
70 %                          1375    1519                 3
50 %                          3324    3677                 1
40 %                          3325    3678                 1
30 %                          3934    4338                 1
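
Each data row in the tables above carries four fields: the percentage cutoff, the rank, the number of chains in the cluster, and the cluster ID. As a minimal sketch for reading such rows programmatically, the Python below parses one row into a small record. The names `ClusterRow`, `parse_cluster_row`, and `parse_cluster_rows` are hypothetical helpers introduced here for illustration, and the row layout is assumed from the listing itself rather than taken from any RCSB PDB specification.

```python
import re
from dataclasses import dataclass

# Assumed row layout, inferred from the listing:
#   "<cutoff> % <rank> <chains in cluster> <cluster id>"
# Any amount of whitespace may separate the fields.
ROW_PATTERN = re.compile(r"^\s*(\d+)\s*%\s+(\d+)\s+(\d+)\s+(\d+)\s*$")


@dataclass(frozen=True)
class ClusterRow:
    """One row of a sequence similarity cluster table (hypothetical record)."""
    cutoff_percent: int
    rank: int
    chains_in_cluster: int
    cluster_id: int


def parse_cluster_row(line: str) -> ClusterRow:
    """Parse a single data row; raise ValueError if the line is not a row."""
    match = ROW_PATTERN.match(line)
    if match is None:
        raise ValueError(f"not a cluster row: {line!r}")
    return ClusterRow(*(int(group) for group in match.groups()))


def parse_cluster_rows(text: str) -> list:
    """Collect every data row in a block of text, skipping headers and titles."""
    return [
        parse_cluster_row(line)
        for line in text.splitlines()
        if ROW_PATTERN.match(line)
    ]
```

For example, `parse_cluster_row("70 % 1610 1782 1")` yields a record with `cutoff_percent=70`, `rank=1610`, `chains_in_cluster=1782`, and `cluster_id=1`. Entity headers and column titles do not match the pattern, so `parse_cluster_rows` skips them.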

Instructions

In the table for each entity, view a list of similar sequences by selecting the link associated with the percentage cutoff.



More detailed documentation on the redundancy reduction and sequence clustering procedure is available from RCSB PDB.