Sequence Similarity Clusters for the Entities in PDB 4HCA

Entity #1 | Chains: A
Trans-acting T-cell-specific transcription factor GATA-3 protein, length: 115
Sequence Similarity Cutoff    Rank    Chains in Cluster    Cluster ID / Name
100 %                         3       3                    20532
95 %                          3       3                    17647
90 %                          3       3                    17186
70 %                          5       5                    8740
50 %                          5       5                    7972
40 %                          5       5                    7343
30 %                          5       5                    6519
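
The rows for Entity #1 can be read into structured records for further processing. The sketch below is illustrative only: it assumes each row lists, in order, the cutoff percentage, rank, chains in cluster, and cluster ID, as the column header suggests. The names `ClusterRow` and `parse_cluster_rows` are hypothetical and not part of any RCSB PDB tool.

```python
import re
from typing import NamedTuple


class ClusterRow(NamedTuple):
    """One table row; field order is assumed from the column header."""
    cutoff_percent: int
    rank: int
    chains_in_cluster: int
    cluster_id: int


# Rows copied from the Entity #1 table.
RAW_ROWS = """\
100 % 3 3 20532
95 % 3 3 17647
90 % 3 3 17186
70 % 5 5 8740
50 % 5 5 7972
40 % 5 5 7343
30 % 5 5 6519
"""

# A percentage followed by three whitespace-separated integers.
ROW_PATTERN = re.compile(r"^\s*(\d+)\s*%\s+(\d+)\s+(\d+)\s+(\d+)\s*$")


def parse_cluster_rows(text: str) -> list[ClusterRow]:
    """Return one ClusterRow per line that matches the row pattern."""
    rows = []
    for line in text.splitlines():
        match = ROW_PATTERN.match(line)
        if match:
            rows.append(ClusterRow(*(int(g) for g in match.groups())))
    return rows


rows = parse_cluster_rows(RAW_ROWS)

# Map each cutoff to the number of chains in the cluster at that cutoff.
chains_by_cutoff = {r.cutoff_percent: r.chains_in_cluster for r in rows}
```

In these rows the cluster holds 3 chains at cutoffs of 90 % and above and 5 chains at 70 % and below, and the cluster ID differs at every cutoff.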
Entity #2 | Chains: X
DNA (5'-D(*AP*AP*TP*GP*TP*CP*CP*AP*TP*CP*TP*GP*AP*TP*AP*AP*GP*AP*CP*G)-3') dna, length: 20
Sequence Similarity Cutoff Rank Chains in Cluster Cluster ID / Name
Entity #3 | Chains: Y
DNA (5'-D(*TP*TP*CP*GP*TP*CP*TP*TP*AP*TP*CP*AP*GP*AP*TP*GP*GP*AP*CP*A)-3') dna, length: 20
Sequence Similarity Cutoff Rank Chains in Cluster Cluster ID / Name
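
The DNA entity names for chains X and Y spell out each nucleotide in the form `*AP*AP*TP...`, which is hard to read at a glance. The sketch below converts that notation to a plain base string, assuming only the standard bases A, C, G, and T appear and that a trailing `P` on a token marks the phosphate link to the next nucleotide. The function names are hypothetical helpers written for this illustration.

```python
import re

ENTITY_2_NAME = "DNA (5'-D(*AP*AP*TP*GP*TP*CP*CP*AP*TP*CP*TP*GP*AP*TP*AP*AP*GP*AP*CP*G)-3')"
ENTITY_3_NAME = "DNA (5'-D(*TP*TP*CP*GP*TP*CP*TP*TP*AP*TP*CP*AP*GP*AP*TP*GP*GP*AP*CP*A)-3')"

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def pdb_dna_to_sequence(name: str) -> str:
    """Extract a 5'->3' base string from a D(*AP*...*G) style name."""
    match = re.search(r"D\(([^)]*)\)", name)
    if match is None:
        raise ValueError("no D(...) sequence block found")
    bases = []
    for token in match.group(1).split("*"):
        if not token:
            continue  # skip the empty piece before the first '*'
        # Drop the trailing 'P' (assumed phosphate marker) when present.
        base = token[:-1] if len(token) > 1 and token.endswith("P") else token
        if base not in COMPLEMENT:
            raise ValueError(f"unsupported residue token: {token!r}")
        bases.append(base)
    return "".join(bases)


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA base string."""
    return "".join(COMPLEMENT[base] for base in reversed(sequence))


seq_x = pdb_dna_to_sequence(ENTITY_2_NAME)
seq_y = pdb_dna_to_sequence(ENTITY_3_NAME)
```

Both strings come out 20 bases long, matching the stated lengths. Comparing `reverse_complement(seq_x)` with `seq_y` shows the two strands are complementary with a two-base offset: the first 18 bases of the reverse complement of X equal Y from its third base onward.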

Instructions

In the table for each entity, view a list of similar sequences by selecting the link associated with the percentage cutoff.



View more detailed documentation on the redundancy reduction and sequence clustering procedure used by RCSB PDB.