Frustratometer classes
The frustratometer package provides a handful of classes used to encapsulate the data.
- class frustratometer.Structure[source]
Bases:
object
- classmethod full_pdb(pdb_file: Path | str, chain: str | None = None, aligned_sequence: str = None, filtered_aligned_sequence: str = None, distance_matrix_method: str = 'CB', pdb_directory: Path = PosixPath('/home/docs/checkouts/readthedocs.org/user_builds/frustratometer/checkouts/latest/docs'), repair_pdb: bool = False)[source]
Generates structure object
- Parameters:
pdb_file (str) – PDB file path
chain (str) – PDB chain name. If “chain=None”, all chains will be included.
distance_matrix_method (str) – The method to use for calculating the distance matrix. Defaults to ‘CB’, which uses the CB atom for all residues except GLY, which uses the CA atom. Other options are ‘CA’ for using only the CA atom, and ‘minimum’ for using the minimum distance between all atoms in each residue.
pdb_directory (str) – Directory where repaired pdb will be downloaded
repair_pdb (bool) – If False, provided pdb file will be repaired with missing residues inserted and heteroatoms removed.
- Return type:
Structure object
- classmethod spliced_pdb(pdb_file: Path | str, chain: str, seq_selection: str, aligned_sequence: str = None, filtered_aligned_sequence: str = None, distance_matrix_method: str = 'CB', repair_pdb: bool = True, pdb_directory: str = '/home/docs/checkouts/readthedocs.org/user_builds/frustratometer/checkouts/latest/docs')[source]
Generates substructure object
- Parameters:
pdb_file (str) – PDB file path
chain (str) – PDB chain name
seq_selection (str) – Subsequence selection command, using Prody select module.
*If wanting to use original PDB indexing, set seq_selection as: “resnum {initial_index}to{final_index}”
*If wanting to use absolute PDB indexing (first residue has index=0), set seq_selection as: “resindex {initial_index}to{final_index}”
Note that using “to” will create a substructure including both the initial and final designated residue. If you would like to not include the final desginated residue, replace “to” with “:”
Note that the algorithm will account for any gaps in the PDB and adjust the provided residue range accordingly.
distance_matrix_method (str) – The method to use for calculating the distance matrix. Defaults to ‘CB’, which uses the CB atom for all residues except GLY, which uses the CA atom. Other options are ‘CA’ for using only the CA atom, and ‘minimum’ for using the minimum distance between all atoms in each residue.
pdb_directory (str) – Directory where repaired pdb will be downloaded
repair_pdb (bool) – If True, provided pdb file will be repaired with missing residues inserted and heteroatoms removed.
- Return type:
Substructure object
- class frustratometer.AWSEM(pdb_structure, expose_indicator_functions=False, **parameters)[source]
Bases:
Frustratometer
- q = 20
- aa_map_awsem_list = [0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18]
- aa_map_awsem_x = array([[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0], [ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0], [ 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4], [ 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3], [ 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6], [13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13, 13], [ 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7], [ 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8, 8], [ 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9, 9], [11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11], [10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10], [12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12], [ 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2], [14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14, 14], [ 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5], [ 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1], [15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15], [16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16], [19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19, 19], [17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17], [18, 18, 18, 18, 18, 18, 18, 18, 18, 18, 18, 18, 18, 18, 18, 18, 18, 18, 18, 18, 18]])
- aa_map_awsem_y = array([[ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18], [ 0, 0, 4, 3, 6, 13, 7, 8, 9, 11, 10, 12, 2, 14, 5, 1, 15, 16, 19, 17, 18]])
- auc()
Computes area under the curve of the receiver-operating characteristic. Function intended
- frustration(sequence: str = None, kind: str = 'singleresidue', mask: array = None, aa_freq: array = None, correction: int = 0)
- native_energy(sequence: str = None, ignore_couplings_of_gaps: bool = False, ignore_fields_of_gaps: bool = False)
- plot_roc()
- roc()
- scores()
- view_frustration(sequence: str = None, single: str = 'singleresidue', pair: str = 'mutational', aa_freq: array = None, correction: int = 0)
- class frustratometer.DCA[source]
Bases:
Frustratometer
- classmethod from_distance_matrix(potts_model: dict, distance_matrix: array, sequence: str, sequence_cutoff: float | None = None, distance_cutoff: float | None = None)[source]
- classmethod from_potts_model_file(pdb_structure, potts_model_file: str, reformat_potts_model: bool = False, sequence_cutoff: float | None = None, distance_cutoff: float | None = None)[source]
- classmethod from_pottsmodel(potts_model: dict, pdb_file: str | None = None, chain: str | None = None, sequence_cutoff: float | None = None, distance_cutoff: float | None = None, distance_matrix_method='minimum')[source]
- classmethod from_pfam_alignment(sequence: str, PFAM_ID: str, pdb_file: str, pdb_chain: str, download_all_alignment_files: bool, alignment_files_directory: str, sequence_cutoff: float | None = None, distance_cutoff: float | None = None, distance_matrix_method='minimum')[source]
- classmethod from_hmmer_alignment(sequence: str, PFAM_ID: str, pdb_file: str, pdb_chain: str, download_all_alignment_files: bool, alignment_files_directory: str, alignment_output_file: bool, alignment_sequence_database: str, sequence_cutoff: float | None = None, distance_cutoff: float | None = None, distance_matrix_method='minimum')[source]
- classmethod from_alignment(alignment: str, pdb_file: str, chain: str, sequence_cutoff: float | None = None, distance_cutoff: float | None = None, distance_matrix_method='minimum')[source]
- property sequence
- property pdb_file
- property pdb_name
Returns PDBid from pdb name
- property chain
- property pfamID
Returns pfamID from pdb name
- property alignment_type
- property alignment_sequence_database
- property download_all_alignment_files
- property alignment_files_directory
- property alignment_output_file
- property sequence_cutoff
- property distance_cutoff
- auc()
Computes area under the curve of the receiver-operating characteristic. Function intended
- frustration(sequence: str = None, kind: str = 'singleresidue', mask: array = None, aa_freq: array = None, correction: int = 0)
- native_energy(sequence: str = None, ignore_couplings_of_gaps: bool = False, ignore_fields_of_gaps: bool = False)
- plot_roc()
- roc()
- scores()
- view_frustration(sequence: str = None, single: str = 'singleresidue', pair: str = 'mutational', aa_freq: array = None, correction: int = 0)
- vmd(sequence: str = None, single: str = 'singleresidue', pair: str = 'mutational', aa_freq: array = None, correction: int = 0, max_connections: int = 100)
- property distance_matrix_method
- property potts_model_file
- property potts_model
- class frustratometer.Map(map_array)[source]
Bases:
object
- classmethod from_sequences(sequence_a, sequence_b, substitution_matrix='BLOSUM62', match_score=2, mismatch_score=-1, open_gap_score=-0.5, extend_gap_score=-0.1, target_end_gap_score=-0.01, query_end_gap_score=-0.01)[source]
- property map_array