ARTINA structure determination

Job ID: 57f94036879b4bde9ef0076dc7f3ba38


Job date/time
Job submittedApril 5, 2022, 3:37 p.m.
Job startedApril 5, 2022, 3:50 p.m.
Job completedApril 5, 2022, 10:18 p.m.
Job call details
App identifierapp_csa_struct_calc
Queuepublic
Job status finished
Job hardware details
CPU credits420.0
GPU credits0.0

Download all job results: results.zip



Automated peak picking results

[print table]

Spectrum name Number of peaks Score
(Strong / Weak)
Download
[F] - original signal coordinates (folded peaks), [U] - unfolded peak lists
Expected Strong Weak All Precision Recall F1 Fm Strong Weak All
C13NOESY_@ALI_@POS - 3041 3893 6362 - - - - CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
C13NOESY_@ARO - 183 250 596 - - - - CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
CBCANH 354 743 1055 2118 0.666 / 0.5 1.398 / 1.49 0.903 / 0.748 0.965 / 0.863 CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
CBCAcoNH 208 457 680 1245 0.69 / 0.497 1.515 / 1.626 0.948 / 0.762 1.022 / 0.899 CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
CCHTOCSY_@ALI 1419 1566 2228 5587 0.973 / 0.775 1.074 / 1.216 1.021 / 0.947 1.022 / 0.971 CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
HBHAcoNH 249 517 726 1613 0.704 / 0.531 1.461 / 1.549 0.95 / 0.791 1.014 / 0.907 CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
N15HSQC 131 162 223 405 0.966 / 0.762 1.194 / 1.297 1.068 / 0.96 1.074 / 0.994 CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
N15NOESY_@NEG - 1714 2145 3676 - - - - CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]
CSV [U], PEAKS [U], LIST [U]
CSV [F], PEAKS [F], LIST [F]

Table 1. Results of automated peak picking. Peak lists with low F1 score (< 0.25) are highlighed in red. Frequently, low F1 scores indicate wrong spectrum annotations in the project summary, such as axes transposition (e.g. C-H vs. H-C) or wrong experiment type specification.



Peak picking results compared to 1000+ spectra benchmark

Figure 1 is available only for projects with known protein sequence and non-NOESY spectra types.

Figure 1. Evaluation of automated peak picking output in reference to a benchmark of 1000+ NMR spectra. Each box plot presents the distribution of the F1 score calculated for all benchmark spectra of the given type. Red dots present F1 scores calculated for the output of this peak picking application call. Relative comparison of F1 scores serves as a quality verification of the peak picking output. Scores above the median (orange line) indicate that the automated peak picking routine was able to process a spectrum more accurately than 50% of the spectra in the benchmark. Low F1 scores might be related to a low signal-to-noise ratio of the spectrum or to a wrong definition of the spectrum type in the project summary.




Data consistency check: spectra referencing

Proper spectra referencing is a major precondition for automated shift assignment and structure calculation. The table below presents results of the CYANA PeakMatch algorithm (Buchner et al., 2013), which has been executed for each pair of spectra that share at least two axes. Each row in the table contains an estimate of the relative shift between target and reference spectrum. The score presented in the last column (Equation 2, Buchner et al., 2013) represents the confidence of shift identification (the higher the better).

The relative shifts listed in the table below should be verified manually and corrected before submitting an automated shift assignment or structure calculation job. They can be applied to the target spectrum using specific tags (e.g. w1:0.025 to increase the chemical shift coordinates in the 1st dimension of the spectrum and peak list by 0.025 ppm) to define spectrum shifts in the project summary. It is not necessary to repeat automated peak picking after entering such tags in the project summary. The system will apply these corrections automatically.


[print table]

Compared peak lists Relative shift Score
Reference spectrum Target spectrum
HBHAcoNH N15HSQC N=0.1, H=0.015 1 1.24
N15HSQC N15NOESY_@NEG N=0.0, HN=-0.007 1 0.3
CBCAcoNH N15HSQC N=0.1, H=0.007 1 1.78
CBCANH N15HSQC N=-0.1, H=0.007 1 1.35

1 Non-zero shifts are significant if greater than 0.2 ppm for 13C or 15N, or 0.015 ppm for 1H.
2 Relative referencing with scores below 0.1 is uncertain.

Table 2. Results of automated spectra referencing. Rows presenting possible referencing errors are highlighted in red. The relative shift is reported for spectra having at least 25 strong cross-peaks (Table 1).




Automated shift assignment results


Figure 1.The diagram illustrates atoms in the protein sequence with color-coded chemical shift information. Dark-blue rectangles represent atoms assigned confidently by the FLYA algorithm, whereas for light-blue ones the assignment is uncertain. The row labeled HN/Hα shows for each residue HN on the left and Hα in the center. The N/Cα/C' row shows for each residue the N, Cα, and C' assignments from left to right. The rows βη show the sidechain assignments for the heavy atoms in the center and for the hydrogen atoms to the left and right. In the case of branched side-chains, the corresponding row is split into an upper part for one branch and a lower part for the other branch.


[print table]

Spectrum name Expected Assigned expected Measured Assigned measured Ratio Assigned peak lists
C13NOESY_@ALI_@POS_@FLYA 12050 4756 (39.47%) 4709 2851 (60.54%) 1.7 XEASY Sparky
C13NOESY_@ARO_@FLYA 608 246 (40.46%) 329 133 (40.43%) 1.8 XEASY Sparky
CBCANH_@FLYA 352 269 (76.42%) 531 241 (45.39%) 1.1 XEASY Sparky
CBCAcoNH_@FLYA 176 147 (83.52%) 312 144 (46.15%) 1.0 XEASY Sparky
CCHTOCSY_@ALI_@FLYA 1419 825 (58.14%) 1544 718 (46.5%) 1.1 XEASY Sparky
HBHAcoNH_@FLYA 249 167 (67.07%) 373 148 (39.68%) 1.1 XEASY Sparky
N15HSQC_@FLYA 119 104 (87.39%) 196 95 (48.47%) 1.1 XEASY Sparky
N15NOESY_@NEG_@FLYA 3351 1889 (56.37%) 2356 1267 (53.78%) 1.5 XEASY Sparky
ALL 18324 8403 (45.86%) 10350 5597 (54.08%) 1.5

Table 1. Results of the automated shift assignment. Expected: Number of peaks expected in the spectrum. Assigned expected: Number of expected peaks that are assigned that to a measured (picked) peak. The percentage is relative to the number of expected peaks. Measured: Number of measured (picked) peaks. Assigned measured: Number of measured peaks that have one or more expected peaks assigned to it. The percentage is relative to the number of measured peaks. Ratio: The average number of expected peaks that are assigned to the same measured peak. The average is taken over all measured peaks to which at least on expected peak is assigned. Color coding: Rows corresponding to spectra with low percentage of assigned peaks (<15%) are highlighted in red. The most common factors associated with low percentage of assigned peaks are: (a) wrong spectra axes annotation (e.g. transposition of H-HC axes in C13NOESY, more information), (b) systematic shift between spectra (e.g. HCCHTOCSY shifted in C dimension by -2.7 ppm in reference to C13HSQC), (c) low number of signals that are visible in the spectrum (compared to number of signals that are expected to be present in the spectrum given the protein sequence), (d) inverted axis (inverted ppm scale in one of spectra dimensions).


Automated structure calculation results

Fig. Superposition of two structure proposals (proposal 1, proposal 2) determined by NMRtist (click image to show protein visualization at full scale). Structures are shown in three different orientations. The parts of the structure that are not well-defined are drawn with transparency. The two proposals are calculated with a different number of NOESY cross peaks. Download aligned structure proposals [combined.pdb] . If a "Test run" was executed, NMRtist may output only a single structure proposal.





PropertyProposal 1Proposal 2
Structure structure.pdb structure.pdb
Distance restraints restraints.upl restraints.upl
Structure calculation details cyanatable.txt cyanatable.txt
Total number of distance restraints 2399 2383
Number of intraresidual restraints (|i-j| = 0) 547 542
Number of sequential restraints (|i-j| = 1) 616 621
Number of medium-range restraints (1 < |i-j| < 5) 565 563
Number of long-range restraints (|i-j| ≥ 5) 671 657
Number of torsion angle restraints 140 140
CYANA target function value 2.67 � 0.09 +� 3.10 � 0.06 +�
Distance restraint violations > 0.2 Å 8 � 2 5 � 1
Maximal distance restraint violation 0.20 � 0.03 + 0.19 � 0.05 +
Angle restraint violations > 5.0° 4 � 0 1 � 0
Maximal angle restraint violation 10.28 � 0.82 � 7.14 � 0.35 �
Residues in most favored Ramachandran plot regions 82.0 % 85.1 %
Residues in additionally allowed regions 17.9 % 14.9 %
Residues in generously allowed regions 0.1 % 0.0 %
Residues in disallowed regions 0.0 % 0.0 %

Table 1. Structure calculation statistics