C5320 Theoretical Concepts of NMR
Luk ´aˇs ˇZ´ıdek
May 27, 2022
ii
Contents
How to use this text 1
I Classical Introduction 3
Before we start:
Classical electromagnetism 5
0.1 Electric ﬁeld, electric charge, electric dipole . . . . . . . . . . . . . . . . . . . . . . . 5
0.2 Magnetic ﬁeld and magnetic dipole . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
0.3 Source of the electric ﬁeld . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
0.4 Origin of the magnetic ﬁeld . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
0.5 Electrodynamics and magnetodynamics . . . . . . . . . . . . . . . . . . . . . . . . . . 9
0.6 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
0.6.1 Potential energy of an electric dipole . . . . . . . . . . . . . . . . . . . . . . . 10
0.6.2 Current loop as a magnetic dipole . . . . . . . . . . . . . . . . . . . . . . . . . 10
0.6.3 Precession . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
0.6.4 Electromotive force (voltage) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Nuclear magnetic resonance 15
1.1 Nuclear magnetic moments in chemical substances . . . . . . . . . . . . . . . . . . . . 15
1.2 Polarization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.3 Coherence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.4 Chemical shift . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
1.5 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
1.5.1 Calculating averages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
1.5.2 Polarization and bulk magnetization . . . . . . . . . . . . . . . . . . . . . . . 25
1.5.3 Changing Cartesian coordinate frame . . . . . . . . . . . . . . . . . . . . . . . 26
1.5.4 Rotation in complex representation . . . . . . . . . . . . . . . . . . . . . . . . 29
1.5.5 Rotating coordinate frame . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
1.5.6 Chemical shift tensor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
1.5.7 Oﬀset eﬀects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
1.5.8 Evolution of magnetization in B0 . . . . . . . . . . . . . . . . . . . . . . . . . 34
1.5.9 Evolution of magnetization in B0 + B1 . . . . . . . . . . . . . . . . . . . . . . 35
iii
iv CONTENTS
1.5.10 Selective pulses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
Relaxation 41
2.1 Relaxation due to chemical shift anisotropy . . . . . . . . . . . . . . . . . . . . . . . . 41
2.2 Adiabatic contribution to relaxation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
2.3 Including non-adiabatic contribution to relaxation . . . . . . . . . . . . . . . . . . . . 44
2.4 Internal motions, structural changes . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
2.5 Bloch equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
2.6 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
2.6.1 Loss of coherence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
2.6.2 Stochastic molecular motions: diﬀusion . . . . . . . . . . . . . . . . . . . . . . 52
2.6.3 Isotropic rotational diﬀusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
2.6.4 Time correlation function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
2.6.5 Return to equilibrium . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Signal acquisition and processing 63
3.1 NMR experiment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
3.2 NMR signal acquisition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
3.3 Fourier transformation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
3.4 Consequence of ﬁnite signal acquisition . . . . . . . . . . . . . . . . . . . . . . . . . . 68
3.5 Discrete Fourier transformation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
3.6 Consequence of discrete signal acquisition . . . . . . . . . . . . . . . . . . . . . . . . . 71
3.7 Zero ﬁlling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
3.8 Phase correction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
3.9 Apodization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
3.10 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
3.10.1 Setting up NMR experiment . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
3.10.2 Quadrature detection and demodulation . . . . . . . . . . . . . . . . . . . . . 75
3.10.3 Noise accumulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
3.10.4 Mathematical description of Fourier transformation . . . . . . . . . . . . . . . 77
3.10.5 Fourier transformation of an ideal NMR signal . . . . . . . . . . . . . . . . . . 78
3.10.6 Properties of continuous Fourier transformation . . . . . . . . . . . . . . . . . 78
3.10.7 Causality and reconstruction of imaginary signal . . . . . . . . . . . . . . . . . 79
3.10.8 Spectral width, resolution, and sampling . . . . . . . . . . . . . . . . . . . . . 79
3.10.9 Discrete ideal signal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
3.10.10Zero- and ﬁrst-order phase corrections . . . . . . . . . . . . . . . . . . . . . . 82
3.10.11Dolph–Chebyshev window . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
II Quantum description 85
Review of quantum mechanics 87
4.1 Wave function and state of the system . . . . . . . . . . . . . . . . . . . . . . . . . . 87
4.2 Superposition and localization in space . . . . . . . . . . . . . . . . . . . . . . . . . . 88
CONTENTS v
4.3 Operators and possible results of measurement . . . . . . . . . . . . . . . . . . . . . . 88
4.4 Expected result of measurement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
4.5 Operators of position and momentum, commutators . . . . . . . . . . . . . . . . . . . 93
4.6 Operator of energy and equation of motion . . . . . . . . . . . . . . . . . . . . . . . . 95
4.7 Operator of angular momentum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
4.8 Operator of orbital magnetic moment . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
4.9 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
4.9.1 Classical mechanics: Newton, Lagrange, Hamilton . . . . . . . . . . . . . . . . 99
4.9.2 Lagrangian and Hamiltonian including magnetism . . . . . . . . . . . . . . . . 100
4.9.3 Legendre transformation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
4.9.4 Calculating square . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
4.9.5 Orthogonality and normalization of monochromatic waves . . . . . . . . . . . 104
4.9.6 Eigenfunctions and eigenvalues, operator of momentum . . . . . . . . . . . . . 104
4.9.7 Operator of position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
4.9.8 Commutation relations of the position and momentum operators . . . . . . . . 105
4.9.9 Projection operator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106
4.9.10 Schr¨odinger equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
4.9.11 Limitation of wave equation to ﬁrst time derivative . . . . . . . . . . . . . . . 108
4.9.12 Commutators of angular momentum operators . . . . . . . . . . . . . . . . . . 110
4.9.13 Angular momentum and rotation . . . . . . . . . . . . . . . . . . . . . . . . . 110
4.9.14 Rotation described by Wigner matrices . . . . . . . . . . . . . . . . . . . . . . 112
4.9.15 Eigenvalues of angular momentum operators . . . . . . . . . . . . . . . . . . . 115
4.9.16 Eigenfunctions of angular momentum operators . . . . . . . . . . . . . . . . . 117
Spin 121
5.1 Dirac equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121
5.2 Operator of the spin magnetic moment . . . . . . . . . . . . . . . . . . . . . . . . . . 122
5.3 Operators of spin angular momentum . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
5.4 Eigenfunctions and eigenvalues of ˆIz . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124
5.5 Evolution, eigenstates and energy levels . . . . . . . . . . . . . . . . . . . . . . . . . . 125
5.6 Real particles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127
5.7 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130
5.7.1 Special theory of relativity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130
5.7.2 Relativistic momentum and energy . . . . . . . . . . . . . . . . . . . . . . . . 132
5.7.3 Relativistic quantum mechanics . . . . . . . . . . . . . . . . . . . . . . . . . . 133
5.7.4 Finding the matrices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136
5.7.5 Solution of the Dirac equation . . . . . . . . . . . . . . . . . . . . . . . . . . . 138
5.7.6 Relation between Dirac and Schr¨odinger equations . . . . . . . . . . . . . . . . 142
5.7.7 Hamiltonian of spin magnetic moment . . . . . . . . . . . . . . . . . . . . . . 143
5.7.8 Spin magnetogyric ratio . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145
5.7.9 The factor of one half in the eigenvalues of ˆIz . . . . . . . . . . . . . . . . . . 146
5.7.10 Eigenfunctions of ˆIx and ˆIy . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147
5.7.11 Stationary states and energy level diagram . . . . . . . . . . . . . . . . . . . . 148
vi CONTENTS
5.7.12 Oscillatory states . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149
5.7.13 Evolution in general alternating magnetic ﬁelds . . . . . . . . . . . . . . . . . 150
5.7.14 Evolution in rotating magnetic ﬁelds . . . . . . . . . . . . . . . . . . . . . . . 152
5.7.15 Evolution in non-rotating magnetic ﬁelds . . . . . . . . . . . . . . . . . . . . . 153
5.7.16 Modifying factor ξ in description of non-rotating ﬁelds . . . . . . . . . . . . . 155
5.7.17 Factor ξ approximated by power series expansion . . . . . . . . . . . . . . . . 157
Ensemble of non-interacting spins 165
6.1 Mixed state . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 165
6.2 Populations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167
6.3 Coherence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168
6.4 Basis sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 170
6.5 Liouville-von Neumann equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
6.6 General strategy of analyzing NMR experiments . . . . . . . . . . . . . . . . . . . . . 172
6.7 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174
6.7.1 Indistinguishable particles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174
6.7.2 Separation of spin wave function . . . . . . . . . . . . . . . . . . . . . . . . . . 175
6.7.3 Separation of variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176
6.7.4 Phases and coherences . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177
6.7.5 From Schr¨odinger to Liouville-von Neumann equation . . . . . . . . . . . . . . 178
6.7.6 Rotation in operator space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179
Chemical shift, one-pulse experiment 181
7.1 Operator of the observed quantity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
7.2 Hamiltonian of the static ﬁeld B0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
7.3 Hamiltonian of the radio ﬁeld B1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182
7.4 Hamiltonian of chemical shift . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183
7.5 Secular approximation and averaging . . . . . . . . . . . . . . . . . . . . . . . . . . . 183
7.6 Thermal equilibrium as the initial state . . . . . . . . . . . . . . . . . . . . . . . . . . 184
7.7 Relaxation due to chemical shift anisotropy . . . . . . . . . . . . . . . . . . . . . . . . 185
7.8 One-pulse experiment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185
7.9 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188
7.10 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 190
7.10.1 Decomposition of chemical shift Hamiltonian . . . . . . . . . . . . . . . . . . . 190
7.10.2 Density matrix in thermal equilibrium . . . . . . . . . . . . . . . . . . . . . . 190
7.10.3 Bloch-Wangsness-Redﬁeld theory . . . . . . . . . . . . . . . . . . . . . . . . . 191
7.10.4 Thermal noise of electrical circuits . . . . . . . . . . . . . . . . . . . . . . . . 194
7.10.5 Spectrum and signal-to-noise ratio . . . . . . . . . . . . . . . . . . . . . . . . 194
Dipolar coupling, product operators 199
8.1 Dipolar coupling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199
8.2 Quantum states of magnetic moment pairs . . . . . . . . . . . . . . . . . . . . . . . . 201
8.3 Product operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 202
CONTENTS vii
8.4 Density matrix of a two-spin system . . . . . . . . . . . . . . . . . . . . . . . . . . . . 204
8.5 Commutators of product operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . 205
8.6 Operator of the observed quantity for more nuclei . . . . . . . . . . . . . . . . . . . . 207
8.7 Dipolar relaxation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207
8.8 Thermal equilibrium with dipolar coupling . . . . . . . . . . . . . . . . . . . . . . . . 208
8.9 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 210
8.9.1 Tensor and Hamiltonian of dipolar coupling . . . . . . . . . . . . . . . . . . . 210
8.9.2 Secular approximation and averaging of dipolar Hamiltonian . . . . . . . . . . 211
8.9.3 Interacting and non-interacting magnetic moments . . . . . . . . . . . . . . . 212
8.9.4 Product operator bases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 214
8.9.5 Deriving commutators of product operators . . . . . . . . . . . . . . . . . . . 215
8.9.6 Dipole-dipole relaxation: derivation . . . . . . . . . . . . . . . . . . . . . . . . 220
8.9.7 Dipole-dipole relaxation: discussion . . . . . . . . . . . . . . . . . . . . . . . . 224
8.9.8 Two magnetic moments in thermal equilibrium . . . . . . . . . . . . . . . . . 225
Two-dimensional spectroscopy, NOESY 229
9.1 Two-dimensional spectroscopy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 229
9.2 Evolution in the absence of dipolar coupling . . . . . . . . . . . . . . . . . . . . . . . 230
9.3 Signal modulation in a two-dimensional experiment . . . . . . . . . . . . . . . . . . . 231
9.4 NOESY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233
9.5 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 236
9.5.1 States-Haberkorn-Ruben method of processing hypercomplex data . . . . . . . 236
9.5.2 Quantitative analysis of cross-relaxation in NOESY . . . . . . . . . . . . . . . 237
9.5.3 Intensity of NOESY cross-peaks . . . . . . . . . . . . . . . . . . . . . . . . . . 238
J-coupling, spin echoes 239
10.1 Through-bond coupling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239
10.2 Secular approximation, averaging, and relaxation . . . . . . . . . . . . . . . . . . . . 240
10.3 Homo- and heteronuclear magnetic moment pairs . . . . . . . . . . . . . . . . . . . . 242
10.4 Density matrix evolution in the presence of J-coupling . . . . . . . . . . . . . . . . . 242
10.5 Signal in the presence of the J-coupling . . . . . . . . . . . . . . . . . . . . . . . . . . 247
10.6 Spin echoes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 249
10.7 Refocusing echo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 252
10.8 Decoupling echo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 252
10.9 Simultaneous echo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 253
10.10SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 254
10.10.1Interaction between nuclei mediated by bond electrons . . . . . . . . . . . . . 254
10.10.2Two electrons in a sigma orbital . . . . . . . . . . . . . . . . . . . . . . . . . . 258
10.10.3Classical analysis of two J-coupled polarizations . . . . . . . . . . . . . . . . . 260
10.10.4Comparison of classical and quantum analysis of J-coupling . . . . . . . . . . 266
10.10.5J-coupling compared to classical coupled oscillators . . . . . . . . . . . . . . . 266
10.10.6Two J-coupled nuclei in thermal equilibrium . . . . . . . . . . . . . . . . . . . 269
10.10.7Coherences depicted as double arrows . . . . . . . . . . . . . . . . . . . . . . . 269
viii CONTENTS
Correlated spectroscopy using J-coupling 277
11.1 Through-bond correlation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 277
11.2 INEPT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 278
11.3 Phase cycling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 281
11.4 Simpliﬁed analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 282
11.5 HSQC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 283
11.6 Decoupling trains . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 285
11.7 Signal summation and arraying in 2D spectroscopy . . . . . . . . . . . . . . . . . . . 287
11.8 Beneﬁts of HSQC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 288
11.9 COSY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 289
11.10SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 298
11.10.1APT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 298
11.10.2Double-quantum ﬁltered COSY . . . . . . . . . . . . . . . . . . . . . . . . . . 298
Strong coupling, TOCSY 303
12.1 Strong J-coupling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303
12.2 Magnetic equivalence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 305
12.3 TOCSY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307
12.4 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 311
12.4.1 Diagonalization of the J-coupling Hamiltonian matrix . . . . . . . . . . . . . . 311
12.4.2 Strong J-coupling and density matrix evolution . . . . . . . . . . . . . . . . . 314
12.4.3 HJ and operators of components of total I commute . . . . . . . . . . . . . . 315
12.4.4 J-coupling of magnetically equivalent nuclei . . . . . . . . . . . . . . . . . . . 315
12.4.5 Product operators of three and more coupled magnetic moments . . . . . . . . 316
12.4.6 Three magnetic moments in thermal equilibrium . . . . . . . . . . . . . . . . . 317
12.4.7 Commutation relations of the TOCSY mixing Hamiltonian . . . . . . . . . . . 317
12.4.8 Density matrix evolution in the TOCSY experiment . . . . . . . . . . . . . . . 318
Magnetic ﬁeld gradients 325
13.1 Pulsed ﬁeld gradients in NMR spectroscopy . . . . . . . . . . . . . . . . . . . . . . . 325
13.2 Magnetic resonance imaging . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 329
13.3 Weighting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 334
13.4 SUPPORTING INFORMATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 338
13.4.1 Coherence dephasing and slice selection by ﬁeld gradients . . . . . . . . . . . . 338
13.4.2 Field gradients with smooth amplitude . . . . . . . . . . . . . . . . . . . . . . 339
13.4.3 Coherence selection by pulsed-ﬁeld gradients . . . . . . . . . . . . . . . . . . . 339
13.4.4 Pulsed-ﬁeld gradients and frequency discrimination . . . . . . . . . . . . . . . 340
13.4.5 Slice-selective imaging . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 341
13.4.6 Frequency encoding gradients . . . . . . . . . . . . . . . . . . . . . . . . . . . 341
13.4.7 Phase encoding gradients . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 342
How to use this text
After couple years of teaching the course C5320 Theoretical Concepts of NMR, I have decided to
convert my handwritten notes to an electronic format. My students soon asked me to share my notes
with them. I have agreed, warning them that the text was not supposed to serve as a textbook. Its
purpose was not to explain NMR to students, but to keep the background information in a single
ﬁle for the teacher. More recently, I have added introductions summarizing content of individual
lectures. You recognize them by a larger font and less technical details. But my original notes
are still there, labeled Supporting Information. They represent a very heterogeneous collections of
derivations, lists, technical issues, without much explanation. Intentionally, I have not tried to safe
space and included details usually omitted in textbooks. I have also found useful to incorporate
discussion from the original literature instead of providing reference to old articles. Feel free to use
it as a source of information but do not expect educational approach.
I would like to stress that the course expects a regular attendance of lectures and of the related
practical exercises. Fortunately, there are also excellent textbooks explaining NMR. Our course does
not follow any of them speciﬁcally, but I strongly recommend to check them especially if you ﬁnd my
lectures confusing. At the beginning of each Lecture, I refer to the textbooks by a one-letter symbol
deﬁning the book, followed by the number specifying the section. The one-letter symbols followed
by the full citation and a short description are listed below
• K: J. Keeler, Understanding NMR spectroscopy, 2nd. ed., Wiley 2010. Very educational, easy
to read, but physically correct, written for chemists.
• L: M. Levitt: Spin dynamics, 2nd. ed., Wiley 2008. Also very educational, with many pictures,
physically correct, going more to physics than Keeler and including topics not covered by Keeler
or by our course.
• C: J. Cavanagh et al., Protein NMR spectroscopy, 2nd. ed., Academic Press 2006 .Extremely
useful and detailed, but more diﬃcult to read, the only book of the list discussing applications
to large molecules (proteins), but including also rigorous description of physical principles.
• B: K. Brown: Essential mathematics for NMR and MRI spectroscopists, Royal Society of
Chemistry 2017. Detailed description of many mathematical background, providing details
omitted in textbooks, written for chemists without advanced prior knowledge of mathematics.
Finally, I should also clearly describe the content and purpose of the course. It is curiosityoriented
course, attempting to explain the theory of NMR and to answer the ”why” questions. The
1
2 CONTENTS
course does not teach you how to analyze NMR spectra or set up NMR experiments, but it should
tell you why are the NMR experiments designed as they are. As NMR has roots in physics but
its fruit is mostly picked by chemists, the course must take into account that students of chemistry
and biochemistry are less trained in classical mechanics and electromagnetism than the students of
physics and biophysics. I try to cope with this fact by including the ”zeroth” chapter in this study
text. Its content is not covered by a lecture, but the students should read it individually (ideally
before the course starts) to check how much they understand the basics and to ﬁll the gaps. The
”zeroth” chapter also contains a homework that should guide the students.
Part I
Classical Introduction
3
Before we start:
Classical electromagnetism
Literature: Classical electromagnetism is discussed in L2 and B11, with the mathematical background
covered by B4.
0.1 Electric ﬁeld, electric charge, electric dipole
Objects having a property known as the electric charge (Q) experience forces (F) described as the
electric ﬁeld. Since the force depends on both charge and ﬁeld, a quantity E = F/Q known as the
electric intensity has been introduced:
F = QE. (1)
Field lines are often used to visualize the ﬁelds: direction of the line shows the direction of E,
density of the lines describes the size of E (|E|). A homogeneous static electric ﬁeld is described by
straight parallel ﬁeld lines.
Two point electric charges of the same size and opposite sign (+Q and −Q) separated by a distance
2r constitute an electric dipole. Electric dipoles in a homogeneous static electric ﬁeld experience a
moment of force, or torque τ:
τ = 2r × F = 2r × QE = 2Qr × E = µe × E, (2)
where µe is the electric dipole moment.
τ = µe × E (3)
is another possible deﬁnition of E. As derived in Section 0.6.1, potential energy of an electric
dipole is
E = −µe · E. (4)
0.2 Magnetic ﬁeld and magnetic dipole
There is no ”magnetic charge”, but magnetic moments exist:
5
6
τ = µm × B, (5)
where µm is the magnetic dipole moment (because this course is about magnetic resonance, we
will write simply µ). This is the deﬁnition of the magnetic induction B as a quantity describing
magnetic ﬁeld. As a consequence, potential energy of a magnetic dipole can be derived as described
by Eq. 27 for the electric dipole.
Potential energy of a magnetic moment µ is
E = −µ · B. (6)
The magnetic induction B is related to the force acting on a charged object, but in a diﬀerent
way than the electric intensity E (cf. Eq. 1). The magnetic force depends not only on the electric
charge Q but also on the speed of the charge v (i.e., on the electric current)
F = Q(v × B). (7)
Therefore, the torque τ cannot be described by an equation similar to Eq. 2. Instead,
τ = r × F = Qr × (v × B). (8)
Due to the fundamental diﬀerence between Eqs. 2 and 8, it is more diﬃcult to describe relation
between the magnetic force, magnetic moment and energy. We experience it in Sections 0.6.2 and
4.9.1.
0.3 Source of the electric ﬁeld
The source of the electric ﬁeld is the electric charge. The charge (i) feels (a surrounding) ﬁeld and
(ii) makes (its own) ﬁeld. Charge at rest is a source of a static electric ﬁeld. Parallel plates with
homogeneous distribution of charges (a capacitor) are a source of a homogeneous static electric ﬁeld.
Force between charges is described by the Coulomb’s law. The force between two charges is given
by
F =
1
4π 0
Q1Q2
r2
r
|r|
, (9)
where 0 = 8.854187817 × 10−12
F m−1
is the vacuum electric permittivity.
Consequently, the electric intensity generated by a point charge is
E =
1
4π 0
Q
r2
r
|r|
. (10)
The electric intensity generated by a charge density ρ is
E =
1
4π 0
ˆ
V
dV
ρ
r2
r
|r|
(11)
0.4. ORIGIN OF THE MAGNETIC FIELD 7
Coulomb’s law implies that electric ﬁelds lines of a resting charge
1. are going out of the charge (diverge), i.e., the static electric ﬁeld has a source (the charge)
2. are not curved (do not have curl or rotation), i.e., the static electric ﬁeld does not circulate
This can been written mathematically in the form of Maxwell equations:1
div E =
ρ
0
, (12)
rot E = 0. (13)
where div E is a scalar equal to ∂Ex
∂x
+ ∂Ey
∂y
+ ∂Ez
∂z
and rot E is a vector with the x, y, z components
equal to ∂Ez
∂y
− ∂Ey
∂z
, ∂Ex
∂z
− ∂Ez
∂x
, ∂Ey
∂x
− ∂Ex
∂y
, respectively. These expressions can be written in a much
more compact form, if we introduce a vector operator = ∂
∂x
, ∂
∂y
, ∂
∂z
. Using this formalism, the
Maxwell equations have the form
· E =
ρ
0
, (14)
× E = 0. (15)
0.4 Origin of the magnetic ﬁeld
Electric charge at rest does not generate a magnetic ﬁeld, but a moving charge does. The magnetic
force is a relativistic eﬀect (consequence of the contraction of distances in the direction of the motion,
described by Lorentz transformation).2
Magnetic ﬁeld of a moving point charge is moving with the
charge. Constant electric current generates a stationary magnetic ﬁeld. Constant electric current in
an ideal solenoid generates a homogeneous stationary magnetic ﬁeld inside the solenoid.
Magnetic induction generated by a current density j (Biot-Savart law):
B =
1
4π 0c2
ˆ
V
dV
j
r2
×
r
|r|
=
µ0
4π
ˆ
V
dV
j
r2
×
r
|r|
(16)
Biot-Savart law implies that magnetic ﬁeld lines of a constant current in a straight wire
1. do not diverge, i.e., the static magnetic ﬁeld does not have a source
1
The ﬁrst equation is often written using electric induction D as div D = ρ. If electric properties are described in
terms of individual charges in vacuum, D = 0E. If behavior of charges bound in molecules is described in terms of
polarization P of the material, D = 0E + P.
2
A charge close to a very long straight wire which is uniformly charged experiences an electrical force F⊥ in the
direction perpendicular to the wire. If the charges in the wire move with a velocity v0 and the charge close to the wire
moves along the wire with a velocity v1, the perpendicular force changes to F⊥(1 − v0v1
c2 ), were c is the speed of light
in vacuum. The modifying factor is clearly relativistic (B11.5).
8
2. make closed loops around the wire (have curl or rotation), i.e., the magnetic ﬁeld circulates
around the wire
This can been written mathematically in the form of Maxwell equations:3
· B = 0, (17)
× B = µ0j. (18)
A simple example of a moving charge is a circular loop with an electric current. As derived
in Section 0.6.2, magnetic moment of a current loop is proportional the angular momentum of the
circulating charge.
Magnetic dipolar moment µ is proportional to the angular momentum L
µ = γL, (19)
where γ is known as the magnetogyric ratio.
The classical theory does not explain why particles like electrons or nuclei have their own magnetic
moments, even when they do not move in circles (because the classical theory does not explain why
such particles have their own angular momenta). However, if we take the nuclear magnetic moment
as a fact (or if we obtain it using a better theory), the classical results are useful. It can be shown
that the magnetic moment is always proportional to the angular momentum,4
but the proportionality
constant is not always Q/2m; it is diﬃcult to obtain for nuclei.
Analysis of the current loop in a static homogeneous external magnetic ﬁeld, presented in Section
0.6.2, shows that if the direction of the magnetic moment µ of the loop diﬀers from the direction
of B, a torque trying to align µ with B. However, the magnetic dipole does not adopt the energetically
most favored orientation (with the same direction of µ as B), but rotates around B without
changing the angle between µ and B. This motion on a cone is known as precession.
This is not a result of quantum mechanics, but a classical consequence of the relation between
the magnetic moment and angular momentum of the current loop. The spinning top also precess
in the Earth’s gravitational ﬁeld and riding a bicycle is based on the same eﬀect.5
The precession
frequency can be derived easily for the classical current loop in a magnetic ﬁeld (see Section 0.6.3):
Angular frequency of the precession of a magnetic dipolar moment µ in a magnetic ﬁeld B is
ω = −γB. (20)
3
The second equation is often written using magnetic intensity H as × H = j. If magnetism is described as
behavior of individual charges and magnetic moments in vacuum, H = B/µ0. If properties of a magnetic materials
are described in terms of its magnetization M, then H = B/µ0 − M.
4
A consequence of the rotational symmetry of space described mathematically by the Wigner-Eckart theorem.
5
If you sit on a bike which does not move forward, gravity soon pulls you down to the ground. But if the bike has
a certain speed and you lean to one side, you do not fall down, you just turn a corner. A qualitative discussion of
precession using the spinning top and riding a bicycle is presented in L2.4–L2.5.
0.5. ELECTRODYNAMICS AND MAGNETODYNAMICS 9
0.5 Electrodynamics and magnetodynamics
Similarly to the electric charge, the magnetic dipole (i) feels the surrounding magnetic ﬁeld and (ii)
generates its own magnetic ﬁeld. The magnetic ﬁeld generated by a precessing magnetic dipole is
not stationary, it varies. To describe variable ﬁelds, the Maxwell equations describing rotation must
be modiﬁed:6
× E = −
dB
dt
, (21)
× B =
1
c2
dE
dt
+ µ0j. (22)
Note that electric and magnetic ﬁelds are coupled in the dynamic equations. Not only electric
currents current, but also temporal variation of E induces circulation of B, and circulation of E is
possible if B varies. This has many important consequences: it explains electromagnetic waves in
vacuum and has numerous fundamental applications in electrical engineering, including those used
in NMR spectroscopy.
Eq. 21 shows us how the frequency of the precession motion can be measured. A magnetic dipole
in a magnetic ﬁeld B0 generates a magnetic ﬁeld B with the component B0 constant and the
component ⊥ B0 rotating around B0. If we place a loop of wire next to the precessing dipole, with
the axis of the loop perpendicular to the axis of precession, the rotating component of B induces
circulation of E which creates a measurable oscillating electromotive force (voltage) in the loop (see
Section 0.6.4).
U =
µ0
4π
2|µ|S
r3
ω sin(ωt). (23)
As a consequence, an oscillating electric current ﬂows in the loop (L2.8).
HOMEWORK
First check that you understand Section 0.6.1. Then, derive how is the magnetic moment of a current
loop related to the angular momentum (Section 0.6.2) and what deﬁnes the precession frequency of
a magnetic moment of a current loop in a homogeneous magnetic ﬁeld (Section 0.6.3).
6
The second equation can be written as × H = dD
dt + j.
10
E
z
y
r
⊕
+Q
−Q
A
F
F
r
E
z
y
r
⊕
+Q
−Q
θ
B
z+,1 = z+,0 + z
z−,1 = z−,0 − z
z+,0 = z−,0
F
F
r
E
z
y
µe
θ
C
⊗
τ
Figure 1: Potential energy of an electric dipole in a homogeneous electric ﬁeld described by the intensity E. The reference position of
the dipole (0) is shown in Panel A, the actual position of the dipole (1) is shown in Panel B. Individual charges and forces are shown in
panels A and B, the dipolar moment µe and the torque τ (its direction −x is depicted using the symbol ⊗) are shown in Panel C. Note
that the direction of µe follows the convention used in physics, the convention used in chemistry is opposite.
0.6 SUPPORTING INFORMATION
0.6.1 Potential energy of an electric dipole
Potential energy7 of the electric dipole can be calculated easily as a sum of potential energies of the individual charges. Potential energy
is deﬁned as the work done by the ﬁeld moving the charge from a position (1) to a reference position (0). If we choose a coordinate system
as deﬁned in Figure 1, then the force acts only in the z -direction (Fz = |F| = Q|E| for the positive charge and Fz = −|F| = −Q|E|
for the negative charge). Therefore, it is suﬃcient to follow only how the z -coordinates of the charges change because changes of other
coordinates do not change the energy. The natural choice of the reference position is that the z coordinates are the same for both charges,
z+,0 = z−,0. Changing the z coordinate of the positive charge from z+,0 to z+,1 = z+,0 + z results in a work
Q|E|(z+,0 − z+,1) = −Q|E|z. (24)
Changing the z coordinate of the negative charge from z−,0 to z−,1 = z−,0 − z results in a work
− Q|E|(z−,0 − z−,1) = −Q|E|z. (25)
Adding the works
E = −2Q|E|z = −2Q|E|r cos θ = −µe · E, (26)
where θ is the angle between E and µe.
Equivalently, the potential energy can be deﬁned as the work done by the torque τ on µe (Figure 1C) when rotating it from the
reference orientation to the orientation described by the angle θ (between E and µe). The reference angle for z+,0 = z−,0 is π/2, therefore,
E =
θˆ
π
2
|τ|dθ =
θˆ
π
2
|µe||E| sin θ dθ = −|µe||E| cos θ = −µe · E. (27)
0.6.2 Current loop as a magnetic dipole
Now we derive what is the magnetic dipole of a circular loop with an electric current. The magnetic moment is deﬁned by the torque τ it
experiences in a magnetic ﬁeld B (Eq. 5):
τ = µ × B, (28)
7
Do not get confused: E (scalar) is the energy and E (vector) is electric intensity.
0.6. SUPPORTING INFORMATION 11
Therefore, we can calculate the magnetic moment of a current loop if we place it in a magnetic ﬁeld B. Let us ﬁrst deﬁne the geometry
of our setup. Let the axis z is the normal of the loop and let B is in the xz plane (⇒ By = 0). The vector product in Eq. 5 then simpliﬁes
to
τx = µyBz, (29)
τy = µzBx − µxBz, (30)
τz = −µyBx. (31)
Note that we assume that the electric current in the loop and the magnetic ﬁeld are independent. The current is not induced by B
but has another (unspeciﬁed) origin, and B is not a result of the current, but is introduced from outside.
As the second step, we describe the electric current in the loop. The electric current is a motion of the electric charge. We describe
the current as a charge Q homogeneously distributed in a ring (loop) of a mass m which rotates with a circumferential speed v. Then, each
element of the loop of a inﬁnitesimally small length dl = rdϕ contains the same fraction of the mass dm and of the charge dQ, moving
with the velocity v. The direction of the vector v is tangent to the loop and the amount of the charge per the length element is Q/2πr.
The motion of the charge element dQ can be described, as any circular motion, by the angular momentum
dL = r × dp = dm(r × v), (32)
where r is the vector deﬁning the position of the charge element dQ (Figure 2A). In our geometry, r is radial and therefore always
perpendicular to v. Since both r and v are in the xy plane, dL must have the same direction as the normal of the plane. Therefore, the
x and y components of dL are equal to zero and the z component is constant and identical for all elements (note that r and v of diﬀerent
elements diﬀer, but r × v is constant, oriented along the normal of the z axis and with the size equal to rv for all elements). It is therefore
easy to integrate dL and calculate L of the loop
Lx = 0, (33)
Ly = 0, (34)
Lz = rv
ˆ
loop
dm = mrv. (35)
As the third step, we examine forces acting on dQ. The force acting on a moving charge in a magnetic ﬁeld (the Lorentz force) is
equal to
F = Q(E + v × B), (36)
but we are now only interested in the magnetic component F = Q(v × B). The force acting on a single charge element dQ is
dF = dQ(v × B) =
Q
2πr
dl(v × B) =
Q
2π
(v × B)dϕ. (37)
The key step in our derivation is the deﬁnition of the torque
τ = r × F = Qr × (v × B), (38)
which connects our analysis of the circular motion with the deﬁnition of µ (Eq. 5). The torque acting on a charge element is (Figure 2B)
dτ = r × dF =
Q
2π
r × (v × B)dϕ =
Q
2π


v(r · B) − B (r · v)
=0


 dϕ =
Q
2π
(r · B)vdϕ. (39)
where a useful vector identity a × (b × c) = (a · c)b − (a · b)c helped us to simplify the equation because r ⊥ v). Eq. 39 tells us that
the torque has the same direction as the velocity v (v is the only vector on the right-hand side because r · B is a scalar). In our coordinate
frame, vx = −v sin ϕ, vy = v cos ϕ, vz = 0, and r ·B = rxBx +ryBy +rzBz = rxBx = Bxr cos ϕ (r ·B is reduced to rxBx in our coordinate
frame because By = 0 and rz = 0). Therefore, we can calculate the components of the overall torque τ as (Figure 2C)
τx = −
Qrv
2π
Bx
2πˆ
0
sin ϕ cos ϕdϕ = −
Qrv
4π
Bx
2πˆ
0
sin(2ϕ)dϕ = 0, (40)
τy =
Qrv
2π
Bx
2πˆ
0
cos2
ϕdϕ =
Qrv
4π
Bx
2πˆ
0
(1 + cos(2ϕ))dϕ =
Qrv
2
Bx, (41)
τz = 0. (42)
12
B
r
vdL
x
y
z
A
dϕ
dl
B
r
v
L
dτ
x
y
dF
B
dϕ
B
r
v
L
τ
x
y
µ = γL
C
Figure 2: Current loop as a magnetic dipole. The loop of radius r and length 2πr, charge Q and mass m is shown in cyan. A magnetic
induction B of an external ﬁeld is shown in magenta. The coordinates are chosen such that the loop is placed in the xy plane and B in
the xz. An element of charge dQ (moving with the velocity v), mass dm and length dl = rdϕ is shown in blue. The angular momentum
of the blue element is dL = r × vdm (Panel A). The total angular momentum is L = r × vm (Panel B). The force dF = v × BdQ and the
torque dτ = r × dF acting on the blue element are depicted as the green and red arrows in Panel B. The torque acting on the whole loop
and the magnetic moment experiencing the torque in the ﬁeld B are shown as the red and cyan arrows in Panel C.
Comparison with Eqs. 29–31 immediately shows that
µx = 0, (43)
µy = 0, (44)
µz =
Qrv
2
(45)
and comparison with Eqs. 33–35 reveals that the magnetic dipole moment of the current loop is closely related to the angular
momentum L = r × mv:
µ =
Q
2m
L. (46)
0.6.3 Precession
Angular momentum of a particle moving in a circle is deﬁned as L = mr × v (Eq. 32), where r deﬁnes position of the particle and m and
v are the mass and the velocity of the particle, respectively (Figure 3A). The change of L is described by the time derivative of L.
dL
dt
= m
d(r × v)
dt
= m
dr
dt
× v + mr ×
dv
dt
= m (v × v)
0
+r × ma. (47)
According the second Newton’s law, ma is equal to the force acting on the particle (changing L)
dL
dt
= r × ma = r × F = τ, (48)
where F is the force and τ is the corresponding torque. The change of the angular momentum of a current loop due to an external
force can be calculated in the same manner (Figure 3). For an inﬁnitesimal element of the loop,
d(dL)
dt
= r × a dm = r × dF = dτ. (49)
In a homogeneous magnetic ﬁeld, the force acting on all elements is the same and integration of the individual elements is as easy as
in Eq. 35, resulting in Eq. 48, where the force F and the torque τ now act on the angular momentum of the whole loop. Because µ = γL
(Eq. 19) and τ = µ × B (Eq. 5, the the magnetic moment of a current loop in a homogeneous magnetic ﬁeld changes as
0.6. SUPPORTING INFORMATION 13
dµ
dt
= γr × F = γτ = γµ × B = −γB × µ. (50)
Rotation of any vector, including µ can be described using the angular frequency ω (its magnitude is the speed of the rotation in
radians per second and its direction is the axis of the rotation):
dµ
dt
= ω × µ. (51)
Comparison with Eq. 50 immediately shows that ω = −γB.
0.6.4 Electromotive force (voltage)
We can use a simple example to analyze the induced voltage quantitatively. This voltage (the electromotive force) is an integral of the
electric intensity along the detector loop. Stokes’ theorem (see B9) allows us to calculate such integral from Eq. 21.
˛
L
Edl = −
ˆ
S
∂B
∂t
dS = S
∂B
∂t
· n, (52)
where S is the area of the loop and n is the normal vector to the loop. If the distance r of the magnetic moment from the detector
is much larger than the size of the loop, the magnetic induction of a ﬁeld which is generated by a magnetic moment µ rotating in a plane
perpendicular to the detector loop and which crosses the loop (let us call it Bx) is8
Bx =
µ0
4π
2µx
r3
. (53)
As µ rotates with the angular frequency ω, µx = |µ| cos(ωt), and
∂Bx
∂t
= −
µ0
4π
2
r3
|µ|ω sin(ωt). (54)
Therefore, the oscillating induced voltage is
˛
L
Edl =
µ0
4π
2|µ|S
r3
ω sin(ωt). (55)
8
We describe the ﬁeld generated by a magnetic moment in more detail later in Section 8.1 when we analyze mutual interactions of
magnetic moments of nuclei.
14
r
v
L
A
r
v
L
µ
B B
r
v
L
µ
τ
C
B
L
µ
τ
ω
D B
L
µ
τ
ω
E B
L
µ
τ
ω
F
Figure 3: Classical description of precession of a current loop in a homogeneous magnetic ﬁeld. Angular momentum L of a charged
particle of the mass m moving in a circular loop (shown in cyan in Panel A) randomly oriented in space is given by the vector product
of the actual position vector of the particle r and the actual particle’s velocity v (L = mr × v). Note that size and direction of L is the
same for all positions of the particle along the circle (for all possible vectors r). The angular momentum L of a current loop of the same
mass and the magnetic moment µ (cyan arrow), proportional to L are shown in Panel B. The proportionality constant is γ (Eq. 19). In a
presence of a vertical static magnetic ﬁeld B (magenta arrow in Panel C), the loop experiences a torque τ = µ × B (Eq. 5), shown as the
red arrow in Panel C. This torque (red arrow moved to the tip of the cyan arrow in Panel D) acts on µ, which precesses about B. Two
snapshots of the precessing µ (with the loop) are shown in Panels E and F. The tip of the cyan arrow representing µ rotates about B (the
blue circle) with the angular frequency ω = −γB.
Lecture 1
Nuclear magnetic resonance
Literature: A general introduction can be found in L2.6 and L2.7. A nice and detailed discussion,
emphasizing the importance of relaxation, is in Sz´antay et al.: Anthropic awareness, Elsevier 2015,
Section 2.4. A useful review of relevant statistical concepts is presented in B6. Chemical shift is
introduced by Levitt in L3.7 and discussed in detail in L9.1 (using a quantum approach, but the
classical treatment can be obtained simply by using energy Ej instead of ˆHj and magnetic moment
µjk instead of γj
ˆIjk in Eqs. 9.11–9.14). A nice discussion of the oﬀset eﬀects (and more) can be
found in K4.
1.1 Nuclear magnetic moments in chemical substances
The aim of this course is to describe physical principles of the most frequent version of NMR spectroscopy,
NMR analysis of chemical compounds dissolved in suitable solvents. We start by a description
based on classical electrodynamics and postpone discussions based on quantum mechanics
to Lecture 4. The classical theory does not explain why some nuclei in such solutions have a magnetic
moment, but it describes macroscopic eﬀects of the nuclear magnetic moments in bulk samples
(i.e., in macroscopic systems composed of billions of billions of molecules). It should be emphasized
that classical (non-quantum) physics provides much more relevant description of the mascroscopic
samples than quantum mechanics of individual particles (electrons or nuclei).
Nuclei have permanent microscopic magnetic moments, but the macroscopic magnetic moment
of non-ferromagnetic chemical substances is induced only in the magnetic ﬁeld. This is the eﬀect
of symmetry. Outside a magnet, all orientations of the microscopic magnetic moments have the
same energy and are equally probable. Therefore, the bulk magnetic moment is zero and the bulk
magnetization M (magnetic moment per unit volume) is zero (Fig. 1.1).
1.2 Polarization
In a static homogeneous magnetic ﬁeld B0, the orientations of µ are no longer equally probable: the
orientation of µ along B0 is energetically most favored and the opposite orientation is least favored.
The symmetry is broken in the direction of B0, this direction is used to deﬁne the z axis of a coordinate
system we work in. However, the state with all magnetic moments in the energetically most
15
16
Figure 1.1: Distribution of magnetic moments in the absence of a magnetic ﬁeld. Left, a schematic representation
of an NMR sample. Dots represent molecules, arrows represent magnetic moments (only one magnetic moment per
molecule is shown for the sake of simplicity, like e.g. in compressed 13
C16
O2). Right, the molecules are superimposed
to make the distribution of magnetic moments visible.
favorable orientation is not most probable. Orienting all magnetic moments along the magnetic ﬁeld
represents only one microstate. In contrast, there exist a large number of microstates with somewhat
higher energy. The correct balance between energy and probability is described by the Boltzmann
distribution law, which can be derived from purely statistical arguments. Thermodynamics thus helps
us to describe the polarization along z quantitatively.1
Calculation of the average magnetic moment,
presented in Section 1.5.2, shows that the bulk magnetization of the NMR sample containing nuclei
with µ:
Meq
x = 0 Meq
y = 0 Meq
z =
N
3
|µ|2
|B0|
kBT
, (1.1)
where N is the number of dipoles per unit volume.
In summary, dipoles are polarized in the static homogeneous magnetic ﬁelds. In addition, all
dipoles precess2
with the frequency ω = −γB0, but the precession cannot be observed at the macroscopic
level because the bulk magnetization is parallel with the axis of precession (Fig. 1.2).
1
Thermodynamics also tells us that the energy of the whole (isolated) system must be conserved. Decreased energy
of magnetic moments is compensated by increased rotational kinetic energy of molecules of the sample, coupled with
the magnetic moments via magnetic ﬁelds of the tumbling molecules, as discussed in the next chapter.
2
Precession is described in background Section 0.6.3.
1.3. COHERENCE 17
M
B0
Figure 1.2: Distribution of magnetic moments in a homogeneous magnetic ﬁeld B0. The cyan arrow represents the
bulk magnetization.
1.3 Coherence
In order to observe precession, we need to break the axial symmetry and introduce a coherent motion
of magnetic moments. This is achieved by applying another magnetic ﬁeld B1 perpendicular to B0 and
oscillating with the frequency close to (ideally equal to) γ|B0|/2π. In NMR, sources of the oscillatory
ﬁeld are radio waves.3
Figure 1.3 shows why a static perpendicular magnetic ﬁeld cannot be used,
whereas the desired eﬀect of an oscillating perpendicular magnetic ﬁeld is depicted in Figure 1.4.
If the radio waves are applied exactly for the time needed to rotate the magnetization by 90 ◦
,
they create a state with M perpendicular to B0. The magnetization vector (left panel in Fig. 1.4)
describes a new distribution of magnetic moments (right panel in Fig. 1.4). Such magnetization vector
then rotates with the precession frequency, also known as the Larmor frequency. The described
rotation corresponds to a coherent motion of nuclear dipoles polarized in the direction of M and
generates measurable electromotive force in the detector coil. When describing the eﬀect of radio
waves, the oscillating magnetic ﬁeld of the waves is often approximated by a rotating magnetic ﬁeld.
Such treatment is presented in detail in Section 1.5.5. Later, after introducing quantum-mechanical
description of nuclear magnetic moments, we show that rotating magnetic ﬁelds represent a good
approximation of radio waves oscillating in one dimension, if the magnetic induction of the radio
waves is much lower than |B0| (Sections 5.7.13–5.7.17).
3
In the context of the NMR spectroscopy, it is important that the ﬁeld oscillates in time, not that it travels in
space as a wave.
18
M
B0
M
Bstatic
B0
M
Bstatic
B0
Figure 1.3: Distribution of magnetic moments in the presence of an external homogeneous magnetic ﬁeld B0 (vertical
purple arrow) is such that the bulk magnetization of nuclei (shown in cyan) is oriented along B0 (left). Application of
another static magnetic ﬁeld B1 rotates magnetization away from the original vertical orientation down in a clockwise
direction (middle). However, the magnetization also precesses about B0. After a half-turn precession (right), the
clockwise rotation by the additional magnetic ﬁeld B1 returns the magnetization towards its original vertical direction.
Therefore, a static ﬁeld cannot be used to turn the magnetization from the vertical direction to a perpendicular
orientation.
M
B0
Bradio
M
B0
Figure 1.4: Eﬀect of the radio waves on the bulk magnetization (left) and distribution of magnetic moments after
application of the radio-wave pulse. The thin purple line shows oscillation of the magnetic induction vector of the radio
waves, the cyan trace shows evolution of the magnetization during irradiation. If the perpendicular magnetic ﬁeld
oscillates with a frequency equal to the precession frequency of magnetization, it rotates the magnetization clockwise
then it is tilted to the right, but counter-clockwise when the magnetization is tilted to the left. Therefore, the
magnetization is more and more tilted down from the original vertical direction. The total duration of the irradiation
by the radio wave was chosen so that the magnetization is rotated to the plane perpendicular to B0 (cyan arrow).
Note that the ratio |B0|/|Bradio| is much higher in a real experiment.
1.4. CHEMICAL SHIFT 19
A B C
Figure 1.5: A, Classical description of interaction of an observed magnetic moment with the orbital magnetic moment
of an electron of the same atom. The observed nucleus and the electron are shown in cyan and red, respectively. The
thick purple arrow represents B0, the thin purple induction lines represent the magnetic ﬁeld of the electron (the small
purple arrows indicate its direction). The electron in B0 moves in a circle shown in red, direction of the motion is
shown as the red arrow. The ﬁeld of the orbital magnetic moment of the electron in the same atom decreases the total
ﬁeld in the place of the observed nucleus (the small purple arrow in the place of the cyan nucleus is pointing down).
B, Interaction of an observed magnetic moment with the orbital magnetic moment of an electron of the another atom
(its nucleus is shown in gray). In the shown orientation of the molecule, the ﬁeld of the orbital magnetic moment of
the electron in the other atom increases the total ﬁeld in the place of the observed nucleus (the small purple arrow
close the cyan nucleus is pointing up). C, As the molecule rotates, the cyan nucleus moves to a position where the
ﬁeld of the orbital magnetic moment of the electron in the other atom starts to decrease the total ﬁeld (the induction
lines reverse their direction in the place of the cyan nucleus).
1.4 Chemical shift
The description of the motions of the bulk nuclear magnetization presented in the previous section
is simple but boring. What makes NMR useful for chemists and biologists is the fact that the energy
of the magnetic moment of the observed nucleus is inﬂuenced by magnetic ﬁelds associated with
motions of nearby electrons. In order to understand this eﬀect, we need to describe the magnetic
ﬁelds of moving electrons.
If a moving electron enters a homogeneous magnetic ﬁeld, it experiences a Lorentz force and moves
in a circle in a plane perpendicular to the ﬁeld (cyclotron motion). Such an electron represents an
electric current in a circular loop, and is a source of a magnetic ﬁeld induced by the homogeneous
magnetic ﬁeld. The homogeneous magnetic ﬁeld B0 in NMR spectrometers induces a similar motion
of electrons in atoms, which generates microscopic magnetic ﬁelds (Figure 1.5A).
The observed nucleus feels the external magnetic ﬁeld B0 slightly modiﬁed by the microscopic
ﬁelds of electrons. If the electron distribution is spherically symmetric, with the observed nucleus in
the center (e.g. electrons in the 1s orbital of the hydrogen atom), the induced ﬁeld of the electrons
decreases the eﬀective magnetic ﬁeld felt by the nucleus in the center. Since the induced ﬁeld of
electrons Be is proportional to the inducing external ﬁeld B0, the eﬀective ﬁeld can be described as
B = B0 + Be = (1 + δ)B0. (1.2)
The constant δ is known as chemical shift and does not depend on the orientation of the molecule
20
in such a case4
. The precession frequency of the nucleus is equal5
to (1 + δ)ω0.
Most molecules consist of multiple atoms and electron distribution is therefore not spherically
symmetric around the observed nucleus. As a consequence, the eﬀective ﬁeld depends on the orientation
of the whole molecule deﬁning mutual positions of atoms and orientation of molecular orbitals.
The currents induced in orbitals of other atoms may decrease or increase (shield or deshield) the
eﬀective magnetic ﬁeld felt by the observed nucleus (Figure 1.5B,C). Therefore, the eﬀective ﬁeld
ﬂuctuates as a result of rotational diﬀusion of the molecule and of internal motions changing mutual
positions of atoms. The induced ﬁeld of electrons is still proportional to the inducing external ﬁeld
B0, but the proportionality constants are diﬀerent for each combination of components of Be and
B0 in the coordination frame used. Therefore, we need six6
constants δjk to describe the eﬀect of
electrons:
Be,x = δxxB0,x + δxyB0,y + δxzB0,z (1.3)
Be,y = δyxB0,x + δyyB0,y + δyzB0,z (1.4)
Be,z = δzxB0,x + δzyB0,y + δzzB0,z (1.5)
Eqs. 1.3–1.5 can be written in more compact forms


Be,x
Be,y
Be,z

 =


δxx δxy δxz
δyx δyy δyz
δzx δzy δzz

 ·


B0,x
B0,y
B0,z

 (1.6)
or
Be = δ · B0, (1.7)
where δ is the chemical shift tensor.
It is always possible to ﬁnd a coordinate system X, Y, Z known as the principal frame, where δ is
represented by a diagonal matrix. In such a system, we need only three constants (principal values of
the chemical shift tensor): δXX, δY Y , δZZ. However, three more parameters must be speciﬁed: three
Euler angles (written as ϕ, ϑ, and χ in this text) deﬁning orientation of the coordinate system X, Y, Z
in the laboratory coordinate system x, y, z. Note that δXX, δY Y , δZZ are true constants because they
do not change as the molecule tumbles in solution (but they may change due to internal motions
or chemical changes of the molecule). The orientation is completely described by the Euler angles.
Graphical representation of the chemical shift tensor is shown in Figure 1.6, the algebraic description
is presented in Section 1.5.6. We derive a not very simple equation describing how electrons modify
the external magnetic ﬁeld:
4
Instead of δ, a constant with the opposite sign deﬁning the chemical shielding is sometimes used.
5
The value of δ in Eq. 1.2 describes how much the frequency of nuclei deviates from a hypothetical frequency of
free nuclei. Such a hypothetical frequency is diﬃcult to measure. In practice, frequencies of nuclei in certain, readily
accessible chemical compounds are used instead of the frequencies of free nuclei as the reference values of δ, as is
described in Section 3.1.
6
There are nine constants in Eqs. 1.3–1.5, but δxy = δyx, δxz = δzx, and δyz = δzy.
1.4. CHEMICAL SHIFT 21
= + +
Figure 1.6: Visualization of the chemical shift tensor (black). Distance of each point at the plotted surface from
its center is proportional to the magnetic induction Be in the given direction. The chemical shift tensor can be
decomposed to its isotropic (blue), axial (red), and rhombic (green) contributions. The dark and light colors indicate
positive and negative values.
Be = δiB0


0
0
1

 + δaB0


3 sin ϑ cos ϑ cos ϕ
3 sin ϑ cos ϑ sin ϕ
3 cos2
ϑ − 1


+ δrB0


−(2 cos2
χ − 1) sin ϑ cos ϑ cos ϕ + 2 sin χ cos χ sin ϑ sin ϕ
−(2 cos2
χ − 1) sin ϑ cos ϑ sin ϕ − 2 sin χ cos χ sin ϑ cos ϕ
+(2 cos2
χ − 1) sin2
ϑ

, (1.8)
where δi, δa, and δr are constants describing sizes of the isotropic, axially symmetric and asymmetric
(rhombic) components of the chemical shift tensor, respectively, and ϑ, ϕ, χ are the aforementioned
Euler angles.
Do we really need such a level of complexity? The answer is ”yes and no”. When we analyze only
the (average) value of the precession frequency, it is suﬃcient to consider only the isotropic component.
The description of the eﬀect of electrons then simpliﬁes to Eq. 1.2, where δ now represents δi of
Eq. 1.8. When we analyze also the eﬀect of stochastic motions, the other terms become important as
well. The correct quantitative analysis requires full Eq. 1.8, but the basic principles can be discussed
without using the rhombic component. Therefore, we will use the axially symmetric approximation
of Eq. 1.8 when we discuss eﬀects of molecular motions in Section 2.6.1.
Practical consequences of the existence of the chemical shift, their formal description and related
conventions used in the NMR literature are discussed in Section 1.5.7. In addition, Section 1.5.7
presents simpliﬁed equations of motion describing evolution of magnetization in terms of classical
physics and in the absence of relaxation. Solution of these equations is described in Section 1.5.8 for
a simple case of magnetization rotating in the absence of the radio waves. The classical analysis of
relaxation eﬀects is then discussed in the next lecture.
We can summarize the results of the analysis of the eﬀect of the chemical shift as follows
22
• In the absence of the radio waves, the chemical shift distinguishes precession frequency of
individual nuclei. In isotropic liquids (solutions of studied compounds in common solvents,
investigated most frequently), the precession frequency is given by ω0 = −γ(1 + δ)B0, where
δ = δi is isotropic chemical shift, reﬂecting electron distribution around the nucleus in the
molecule.
• In the rotating coordinate system, the precession is described by the frequency oﬀset Ω =
ω0 − ωrot = ω0 − (−ωradio) = ω0 + ωradio, where the frequency of the rotation of the coordinate
frame ωrot is set to the frequency −ωradio deﬁned in Section 1.5.5 (deﬁnition of the sign is
a bit tricky, but the magnitude |ωradio| is simply frequency of the radio wave applied in the
experiment.)
• During application of the radio wave, the chemical shift makes impossible to fulﬁll the resonance
condition ω0 = −ωradio (or Ω = 0) for all nuclei.
• Only the nucleus with Ω = 0 experiences exactly the desired eﬀect of the radio wave pulse,
e.g. rotation exactly by 90 ◦
exactly around axis corresponding to direction x in the rotating
coordinate system. All other nuclei are aﬀected somewhat diﬀerently, the deviations are known
as oﬀset eﬀects.
• On one hand, the chemical shift (oﬀset eﬀects) make the eﬀects of radio waves imperfect for
nuclei with Ω = 0, on the other hand, the chemical shift (oﬀset eﬀects) allow us to inﬂuence
diﬀerent nuclei by magnetic waves selectively.
• By choosing a certain frequency of the radio wave, we specify which nucleus has Ω = 0 (is on
resonance), by choosing the power of radio waves (|ω1| = γ|B1|), we specify which nucleus is
not aﬀected by the radio wave (i.e. its magnetization is rotated by 360 ◦
).
• Low-power pulses (with low ω1) are more selective than high-power pulses (with low ω1). If a
low- and high-power pulses should rotate magnetization by the same angle, the duration of the
low-power pulse must be longer.
Note that now (when we take into account the chemical environment of the nucleus) we describe
precession of magnetic moments about B0 in isotropic liquids by three diﬀerent frequencies:
• the actual precession frequency (resonance frequency, chemically shifted Larmor frequency) ω0.
Its magnitude is in the radio frequency range (typically 10 MHz–1 GHz), its value is ω0 =
−γ(1 + δ)B0.
• chemical shift δ = δi is a relative value expressed in units of ppm. In principle, it should
be equal to −(ω0 + γB0)/γB0. In practice, it is reported relative to a reference signal from a
standard compound (ω0 −ω0,ref)/ω0,ref. The chemical shift is the value reported in the literature
as a property of the given compound. It is given by the molecular structure, is inﬂuenced by
chemical and physical conditions (temperature, ionic strength, pH) but does not depend on the
experimental setup (on B0 or frequency of radio waves used in the experiment).
1.4. CHEMICAL SHIFT 23
• frequency oﬀset Ω = ω0 − (−ωradio). Its magnitude is in the audio frequency range. Frequency
oﬀset is given by the choice of the frequency of the radio waves used in the experiment. Its
value is important for the experimental setup but not comparable with frequency oﬀsets at
other spectrometers. Therefore, it is usually not reported in the literature.
HOMEWORK
Derive the bulk magnetization of an NMR sample (Section 1.5.2) and solve the equations describing
evolution of the magnetization in B0 (Section 1.5.8).
24
1.5 SUPPORTING INFORMATION
1.5.1 Calculating averages
An average value of some quantity f is calculated as
f =
f1 + f2 + · · · + fN
N
=
N
j=1
fj
N
=
N
j=1
fj
N
j=1
1
. (1.9)
If f is a function of a variable t (e.g., time) and we measure f for regularly spaced values of t (e.g. in regular time steps ∆t), we can
calculate the average of f in the interval between t0 and tN = t0 + N∆t as
f(tj) =
N
j=1
f(tj)
N
=
N
j=1
f(tj)
N
j=1
1
. (1.10)
The same result is obtained if we multiply both the numerator and denominator by ∆t:
f(tj) =
N
j=1
f(tj)∆t
N
j=1
∆t
. (1.11)
Shortening ∆t → 0 tells us that we can calculate an average value of a continuous function f(t) by integration:
f(t) =
tN´
t0
f(t)dt
tN´
t0
dt
=
tN´
t0
f(t)dt
tN − t0
. (1.12)
This recipe can be easily extended to functions of more variables. For example, the average value of f(x, y) for x0 < x < xN and
y0 < y < yN is
f(x, y) =
xN´
x0
yN´
y0
f(x, y)dxdy
xN´
x0
yN´
y0
dxdy
=
xN´
x0
yN´
y0
f(x, y)dxdy
(xN − x0)(yN − y0)
. (1.13)
The geometric interpretation is presented in Figure 1.7A. The average value is equal to the volume below the surface deﬁned by f(x, y)
(pink in Figure 1.7A) above the rectangle (xN −x0)×(yN −y0), divided by the volume of the box of dimensions (xN −x0)×(yN −y0)×1.
The latter volume is numerically equal to the area of the rectangle (xN − x0) × (yN − y0). The former volume is a sum of volumes of many
rectangular prisms. Each prism has the same base of the area dx · dy (shown in yellow in Figure 1.7A) and a diﬀerent height (shown in
magenta) equal to f(x, y).
In NMR spectroscopy, we often calculate average for various orientations in space. The orientation can be described by a vector r of
unit length (|r| = 1) pointing in the given direction. The end points of vectors deﬁning all possible orientations form a surface of a sphere
of the radius r = 1 (Figure 1.7B). The orientation-dependent quantity f can be described as a function of two angles, of inclination ϑ
and azimuth ϕ. The average value of f(ϑ, ϕ) is calculated as an integral of the values of f ”above” the surface of the sphere, divided by
the area of the surface. The integral ”above” the surface of the sphere is a sum of integrals ”above” narrow bands on the surface of the
sphere. Each band (an example is shown in green in Figure 1.7B) can be decomposed into small rectangles. One rectangle is shown in
yellow in Figure 1.7B. One side of the rectangle (red in Figure 1.7B) corresponds to the width of the green band. Its length is given by the
arch between the vectors r(ϑ, ϕ) (green in Figure 1.7B) and r(ϑ − dϑ, ϕ) (red in Figure 1.7B). The length of the arch is rdϑ. The other
side of the rectangle (blue in Figure 1.7B) corresponds to the arch between the projections of the vectors r(ϑ, ϕ) (green in Figure 1.7B)
and r(ϑ, ϕ + dϕ) (blue in Figure 1.7B) to the horizontal plane. The length of the blue arch is r sin ϑdϕ. Therefore, the area of the yellow
rectangle is r2 sin ϑdϑdϕ, or simply sin ϑdϑdϕ because r = 1.
The integration is equivalent to summation of volume elements similar to the magenta prisms in Figure 1.7A. One of them is depicted
in Figure 1.7C. The integral can be written as
ϑ=πˆ
ϑ=0
ϕ=2πˆ
ϕ=0
f(ϑ, ϕ) sin ϑdϑdϕ =
2πˆ
0
dϕ
πˆ
0
sin ϑdϑf(ϑ, ϕ) (1.14)
1.5. SUPPORTING INFORMATION 25
dϕ
dϑ
r sin ϑdϕ
rdϑ
dx
dy
A CB
f(ϑ, ϕ)
ϑ
ϕ
f(x, y)
r sin ϑ
Figure 1.7: Integration in calculating averages. Integration of f(x, y) (A), integration of f(ϑ, ϕ) (B), and the volume element
f(ϑ, ϕ)r2 sin ϑdϑdϕ (C). Details are described in the text.
and the average is
f(ϑ, ϕ) =
2π´
0
dϕ
π´
0
sin ϑdϑf(ϑ, ϕ)
2π´
0
dϕ
π´
0
sin ϑdϑ
=
2π´
0
dϕ
π´
0
sin ϑdϑf(ϑ, ϕ)
2π´
0
dϕ
1´
−1
du
=
2π´
0
dϕ
π´
0
sin ϑdϑf(ϑ, ϕ)
4π
, (1.15)
where we used the substitution
u = cos ϑ ⇒ du =
du
dϑ
dϑ =
d cos ϑ
dϑ
dϑ = − sin ϑdϑ (1.16)
1.5.2 Polarization and bulk magnetization
The average value of the z-component of µ is calculated according to Section 1.5.1. The distribution of µ is axially symmetric. Therefore,
µeq
z does not depend on ϕ and we can integrate over ϕ and over ϑ separately in Eq. 1.15
µeq
z =
2π´
0
dϕ
π´
0
Peq(ϑ)µz(ϑ sin ϑ)dϑ
4π
=
4π
π´
0
Peq(ϑ)µz(ϑ) sin ϑdϑ
4π
=
πˆ
0
Peq
(ϑ)µz sin ϑdϑ =
πˆ
0
Peq
(ϑ)|µ| cos ϑ sin ϑdϑ, (1.17)
where ϑ is the inclination (angle between µ and axis z) and Peq(ϑ) is the probability of µ to be tilted by the angle ϑ. If the magnetic
dipoles are in a thermodynamic equilibrium, the angular distribution of the µ orientation is given by the Boltzmann law: Probability of a
system to be in the state with the energy Ej at the temperature T is given by
Peq
j =
e
−
Ej
kBT
Z
, (1.18)
where Z is sum of the e
−
Ek
kBT
terms of all possible states.
Peq
(ϑ) =
e
−
E(ϑ)
kBT
π´
0
e
−
E(ϑ )
kBT
sin ϑ dϑ
, (1.19)
26
where T is the thermodynamic temperature, kB = 1.38064852×10−23 m2 kg s−2 K−1 is the Boltzmann constant, and E = −|µ||B0| cos ϑ
is the magnetic potential energy of the dipole. The distribution is axially symmetric, all values of the azimuth ϕ are equally possible.
Using the substitutions
u = cos ϑ ⇒ du =
du
dϑ
dϑ =
d cos ϑ
dϑ
dϑ = − sin ϑdϑ (1.20)
and
w =
|µ||B0|
kBT
, (1.21)
Peq
(ϑ) =
e
−
E(ϑ)
kBT
π´
0
e
−
E(ϑ )
kBT
sin ϑ dϑ
=
euw
−1´
1
−eu wdu
=
euw
1´
−1
eu wdu
=
euw
1
w
eu w 1
−1
=
w
ew − e−w
euw
= Peq
(u). (1.22)
Knowing the distribution, the average z-component of µ can be calculated
µeq
z =
πˆ
0
Peq
(ϑ)|µ| cos ϑ sin ϑdϑ =
1ˆ
−1
|µ|uPeq
(u)du =
|µ|w
ew − e−w
1ˆ
−1
ueuw
du. (1.23)
The integration can be performed per parts (per partes), with the result
µeq
z =
|µ|w
ew − e−w
1
w2
euw
(uw − 1)
1
−1
=
|µ|
ew − e−w
1
w
ew
(w − 1) −
1
w
e−w
(−w − 1) = |µ|
ew + e−w
ew − e−w
−
1
w
= |µ| coth(w) −
1
w
,
(1.24)
where we recognized that the ratio of exponential terms is the function hyperbolic cotangent (coth(w)). The function coth(w) can be
expanded as a Taylor series
coth(w) ≈
1
w
+
w
3
−
w3
45
+
2w5
945
− · · · ⇒ µz ≈ |µ|
w
3
−
w3
45
+
2w5
945
− · · · . (1.25)
At the room temperature, |µ||B0| kBT even in the strongest NMR magnets. Therefore, w is a very small number and its high
powers in the Taylor series can be neglected. In summary, the angular distribution can be approximated by
µeq
z =
1
3
|µ|2|B0|
kBT
, (1.26)
while
µeq
x = µeq
y = 0. (1.27)
1.5.3 Changing Cartesian coordinate frame
Diﬀerent Cartesian coordinate systems are optimal for description of diﬀerent issues related to NMR spectroscopy. It is therefore useful
to be able to change the coordinate system when needed. In this section, we limit the changes of coordinate systems to their rotation in
space.
We start by a two-dimensional case (Figure 1.8). Let us suppose we know that a vector a has components ax, ay in an ”original”
coordinate system xy, and we wish to express components of a in a ”primed” coordinate system x y . First, we have to specify relationship
between the coordinate systems. This relation is a rotation in the xy plane. We get the ”primed” coordinate frame (shown in red in
Figure 1.8) if we rotate the ”original” system (shown in blue in Figure 1.8) by an angle ϕ. Note that we now treat the original (blue)
coordinate frame as a geometric object and actively rotate it to a new (red) orientation in the plane.
In order to describe the vector a is represented in two rotated coordinate systems, we change the point of view. Now we are interested
how the vector a is seen from diﬀerent coordinate frames. If we express ax as |a| cos ϕa and ay as |a| sin ϕa, and ax as |a| cos ϕa and ay
as |a| sin ϕa, we see that ϕa = ϕa − ϕ (Figure 1.8). The coordinates of a in the ”primed” coordinate system (Figure 1.8B) are the same
as coordinates of a rotated ”backwards” (by angle −φ) in the original system (Figure 1.8C). This manipulation of a is known as passive
rotation. Note that the angles describing the active rotation (of the coordinate frames) and the passive rotation (of a) diﬀer in the sign.
The passive rotation can written down as a set of equations relating the coordinates of a in two diﬀerent frames
ax = |a| cos ϕa = |a| cos(ϕa − ϕ) = |a|(cos ϕa cos ϕ + sin ϕa sin ϕ) = ax cos ϕ + ay sin ϕ (1.28)
ay = |a| sin ϕa = |a| sin(ϕa − ϕ) = |a|(sin ϕa cos ϕ − cos ϕa sin ϕ) = −ax sin ϕ + ay cos ϕ, (1.29)
1.5. SUPPORTING INFORMATION 27
A
x
y
y
ϕa
x
ϕa
ϕ
a B
x
ϕa
y
y
xϕa
ϕ
a
C y
xϕa
ϕa
−ϕ
a
a
Figure 1.8: Changing two-dimensional coordinate frame. A, vector a (black) in two coordinate frames, shown in blue and red. The
red coordinate system is obtained by rotating the blue frame by the angle ϕ. B, the same picture oriented so that the axes of the red
coordinate frame are horizontal and vertical. C, Rotation of the vector a by the angle −ϕ results in a vector a , which is oriented in the
blue coordinate frame exactly like a is oriented in the red coordinate frame (cf. Panel B).
or in a matrix form
ax
ay
=
cos ϕ sin ϕ
− sin ϕ cos ϕ
ax
ay
. (1.30)
In order to convert coordinates of a in the ”primed” system to the coordinates of a in the ”original” frame, we have to rotate by +ϕ.
Changing sign of ϕ results in
ax
ay
=
cos ϕ − sin ϕ
sin ϕ cos ϕ
ax
ay
. (1.31)
The obtained relations can be easily extended to two three-dimensional coordinate frames xyz and x y z related by rotation about
the common axis z


ax
ay
az

 =


cos ϕ sin ϕ 0
− sin ϕ cos ϕ 0
0 0 1


rotation by −ϕ


ax
ay
az




ax
ay
az

 =


cos ϕ − sin ϕ 0
sin ϕ cos ϕ 0
0 0 1


rotation by +ϕ


ax
ay
az

 . (1.32)
Similar expressions can be derived for coordinated systems related by rotation about the x and y axes, respectively:


ax
ay
az

 =


1 0 0
0 cos ϑx sin ϑx
0 − sin ϑx cos ϑx


rotation by −ϑx


ax
ay
az




ax
ay
az

 =


1 0 0
0 cos ϑx − sin ϑx
0 sin ϑx cos ϑx


rotation by +ϑx


ax
ay
az

 (1.33)
and


ax
ay
az

 =


cos ϑy 0 − sin ϑy
0 1 0
sin ϑy 0 cos ϑy


rotation by −ϑy


ax
ay
az




ax
ay
az

 =


cos ϑy 0 sin ϑy
0 1 0
− sin ϑy 0 cos ϑy


rotation by +ϑy


ax
ay
az

 . (1.34)
We can now proceed to three-dimensional coordinate frames. Again, we start by specifying their mutual relation. In order to describe
a general relation of coordinated systems xyz and x y z , three subsequent active rotations of the frame xyz are needed. The choice of the
actual rotations is somewhat arbitrary. Note that if a mutual orientation of two coordinate frame is described by rotations about diﬀerent
axes, or by rotations about the same axes but in a diﬀerent order, the numerical values of the rotation angles diﬀer. Several conventions are
used in diﬀerent ﬁelds of science and for diﬀerent purposes, none of them is a general recommended standard. Nevertheless, all conventions
allow us to relate the coordinate frames unambiguously. In this course, we use the following rotations (Figure 1.9A):
28
1. Rotate about the z axis of the original coordinate frame to move the y axis to the x y plane. We call7 the angle of this rotation
χ and label a vector deﬁning the new direction of the y axis as n. Note that the direction of n is the intersection of the planes xy
and x y .
2. Rotate about n to move the axis z to the direction of z . We call the angle of this rotation ϑ.
3. Rotate about z to move n to the direction of y . We call the angle of this rotation ϕ.
Written in the matrix form, the active rotation converting the ”original” frame xyz to the ”primed” frame x y z is expressed as


cos ϕ − sin ϕ 0
sin ϕ cos ϕ 0
0 0 1




cos ϑ 0 sin ϑ
0 1 0
− sin ϑ 0 cos ϑ




cos χ − sin χ 0
sin χ cos χ 0
0 0 1

 =


cos ϕ cos ϑ cos χ − sin ϕ sin χ − cos ϕ cos ϑ sin χ − sin ϕ cos χ cos ϕ sin ϑ
sin ϕ cos ϑ cos χ + cos ϕ sin χ − sin ϕ cos ϑ sin χ + cos ϕ cos χ sin ϕ sin ϑ
− sin ϑ cos χ sin ϑ sin χ cos ϑ

 . (1.35)
The passive rotations describing how a is seen from diﬀerent coordinate frames, are performed in the reverse manner:


ax
ay
az

 =


cos χ sin χ 0
− sin χ cos χ 0
0 0 1




cos ϑ 0 − sin ϑ
0 1 0
sin ϑ 0 cos ϑ




cos ϕ sin ϕ 0
− sin ϕ cos ϕ 0
0 0 1




ax
ay
az

 (1.36)


ax
ay
az

 =


cos ϕ − sin ϕ 0
sin ϕ cos ϕ 0
0 0 1




cos ϑ 0 sin ϑ
0 1 0
− sin ϑ 0 cos ϑ




cos χ − sin χ 0
sin χ cos χ 0
0 0 1




ax
ay
az

 (1.37)
Expressing the product of the three rotation matrices,


ax
ay
az

 =


cos χ cos ϑ cos ϕ − sin χ sin ϕ cos χ cos ϑ sin ϕ + sin χ cos ϕ − cos χ sin ϑ
− sin χ cos ϑ cos ϕ − cos χ sin ϕ − sin χ cos ϑ sin ϕ + cos χ cos ϕ sin χ sin ϑ
sin ϑ cos ϕ sin ϑ sin ϕ cos ϑ




ax
ay
az

 (1.38)


ax
ay
az

 =


cos ϕ cos ϑ cos χ − sin ϕ sin χ − cos ϕ cos ϑ sin χ − sin ϕ cos χ cos ϕ sin ϑ
sin ϕ cos ϑ cos χ + cos ϕ sin χ − sin ϕ cos ϑ sin χ + cos ϕ cos χ sin ϕ sin ϑ
− sin ϑ cos χ sin ϑ sin χ cos ϑ




ax
ay
az

 . (1.39)
In the language of linear algebra, the matrix of the trigonometric functions of ϕ, ϑ, χ is a transformation matrix. If we label the
elements of the transformation matrices Rk k for the transformation from the ”original” to the ”primed” coordinate frame and Rkk for
the inverse transformation, the change of the coordinate frame (transformation) can be described in terms of the components of the vector
a as
ak =
k
Rk k(−ϕ, −ϑ, −χ)ak ak =
k
Rkk (χ, ϑ, ϕ)ak . (1.40)
The change of the coordinate system can be written in a similar fashion for a tensor Tjk (Figure 1.9B,C). We derive the relation in
two steps. In the ﬁrst step, we combine the rows of the matrix representing Tjk into row vectors Tj


Txx Txy Txz
Tyx Tyy Tyz
Tzx Tzy Tzz

 =


Tx
Ty
Tz

 . (1.41)
Formally, the vectors Tj are components of a column vector that transforms according to Eq. 1.40:
Tj =
j
Rjj (−ϕ, −ϑ, −χ)Tj. (1.42)
In the second step, we describe transformation of each vector Tj . As Tj are row vectors, they are multiplied from right by the
transformation matrix
Tj x Tj y Tj z


Rxx Rxy Rxz
Ryx Ryy Ryz
Rzx Rzy Rzz

 = Tj x Tj y Tj z , (1.43)
7
Our angles ϕ, ϑ, χ represent Euler angles, usually labeled α, β, and γ. As the Greek letters α, β, and γ are traditionally used for
diﬀerent purposes in NMR spectroscopy, we use other letters in our course.
1.5. SUPPORTING INFORMATION 29
A
y
xx
z
z y
n
ϕ
ϑ
χ
B Y
Z
x
z
z
x
y
y
n
C
Z
Y
z
x
y
y
x
z
n
Figure 1.9: Changing three-dimensional coordinate frame. A, the red coordinate system is obtained by rotating the blue frame (i)
about the axis z until the blue axis y coincides with n (angle of this rotation is χ), (ii) about n by ϑ (this determines the new direction of
axis z, labeled z and shown in red), and (iii) about z by ϕ (this determines the new direction of axis y, labeled y and shown in red). B,
a graphical representation of a tensor (green) in the discussed coordinate frames. The tensor, formally corresponding to a chemical shift
tensor with δi = 0 and δa = δr, is visualized as described in Figure 1.6. Dark and light green correspond to positive and negative values.
C, graphical represeantation of the same tensor rotated (i) by −ϕ about z , (ii) by −ϑ about n, and (iii) by −χ about z. The rotated
tensor has the same orientation in the blue coordinate system as the original tensor had in the red coordinate frame.
or
Tj k =
k
Tj kRkk (−ϕ, −ϑ, −χ). (1.44)
Note that if Tj were written as column vectors, the transformation would be


Rx x Rx x Rx x
Ry y Ry y Ry y
Rz z Rz z Rz z




Tj x
Tj y
Tj z

 =


Tj x
Tj y
Tj z

 . (1.45)
The transformation matrices in Eqs. 1.43 and 1.45 are related by transposition (changing rows to columns and columns to rows),
exactly like the transformation matrices in Eqs. 1.38 and 1.39. Therefore, transformation of Tj can be also described as
Tj k =
k
Rk k(ϕ, ϑ, χ)Tj k. (1.46)
Combining the ﬁrst and second step of the derivation, we can describe transformation of a tensor Tjk as
Tj m =
j k
Rj j(−ϕ, −ϑ, −χ)Rkk (−ϕ, −ϑ, −χ)Tjk =
j k
Rj j(−ϕ, −ϑ, −χ)Rk k(χ, ϑ, ϕ)Tjk. (1.47)
and
Tjk =
j k
Rjj (χ, ϑ, ϕ)Rk k(χ, ϑ, ϕ)Tj k =
j k
Rjj (χ, ϑ, ϕ)Rkk (−ϕ, −ϑ, −χ)Tj k . (1.48)
Yet another way of describing rotations of vectors and tensors is discussed in Section 4.9.14, when quantum mechanical treatment of
angular momentum is presented.
1.5.4 Rotation in complex representation
Rotations described in Section 1.5.3 can be also analyzed in a manner that is particularly useful in NMR spectroscopy. The three-dimensional
vectors in the real Cartesian coordinate system are ﬁrst converted to spherical coordinates
ax = |a| sin ϑa cos ϕa, (1.49)
ay = |a| sin ϑa sin ϕa, (1.50)
az = |a| cos ϑa, (1.51)
30
and then expressed as two-dimensional complex vectors, called spinors
aα
aβ
=
|a| cos ϑa
2
e−i ϕa
2
|a| sin ϑa
2
e+i ϕa
2
. (1.52)
We can easily check that the complex numbers aα and aβ unambiguously deﬁne the Cartesian coordinates ax, ay, az
aαa∗
β + aβa∗
α = |a| sin
ϑa
2
cos
ϑa
2
e−iϕa + |a| sin
ϑa
2
cos
ϑa
2
e+iϕa = |a| sin ϑa
e+iϕa + e−iϕa
2
= |a| sin ϑa cos ϕa = ax, (1.53)
i aαa∗
β − aβa∗
α = i |a| sin
ϑa
2
cos
ϑa
2
e−iϕa − |a| sin
ϑa
2
cos
ϑa
2
e+iϕa = −i|a| sin ϑa
e+iϕa − e−iϕa
2
= |a| sin ϑa sin ϕa = ay, (1.54)
aαa∗
α − aβa∗
β = |a| cos2 ϑa
2
− |a| sin2 ϑa
2
= |a| cos ϑa = az, (1.55)
and
aαa∗
α + aβa∗
β = |a| cos2 ϑa
2
+ |a| sin2 ϑa
2
= |a|. (1.56)
Rotation of a spinor by an angle −ϑn/2 about an axis n corresponds to the rotation of the real 3D vector about an angle −ϑn.
Rotations of spinors are described by 2 × 2 matrices
αn −β∗
n
βn α∗
n
, (1.57)
where
αn = cos ϑn/2 − inz sin ϑn/2, (1.58)
βn = −i(nx + iny) sin ϑn/2. (1.59)
For rotations about the x, y, and z axes,
aα
aβ
=
αx −β∗
x
βx α∗
x
aα
aβ
=
1 0
0 1
cos
ϑx
2
− i
0 1
1 0
sin
ϑx
2
aα
aβ
=
cos ϑx
2
−i sin ϑx
2
−i sin ϑx
2
cos ϑx
2
aα
aβ
, (1.60)
aα
aβ
=
αy −β∗
y
βy α∗
y
aα
aβ
=
1 0
0 1
cos
ϑy
2
− i
0 −i
i 0
sin
ϑy
2
aα
aβ
=
cos
ϑy
2
− sin
ϑy
2
sin
ϑy
2
cos
ϑy
2
aα
aβ
, (1.61)
aα
aβ
=
αz −β∗
z
βz α∗
z
aα
aβ
=
1 0
0 1
cos
ϑz
2
− i
1 0
0 −1
sin
ϑz
2
aα
aβ
=
e−i ϑz
2 0
0 e+i ϑz
2
aα
aβ
. (1.62)
The rotation of a 3D real vector by the Euler angles −ϕ, −ϑ, −χ (Eq. 1.35) thus corresponds to the following rotation of the spinor
aα
aβ
=
e−i χ
2 0
0 e+i χ
2
cos ϑ
2
− sin ϑ
2
sin ϑ
2
cos ϑ
2
e−i ϕ
2 0
0 e+i ϕ
2
aα
aβ
=
cos ϑ
2
e−i ϕ+χ
2 − sin ϑ
2
e+i ϕ−χ
2
sin ϑ
2
e−i ϕ−χ
2 cos ϑ
2
e+i ϕ+χ
2
aα
aβ
. (1.63)
When we proceed to the quantum mechanical description of NMR, we again meet spinors and 2×2 matrices describing their rotations
(Pauli matrices). However, what we describe here is rotation of completely classical objects, just expressed using pairs of complex numbers.
1.5. SUPPORTING INFORMATION 31
M
B0
Bradio
M
B1
Figure 1.10: Rotation of the magnetization to direction perpendicular to B0, shown in the laboratory and rotating coordinate frame
in the left and right panel, respectively. The thin purple line shows oscillation of the magnetic induction vector of the radio waves, the
cyan trace shows evolution of the magnetization during irradiation.
1.5.5 Rotating coordinate frame
Mathematically, the described radio ﬁeld can be decomposed into two components B+
radio and B−
radio rotating with the same angular
frequency but in opposite directions (ωradio and −ωradio, respectively). The component rotating in the same direction as the precessing
dipoles (B−
radio ≡ B1 in this text) tilts the magnetization vector M from the z direction, the other component can be neglected as long
as |B1| |B0|. This process represents a double rotation, the ﬁrst rotation is precession around the direction of B0, the second rotation
around B1 is known as nutation. Although this mathematical decomposition is only formal and does not reﬂect the physical reality, it is
frequently used to facilitate the analysis of the eﬀect of radio waves on magnetization. The description can be simpliﬁed (the eﬀect of the
precession removed), if we use B1 to deﬁne the x axis of our coordinate frame. As B1 rotates about B0 with an angular frequency ωradio,
we work in a coordinate frame rotating with a frequency ωrot = −ωradio (rotating frame). In order to deﬁne the direction of x in the
rotating frame, we must also deﬁne the phase φrot.
The components of the ﬁeld B1 rotating with the angular frequency −ωradio are in the laboratory frame
B1,x = |B1| cos(−ωradiot + φradio) = |B1| cos(ωrott + φradio), (1.64)
B1,y = |B1| sin(−ωradiot + φradio) = |B1| sin(ωrott + φradio), (1.65)
B1,z = 0 (1.66)
and in the rotating frame
B1,x = |B1| cos(φrot), (1.67)
B1,y = |B1| sin(φrot), (1.68)
B1,z = 0. (1.69)
Consequently, the rotation of magnetization is given by the angular frequency vector
ω = ω0 + ω1 = −γ(B0 + B1) =


0
0
−γ|B0|

 +


−γ|B1| cos(−ωradiot + φrot)
−γ|B1| sin(−ωradiot + φrot)
0

 =


−γ|B1| cos(−ωradiot + φrot)
−γ|B1| sin(−ωradiot + φrot)
−γ|B0|

 (1.70)
in the laboratory frame, and by
ω = ω1 = −γB1 =


−γ|B1| cos(φrot)
−γ|B1| sin(φrot)
0

 (1.71)
32
in the coordinate frame rotating with the angular frequency ωrot = −ωradio = ω0.
What are the components of B1 in the rotating frame for diﬀerent choices of φrot?
If φrot = 0, cos(0) = 1, sin(0) = 0, and
B1,x = |B1|, (1.72)
B1,y = 0, (1.73)
B1,z = 0. (1.74)
If φrot = π
2
, cos( π
2
) = 0, sin( π
2
) = 1, and
B1,x = 0, (1.75)
B1,y = |B1|, (1.76)
B1,z = 0. (1.77)
If φrot = π, cos(π) = −1, sin(π) = 0, and
B1,x = −|B1|, (1.78)
B1,y = 0, (1.79)
B1,z = 0, (1.80)
and so on.
The typical convention is to choose φrot = π for nuclei with γ > 0 and φrot = 0 for nuclei with γ < 0. Then, the nutation frequency
is ω1 = +γB1 = +|γ|B1 (opposite convention to the precession frequency!) for nuclei with γ > 0 and ω1 = −γB1 = +|γ|B1 (the same
convention as the precession frequency) for nuclei with γ < 0.
1.5.6 Chemical shift tensor
The chemical shift tensor in its principal frame can be also written as a sum of three simple matrices, each multiplied by one characteristic
constant:


δXX 0 0
0 δY Y 0
0 0 δZZ

 = δi


1 0 0
0 1 0
0 0 1

 + δa


−1 0 0
0 −1 0
0 0 2

 + δr


1 0 0
0 −1 0
0 0 0

 , (1.81)
where
δi =
1
3
Tr{δ} =
1
3
(δXX + δY Y + δZZ ) (1.82)
is the isotropic component of the chemical shift tensor,
δa =
1
3
∆δ =
1
6
(2δZZ − (δXX + δY Y )) (1.83)
is the axial component of the chemical shift tensor (∆δ is the chemical shift anisotropy), and
δr =
1
3
ηδ∆δ =
1
2
(δXX − δY Y ) (1.84)
is the rhombic component of the chemical shift tensor (ηδ is the asymmetry of the chemical shift tensor).
The chemical shift tensor written in its principle frame is relatively simple, but we need its description in the laboratory coordinate
frame. Changing the coordinate systems represents a rotation in a three-dimensional space, as described in Section 1.5.3. Equations
describing such a simple operation are relatively complicated. On the other hand, the equations simplify if B0 deﬁnes the z axis of the
coordinate frame (i.e., B0,z = B0 and B0,x = B0,y = 0):
Be = δiB0


0
0
1

 + δaB0


3 sin ϑ cos ϑ cos ϕ
3 sin ϑ cos ϑ sin ϕ
3 cos2 ϑ − 1

 + δrB0


−(2 cos2 χ − 1) sin ϑ cos ϑ cos ϕ + 2 sin χ cos χ sin ϑ sin ϕ
−(2 cos2 χ − 1) sin ϑ cos ϑ sin ϕ − 2 sin χ cos χ sin ϑ cos ϕ
+(2 cos2 χ − 1) sin2 ϑ

 . (1.85)
The ﬁrst, isotropic contribution does not change upon rotation (it is a scalar). The second, axial contribution, is insensitive to the
rotation about the symmetry axis Z, described by χ. Rotation of the chemical shift anisotropy tensor from its principal frame to the
laboratory frame can be also described by orientation of Z in the laboratory frame:
1.5. SUPPORTING INFORMATION 33
δa


−1 0 0
0 −1 0
0 0 2

 −→ δa


3Z2
x − 1 3ZxZy 3ZxZz
3ZxZy 3Z2
y − 1 3ZyZz
3ZxZz 3ZyZz 3Z2
z − 1

 , (1.86)
where Zx = sin ϑ cos ϕ, Zy = sin ϑ sin ϕ, and Zz = cos ϑ.
1.5.7 Oﬀset eﬀects
The presence of electrons makes NMR a great method for chemical analysis. The measured precession frequency depends not only on the
type of nucleus (e.g. 1H) but also on the electronic environment: frequencies of protons in diﬀerent chemical moieties diﬀer and can be used
to identify chemical groups in organic molecules. But how do the electrons inﬂuence the physical description of the nuclear magnetization?
The eﬀect of the isotropic component of the chemical shift on the precession frequency is simply introducing a small correction constant
1 + δ modifying γ:
ω0 = −γB0 → ω0 = −γ(1 + δ)B0. (1.87)
The trouble is that the correction is diﬀerent for each proton (or carbon etc.) in the molecule. Therefore, the frequency of the radio
waves can match ω0 = −γ(1 + δ)|B0| only for one proton in the molecule. For example, if the radio wave resonate with the frequency of
the methyl proton in ethanol, it cannot resonate with the frequency of the proton in the OH or CH2 group. In the rotating coordinate
frame, only magnetization of the methyl protons rotates about ω1 = γ(1 + δmethylB1 ≈ γB1. Magnetizations of other protons rotate about
other axes (Figure 1.11). Such rotations can be described by eﬀective angular frequencies
ωeﬀ = ω1 + Ω, (1.88)
where
Ω = ω0 − ωrot = ω0 − (−ωradio) = ω0 + ωradio (1.89)
is the angular frequency oﬀset. As any vector in a 3D space, ωeﬀ is characterized by three parameters: magnitude ωeﬀ , inclination ϑ,
and azimuth ϕ.
The magnitude of the eﬀective frequency is
ωeﬀ = ω2
1 + Ω2. (1.90)
The inclination can be calculated from
tan ϑ =
ω1
Ω
. (1.91)
The azimuth is given by the phase of B1 (ϕ = ϕrot in a single-pulse experiment).
As a result of the chemical shift, only the magnetization of the nucleus with Ω = 0 (methyl protons in our case) rotates along the
”meridian” in the rotating coordinate system (Figure 1.11 left). Magnetizations of other protons move in other circles (Figure 1.11 right).
Therefore, if the radio transmitter is switched oﬀ when the methyl magnetization is pointing horizontally (and starts to rotate around the
”equator” with the precession frequency of methyl protons), vectors of magnetizations of other protons point in diﬀerent directions, and
start to precess on cones with diﬀerent inclinations and with diﬀerent initial phases. Such eﬀects, known as the oﬀset eﬀects, inﬂuence the
measured signal.8
The discussed motion of the magnetization vector M during irradiation is described by the following equations
dMx
dt
= −ΩMy + ω1 sin ϕMz, (1.92)
dMy
dt
= +ΩMx − ω1 cos ϕMz, (1.93)
dMz
dt
= −ω1 sin ϕMx + ω1 cos ϕMy, (1.94)
where ϕ is the azimuth of ωeﬀ . The equation can be written in a compact form as
dM
dt
= ωeﬀ × M. (1.95)
8
The result is the same as if apparent eﬀective ﬁelds of the magnitude Beﬀ = B2
1 + (Ω/γ)2 were applied in the direction in the
directions of ωeﬀ . The apparent eﬀective ﬁeld Beﬀ is often used to describe the oﬀset eﬀects.
34
M
B1
M
−Ω/γ
B1
Beﬀ
Figure 1.11: Evolution of the magnetization vectors with precession frequency exactly matching the used radio frequency (left) and
slightly oﬀ-resonance (right). The evolution is shown in a coordinate frame rotating with ωrot = −ωradio.
1.5.8 Evolution of magnetization in B0
Eqs. 1.92–1.94 are easy to solve in the absence of B1 (i.e., after turning oﬀ the radio waves):
dMx
dt
= −ΩMy (1.96)
dMy
dt
= ΩMx (1.97)
dMz
dt
= 0 (1.98)
The trick is to multiply the second equation by i and add it to the ﬁrst equation or subtract it from the ﬁrst equation.
d(Mx + iMy)
dt
= Ω(−My + iMx) = +iΩ(Mx + iMy) (1.99)
d(Mx − iMy)
dt
= Ω(−My − iMx) = −iΩ(Mx − iMy). (1.100)
Each diﬀerential equation can be solved easily using the standard procedure. The results are
Mx + iMy = C+e+iΩt
(1.101)
Mx − iMy = C−e−iΩt
, (1.102)
where the integration constants C+ = Mx(0) + iMy(0) = M2
x(0) + M2
y (0)eiφ0 and C− = Mx(0) − iMy(0) = M2
x(0) + M2
y (0)e−iφ0
are given by the initial phase φ0 of M in the coordinate system (in our case, t = 0 is deﬁned by switching oﬀ the radio waves):
Mx + iMy = M2
x(0) + M2
y (0)e+i(Ωt+φ0)
= M2
x(0) + M2
y (0)(cos(Ωt + φ0) + i(sin(Ωt + φ0)) (1.103)
Mx − iMy = M2
x(0) + M2
y (0)e−i(Ωt+φ0)
= M2
x(0) + M2
y (0)(cos(Ωt + φ0) − i(sin(Ωt + φ0)), (1.104)
1.5. SUPPORTING INFORMATION 35
Mx = M2
x(0) + M2
y (0) cos(Ωt + φ0) (1.105)
My = M2
x(0) + M2
y (0) sin(Ωt + φ0), (1.106)
where setting t = 0 shows that
tan φ0 =
sin φ0
cos φ0
=
My(0)
Mx(0)
. (1.107)
In order to obtain φ0 and M2
x(0) + M2
y (0), we must ﬁrst solve Eqs. 1.92–1.94. The solution, presented in Section 1.5.9, is
Mx(0) = M0(1 − cos(ωeﬀ τp)) sin ϑ cos ϑ, (1.108)
My(0) = M0 sin(ωeﬀ τp) sin ϑ, (1.109)
Mz(0) = M0(cos2
ϑ + cos(ωeﬀ τp) sin2
ϑ), (1.110)
where M0 is the magnitude of the bulk magnetization in the thermodynamic equilibrium, τp is duration of irradiation by the radio
waves, and tan ϑ = ω1/Ω.
1.5.9 Evolution of magnetization in B0 + B1
On one hand, Eqs. 1.92–1.94 are substantially more diﬃcult to solve than Eqs. 1.96–1.98. On the other hand, both sets of equations
(Eqs. 1.92–1.94 and Eqs. 1.96–1.98) describe the same physical process, rotation about a given axis: ωeﬀ in Eqs. 1.92–1.94 and Ω Eqs.
1.96–1.98. This suggests that Eqs. 1.92–1.94 can be solved easily in a coordinate system with the z axis deﬁned by the direction of ωeﬀ .
The procedure is straightforward. First, the direction of the vector Meq is expressed in a coordinate system with the axis z given by
ωeﬀ , using Eq. 1.36 (or Eq. 1.38). The angles ϕ and ϑ in Eq. 1.36 correspond to the azimuth ϕ and inclination ϑ of ωeﬀ , respectively, the
value χ is arbitrary because we do not need to specify the x axis of the new laboratory frame. Setting χ = 0,


Meq
x
Meq
y
Meq
z

 =


1 0 0
0 1 0
0 0 1




cos ϑ 0 − sin ϑ
0 1 0
sin ϑ 0 cos ϑ




cos ϕ sin ϕ 0
− sin ϕ cos ϕ 0
0 0 1




0
0
M0

 . (1.111)
Second, the evolution of Meq is described as a rotation about z by an angle ωeﬀ τp. Using Eq. 1.32,


Mx
My
Mz

 =


cos ωeﬀ τp − sin ωeﬀ τp 0
sin ωeﬀ τp cos ωeﬀ τp 0
0 0 1




Meq
x
Meq
y
Meq
z

 . (1.112)
Third, the components of the magnetization vector at the end of the pulse are expressed in the original coordinate system (i.e., in the
rotating frame)


Mx
My
Mz

 =


cos ϕ − sin ϕ 0
sin ϕ cos ϕ 0
0 0 1




cos ϑ 0 sin ϑ
0 1 0
− sin ϑ 0 cos ϑ




1 0 0
0 1 0
0 0 1




Mx
My
Mz

 . (1.113)
The whole procedure can be written in a single equation as


Mx
My
Mz

 =


cos ϕ − sin ϕ 0
sin ϕ cos ϕ 0
0 0 1




cos ϑ 0 sin ϑ
0 1 0
− sin ϑ 0 cos ϑ




cos ωeﬀ τp − sin ωeﬀ τp 0
sin ωeﬀ τp cos ωeﬀ τp 0
0 0 1




cos ϑ 0 − sin ϑ
0 1 0
sin ϑ 0 cos ϑ




cos ϕ sin ϕ 0
− sin ϕ cos ϕ 0
0 0 1




0
0
M0

 .
(1.114)
For example, for ϕ = φrot = 0


Mx
My
Mz

 =


M0(1 − cos(ωeﬀ τp)) sin ϑ cos ϑ
M0 sin(ωeﬀ τp) sin ϑ
M0(cos2 ϑ + cos(ωeﬀ τp) sin2 ϑ)

 . (1.115)
36
M
B1
M
−Ω/γ
B1
Beﬀ
Figure 1.12: Evolution of the magnetization vectors with precession frequency exactly matching the used radio frequency (left) and
with a frequency oﬀset Ω (right), for ω1 = Ω/
√
15. If ω1 rotates magnetization of the former nucleus by 90 ◦, then ωeﬀ =
√
1 + 15Ω = 4Ω
rotates magnetization of the latter nucleus by 4×90 ◦ = 360 ◦, i.e., by the full circle. The evolution is shown in a coordinate frame rotating
with ωrot = −ωradio. In both cases, magnetization rotates about the thick purple arrow with the angular frequency proportional to the
length of the arrow.
1.5.10 Selective pulses
As discussed in Section 1.5.7, magnetization of various nuclei after applying a 90◦ pulse to the equilibrium distribution of magnetic moments
depends on their frequency oﬀsets Ω. Therefore, Mx, My, and Mz in Eq. 1.115 are in general functions of Ω (hidden in ωeﬀ = ω1 + Ω). In
other words, the same radio-wave pulse will rotate magnetization of diﬀerent nuclei diﬀerently, depending on their Ω. For a certain ratio
of B1 to −Ω/γ, the magnetization makes a full circle and returns to the original direction along B0. It is therefore possible to chose such
value of ω1 ≈ γB1 so that magnetization of one nucleus (with precession frequency resonating with the radio wave frequency) is ﬂipped by
90 ◦ (Figure 1.12) or 180 ◦ (Figure 1.13), while magnetization of another nucleus (oﬀset by Ω) is practically unaﬀected, being returned to
the original direction.
The described selective manipulation of magnetic moments of diﬀerent nuclei is rather limited. We can choose only which nucleus is
rotated (e.g. by 90 ◦) and which nucleus is not aﬀected. Magnetization of other nuclei will end up in some direction M given by the value
of Ω of the given nucleus. Such dependence of M on Ω is known as excitation proﬁle (or magnetization proﬁle in general, for other pulses
and other initial directions of M). For a single pulse, the excitation proﬁle is an explicitly deﬁned mathematical function with only one
variable parameter B1.
Variability of the selective manipulations can be increased by applying consecutively more pulses with diﬀerent B1. Then, the
excitation proﬁle (dependence of M on Ω) depends on the actual B1 values of all pulses. Pulses that selectively inﬂuence relatively broad
ranges of frequencies are useful in advanced NMR experiments, selective excitation of a narrow frequency range is critically important in
magnetic resonance imaging (see Section 13.4.5). There are several algorithms to design a series of pulses that provides a desired excitation
proﬁle. Here we comment only one of them. It is a classical approach (Shinnar–Le Roux algorithm), described by Pauly et al. in IEEE
Transactions on Medical Imaging, 10 (1991) 53–65.
Magnetic moments are irradiated by a series of short radio-wave pulses. Duration of each pulse is ∆t and B1 varies. The pulses rotate
M from the initial orientation (e.g. Meq). The rotation is expressed as described in Section 1.5.4 in terms of spinors and 2 × 2 matrices.
During the j-th pulse, the magnetization rotates by Ω∆t about the z axis and by −ω1,j = γB1,j about the direction of B1,j in the xy
plane. The former rotation can be described as
e−i Ω∆t
2 0
0 e+i Ω∆t
2
=
z
1
2 0
0 z− 1
2
. (1.116)
According to Eqs.1.57–1.59, rotation about B1,j is described by the matrix
αn,j −β∗
n,j
βn,j α∗
n,j
(1.117)
1.5. SUPPORTING INFORMATION 37
M
B1
M
−Ω/γ
B1
Beﬀ
Figure 1.13: Evolution of the magnetization vectors with precession frequency exactly matching the used radio frequency (left) and
with a frequency oﬀset Ω (right), for ω1 = Ω/
√
3. If ω1 rotates magnetization of the former nucleus by 180 ◦, then ωeﬀ =
√
1 + 3Ω = 2Ω
rotates magnetization of the latter nucleus by 2 × 180 ◦ = 360 ◦, i.e., by the full circle. The evolution is shown in a coordinate frame
rotating with ωrot = −ωradio. In both cases, magnetization rotates about the thick purple arrow with the angular frequency proportional
to the length of the arrow.
with the coeﬃcients
αn,j = cos
ω1,j∆t
2
− i
ω1z,j
|ω1,j|
sin
ω1,j∆t
2
= cos
ω1,j∆t
2
= Cj, (1.118)
βn,j = −i
ω1x,j
|ω1,j|
+ i
ω1y,j
|ω1,j|
sin
ω1,j∆t
2
= −ie−iϕj sin
ω1,j∆t
2
= Sj. (1.119)
In analogy to Eq. 1.63, rotation by each pulse is given by
αj −β∗
j
βj α∗
j
=
Cj −S∗
j
Sj Cj
z
1
2 0
0 z− 1
2
. (1.120)
We can deﬁne the parameters αj, βj by noticing that α0 = 1 and β0 = 0 (i.e., no rotation in the initial state) and by calculating the
recursion
αj
βj
=
αj −β∗
j
βj α∗
j
=
Cj −S∗
j
Sj Cj
z
1
2 0
0 z− 1
2
αj−1
βj−1
= z
1
2
Cj −S∗
j
Sj Cj
1 0
0 z−1
αj−1
βj−1
. (1.121)
Absorbing zj/2 into the parameters,
αjz
j
2
βjz
j
2
=
αj −β∗
j
βj α∗
j
=
Cj −S∗
j
Sj Cj
1 0
0 z−1
αj−1z
1
2 z
j−1
2
βj−1z
1
2 z
j−1
2
=
Cj −S∗
j
Sj Cj
1 0
0 z−1
αj−1z
j
2
βj−1z
j
2
. (1.122)
Labeling αjzj/2 = Aj and βjzj/2 = Bj, we can write
A0
B0
=
1
0
, (1.123)
A1
B1
=
C1
S1
, (1.124)
38
A2
B2
=
C2C1 − S∗
2 S1z−1
S2C1 − C2S1z−1 , (1.125)
A3
B3
=
C3C2C1 − (C3S∗
2 S1 + S∗
3 S2C1)z−1 − S∗
3 C2S1z−2
S3C2C1 − (S3S∗
2 S1 − C3S2C1)z−1 − C3C2S1z−2 , (1.126)
and so on. If we deﬁne the amplitudes and phases of B1 of all pulses, deﬁning the parameters Cj, Sj, we can express all Aj, Bj and
calculate the ﬁnal dependence of the magnetization on Ω (hidden in z).
Pauly et al. describe design of a selective pulse, which is the opposite task: to calculate amplitudes and phases of B1 of all pulses
from a set of polynomials Aj, Bj that deﬁne the desired excitation proﬁle (dependence of the magnetization on Ω).9 As the ﬁrst step, the
recursion equation
Cj −S∗
j z−1
Sj Cjz−1
Aj−1z
j
2
Bj−1z
j
2
=
Aj
Bj
(1.127)
(Eq. 1.122) is inverted by multiplying both sides from left by the inversion matrix
Cj S∗
j
−Sjz Cjz
Cj −S∗
j z−1
Sj Cjz−1
Aj−1
Bj−1
=
Aj−1
Bj−1
=
Cj S∗
j
−Sjz Cjz
Aj
Bj
=
CjAj + S∗
j Bj
−SjzAj + CjzBj
Aj
Bj
. (1.128)
This matrix equation represents a set of two equations. The bottom one
Bj−1 = CjzBj − SjzAj (1.129)
Bj−1z−1
= CjBj − SjAj (1.130)
provides a further clue. All polynomials start by constant terms, and continue with terms with z−1, z−2. However, the left-hand side
of Eq. 1.130 is multiplied by z−1 and thus does not have any constant term, it starts with the terms with z−1. Therefore, the right-hand
side, CjBj − SjAj, must not have any constant term either. If we label the constant terms aj for Aj and bj for Bj, the requirement of no
constant term can be written as
Cjbj − Sjaj = 0 ⇒
bj
aj
=
Sj
Cj
=
−ie−iϕj sin(ω1,j∆t/2)
cos(ω1,j∆t/2)
=
ie−iϕj sin(γB1,j∆t/2)
cos(γB1,j∆t/2)
. (1.131)
Solving this equation yields expressions deﬁning amplitude and phase of B1 for each j. Examples of excitation proﬁles of several pulses
are shown in Table 1.1.
B1,j =
2
γ∆t
arctan
bj
aj
(1.132)
e−iϕj =
−ibj/aj
tan(γB1,j∆t/2)
. (1.133)
9
Ideally, the pulse should rotate magnetization by 90 ◦ for Ω in a small interval of oﬀsets and have no eﬀect outside this interval. Pauly
et al. discuss how a polynomial approximation of the desired excitation proﬁle can be found. Here, we just note that our polynomials
consist of periodic functions and any shape can be approximated by a suﬃcient number of such periodic functions.
1.5. SUPPORTING INFORMATION 39
Table 1.1: Dependence of excitation eﬃciency on frequency oﬀsets for various amplitude modulations of radio wave pulses. The pulses
are shown as histograms in the second column. The lengths and amplitudes (relative |B1| values) are plotted in the real ratios, blue and
red correspond to the phase of 0◦ and 180◦, respectively. The eﬃciency of excitation is plotted in blue, ranging from zero (Mz = Meq,
Mx = My = M2
x + M2
y = 0, magnetization vector in the z direction, no excitation) to one (Mz = 0, M2
x + M2
y = Meq, magnetization
vector in the xy plane). The deviations of the x and y components of the magnetization vector from the desired −y direction are plotted
in red (in radians). The range of the frequency oﬀsets is −30 kHz to +30 kHz. The lengths and amplitude of the hard rectangular pulse
correspond to 10 µs and |ω1| = 25 kHz. The lengths (64.5 µs) and amplitude (|ω1| = 9.675 kHz) of the selective rectangular pulse are chosen
so that the frequency oﬀset of 15 kHz is equal to
√
15|ω1|. The Q5 and EBURP2 pulses consist of 1000 rectangular pulses 0.3 µs long. (the
total length is 300 µs). The amplitudes of the Q5 and EBURP2 pulses were set so that the pulses rotate the magnetization by 90◦ when
applied on resonance. NMR-Sim (Pavel Kessler, Bruker Biospin) was used to calculate the eﬀects of shaped pulses.
Pulse name Amplitude modulation (variation of |B1|) oﬀset-dependent eﬀect
rectangular (hard)
0
1
0
2π
rectangular (selective)
0
1
0
2π
Q5
0
1
0
2π
EBURP2
0
1
0
2π
40
Lecture 2
Relaxation
Literature: A nice introduction is in K9.1 and K9.3, more details can be found in L19 and L20.1–
L20.3.
2.1 Relaxation due to chemical shift anisotropy
The Boltzmann law allowed us to describe the state of the system in the thermal equilibrium, but it
does not tell us how is the equilibrium reached. The processes leading to the equilibrium states are
known as relaxation. Relaxation takes places e.g. when the sample is placed into a magnetic ﬁeld
inside the spectrometer or after excitation of the sample by radio wave pulses.
Spontaneous emission is completely ineﬃcient (because energies of nuclear magnetic moments
in available magnetic ﬁelds are very small). Relaxation in NMR is due to interactions with local
ﬂuctuating magnetic ﬁelds in the molecule. One source1
of ﬂuctuating ﬁelds is the anisotropy of
chemical shift, described by the axial and rhombic components of the chemical shift tensor. The
chemical shift tensor is given by the distribution of electrons in a molecule. Therefore, its orientation
in a coordinate frame attached to the molecule is ﬁxed. As collisions with other molecules change
orientation of the observed molecule, the isotropic component of the chemical shift tensor does not
change because it is spherically symmetric (cf. Figure 1.6). However, contributions to the local ﬁelds
described by the axial and rhombic components ﬂuctuate even if the constants δa and δr do not
change because the axial and rhombic parts of the chemical shift depend on the orientation of the
molecule (Figure 2.1).
Here, we introduce the basic idea by analyzing the eﬀects of ﬂuctuating magnetic ﬁelds in a
classical manner. Obviously, it is not possible to describe exactly random motions of each magnetic
moment. However, it is possible to describe statistically the eﬀect of random ﬂuctuations of magnetic
ﬁelds on the bulk magnetization. For the simplest model of molecules (rigid spherical particles in an
isotropic solvent), the ﬁnal equation is surprisingly simple. However, the derivation is very tedious.
Therefore, we limit our analysis to the axially symmetric chemical shift tensor, and divide it to two
steps.
1
There are stronger sources of ﬂuctuating ﬁelds in real molecules, but we limit our discussion to the chemical shift
anisotropy in this lecture. We extend our analysis to other sources later, when we introduce quantum mechanical
description of NMR.
41
42
Figure 2.1: Visualization of reorientation of the anisotropic contribution to the chemical shift tensor as a result of
tumbling (rotational diﬀusion) of the molecule. Positive and negative values are plotted in blue and red, respectively.
2.2 Adiabatic contribution to relaxation
We start by the analysis of adiabatic contributions to relaxation. In physics, the term adiabatic is
used for processes that do not change energy of the studied system.
In order to distinguish ﬂuctuations that result in adiabatic and nonadiabatic contributions to
relaxation, we recall how magnetic moments move in the strong external magnetic ﬁeld B0, deﬁning
the z axis of our coordinate system. We have learnt in the previous lecture that B0 causes rotation of
magnetic moments about the z axis. Therefore, it does not change distribution of the z-components
of magnetic moments (components parallel to B0). The energy of magnetic moments is given by
−µ · B0 = −µzB0, i.e., it depends only on the component of the magnetic moment parallel to B0.
Now we consider eﬀects of additional ﬁelds. In this lecture, we analyze only eﬀects of very small
ﬁelds of moving electrons, introduced in the previous lecture and labeled Be. We have learnt that
horizontal ﬁelds, like Be,x and Be,y have no overall eﬀect on orientations of magnetic moments, unless
their oscillate with the precession frequency of the magnetic moments. As the molecules change
their orientations, components Be,x and Be,y ﬂuctuate. However, the rate of ﬂuctuations is in general
diﬀerent from the precession frequency because there is no reason why the molecular collisions causing
the ﬂuctuations should rotate our molecule with the same rate as the precession of magnetic moments
in our magnet. Only very rarely and for a short time, the rate of molecular rotation may coincide
with the precession frequency by accident.
In addition to Be,x and Be,y, molecular collisions also change Be,z. The vertical ﬁeld Be,z adds
to B0, changing the precession frequency −γBz, but not inﬂuencing µz. The ﬂuctuations of Be,z
are stochastic because the molecular collisions are random. Therefore, the average of the ﬂuctuating
ﬁeld Be,z is zero and the average vertical ﬁeld remains B0.
We can conclude that (i) ﬂuctuations of Be,z does not change the energy E = −µzBz because
they do not change µz or the overall Bz, (ii) Be,x and Be,y do not have any eﬀect except for rare
moments when the rate of molecular rotation coincides with the precession frequency. Most of the
time, magnetic moments do not exchange energy with the environment and the process is adiabatic.
We analyze such permanent adiabatic inﬂuence of Be,z in this section and complete the description
by including the momentary contributions of Be,x and Be,y in Section 2.3.
2.2. ADIABATIC CONTRIBUTION TO RELAXATION 43
As the vertical ﬁeld rotates the magnetic moments about the z axis, it changes µx and µy.
Therefore, stochastic ﬂuctuations of the vertical magnetic ﬁeld B0 + Be,z randomize distribution of
the x and y components. In other words, the adiabatic contributions to relaxation destroy coherence
of the x and y components of magnetic moments (distributed as shown in the right panel of Figure 1.4)
that was created by the radio wave pulse at the beginning of the NMR experiment.
How are the vertical ﬂuctuations of the magnetic ﬁeld related to the molecular motions? As the
molecules rotate and the anisotropic components chemical shift tensors rotate with them (Figure 2.1),
the vertical magnetic ﬁelds (B0 + Be,z) ﬂuctuate.2
These ﬂuctuations are random and independent
for diﬀerent molecules because individual molecules in solution tumble randomly (due to collisions
with other molecules) and independently. Therefore, the frequency of precession of magnetic moments
in individual molecules, given by B0 + Be,z, also ﬂuctuates (randomly and independently for
each molecule). As a consequence, the magnetic moments in individual molecules do not precess
completely coherently (with the same frequency) and their distribution shown in Figure 1.4 is slowly
randomized. The cyan arrow in Figure 1.4, representing the bulk magnetization M of the given
distribution of magnetic moments, shrinks but stays in the xy plane, as long as only adiabatic relaxation
(ﬂuctuations along B0) are considered. Note that we observe two processes: rotation of the
cyan arrow (M) in the xy plane with the (average) precession frequency, and shrinking of the cyan
arrow due to the adiabatic relaxation.
In order to describe the adiabatic relaxation quantitatively, we express the precession frequency
ωz in terms of the components of the chemical shift tensor and angles3
describing its orientation in
the laboratory coordinate frame, depending on the orientation of the given molecule in the sample
(Eq 1.8):
ωz = −γ(B0 + Be,z) = −γB0(1 + δi) − γB0δa(3 cos2
ϑ − 1). (2.1)
The analysis presented in Section 2.6.1 shows that the coherence disappears (the cyan arrow
shrinks) with a rate constant (called R0 in this text) proportional to the time integral of the time
correlation function, i.e., of a mathematical function describing how quickly an ensemble of molecules
(and consequently the chemical shift tensor attached to it) looses memory of its original orientation
(Eq. 2.51).
R0 = (γB0δa)2
∞ˆ
0
(3 cos2 ϑ(0) − 1)(3 cos2 ϑ(t) − 1)dt, (2.2)
where the horizontal bar indicates an average value for all molecules in the sample and ϑ(0)
describes orientation of the chemical shift tensor at t = 0. Note that statistics play the key role
here: the whole analysis relies on the fact that although the product (3 cos2
ϑ(0) − 1)(3 cos2
ϑ(t) − 1)
changes randomly and diﬀerently for each molecule (and therefore cannot be described), the value of
the time correlation function (3 cos2 ϑ(0) − 1)(3 cos2 ϑ(t) − 1) is deﬁned statistically. If the structure
of the molecule does not change (rigid body rotational diﬀusion), which is the case we analyze, the
2
As the molecule rotates, Be,x and Be,y of course ﬂuctuates too. However, ﬂuctuating Be,x and Be,y have only the
non-adiabatic eﬀect, discussed in Section. 2.3
3
We need only one angle, ϑ, for our analysis of adiabatic contribution to relaxation.
44
analytical form of (3 cos2 ϑ(0) − 1)(3 cos2 ϑ(t) − 1) can be derived. The simplest analytical form of
the time correlation function is derived from the rotational diﬀusion equation in Section 2.6.4. The
derivation shows that the time correlation function for spherically symmetric rotational diﬀusion is
a single-exponential function:
3
2
cos2 ϑ(0) −
1
2
3
2
cos2 ϑ(t) −
1
2
=
1
5
e−t/τc
dt =
1
5
e−6Drott
, (2.3)
where τc is the rotational correlation time and Drot
is the rotational diﬀusion coeﬃcient, given
by the Stokes’ law
kBT
8πη(T)r3
, (2.4)
where r is the radius of the spherical particle, T is the temperature, and η(T) is the dynamic
viscosity of the solvent, strongly dependent on the temperature.4
Analytical solutions are also available (but more diﬃcult to derive) for axially symmetric and
asymmetric rotational diﬀusion, with the time correlation function in a form of three- and ﬁveexponential
functions, respectively.
For the spherically symmetric rotational diﬀusion, the rate constant of the loss of coherence can
be calculated easily:
R0 =
4
5
(γB0δa)2
∞ˆ
0
e−t/τc
dt =
4
5
(γB0δa)2
τc =
4
5
(γB0δa)2 1
6Drot
. (2.6)
2.3 Including non-adiabatic contribution to relaxation
A much more complex analysis of the non-adiabatic contributions to relaxation, consequences of magnetic
ﬁelds ﬂuctuations perpendicular to B0, is outlined in Section 2.6.5. Fluctuations perpendicular
to B0 are also results of molecular tumbling, but now we are interested in how Be,x and Be,y ﬂuctuate
due to the reorientation of the chemical shift tensor. Be,x and Be,y have the same direction as the
magnetic ﬁeld of the radio waves used to rotate the magnetization form the equilibrium orientation
(in the z direction) to the xy plane. Accidentally, the molecule may tumble for a short time with a
rate close to the precession frequency of the magnetic moments. The resulting perpendicular ﬂuctuations
then act on the magnetic moments in a similar manner as the radio waves, i.e. rotate them
about a horizontal axis. This of course changes the distribution of the z components of the magnetic
moments and changes their energy in B0 (exchanges the potential magnetic energy of the magnetic
moments with the kinetic rotational energy of molecules). However, there is a fundamental diﬀerence
4
Dynamic viscosity of water can be approximated by
η(T) = η0 × 10T0/(T −T1)
, (2.5)
where η0 = 2.414 × 10−5
kg m−1
s−1
, T0 = 247.8 K, and T1 = 140 K (Al-Shemmeri, T., 2012. Engineering Fluid
Mechanics. Ventus Publishing ApS. pp. 1718.).
2.3. INCLUDING NON-ADIABATIC CONTRIBUTION TO RELAXATION 45
between the ﬂuctuations and the radio waves. The radio waves coherently rotate magnetic moments
in all molecules, but the ﬂuctuating ﬁelds are diﬀerent in the individual molecules. And because the
ﬂuctuations are random, they randomly change distribution of magnetic moments until it returns
to the equilibrium distribution. This is what happens after a sample is placed in the magnetic ﬁeld
of the spectrometer, and this is also what starts to happen immediately after the magnetization is
tilted from the z direction by the radio waves.
The analysis in Section 2.6.5 provides values of two relaxation rates, (i) of the longitudinal relaxation
rate R1 describing how fast the z component of the bulk magnetization returns to its equilibrium
value, and (ii) of the transverse relaxation rate R2 describing how fast the x and y components of the
bulk magnetization decay to zero. Note that the longitudinal and transverse relaxation are diﬀerent
processes. The return of Mz to its equilibrium value is identical with the process of restoring the
equilibrium distribution of magnetic moments. However, the transverse relaxation has two sources,
the non-adiabatic return to the equilibrium distribution of magnetic moments (with the orientation
along B0 being slightly preferred) and the adiabatic loss of coherence. For large molecules, the loss
of coherence is much faster than the return to the equilibrium distribution, which makes R2 R1.
Quantitatively,
R1 = 3 (γB0δa)2 1
2
J(ω0) +
1
2
J(−ω0) ≈ 3 (γB0δa)2
J(ω0), (2.7)
where
J(ω0) =
∞ˆ
−∞
3
2
cos2(θ(0)) −
1
2
3
2
cos2(θ(t)) −
1
2
cos(ω0t). (2.8)
The function J(ω) is known as the spectral density function.
Note that
• The deﬁnition of R1, describing solely the non-adiabatic eﬀects of ﬂuctuations perpendicular to
B0 includes the same time correlation function as the deﬁnition of R0, describing the adiabatic
eﬀects of ﬂuctuations parallel to B0. This is possible in isotropic solutions, where no orientation
of the molecule is preferred. Then the distribution of the orientation of the molecules in the x
or y direction should be the same as in the z direction and the same time correlation function
can be used. Do not get confused! The molecules may be oriented isotropically even if their
tumbling is anisotropic. The anisotropic tumbling (rotational diﬀusion) is a result of a nonspherical
shape of the molecule, whereas anisotropic orientation is a result of an external force
preferring certain orientation of the molecules. The magnetic ﬁeld represents such a force, but
this force is very small for diamagnetic molecules and can be often neglected when describing
orientations of the molecules.5
• The deﬁnition of R1, unlike that of R0, includes also the value of the (average) precession frequency
ω0. This reﬂects the fact that the ﬂuctuations perpendicular to B0 rotate the magnetic
5
Note, however, that the magnetic ﬁeld cannot be neglected when describing the return of the magnetization to
the equilibrium, as discussed in Section 2.6.5.
46
moments about a horizontal axis only if their rate matches the precession frequency (resonance
condition).
• The term in the integral deﬁning R0, lacking the cosine function of ω0, can be also written as a
value of the spectral function at the zero frequency (zero in the argument converts the cosine
function to unity).
Similarly, R2 is given by
R2 = 2 (γB0δa)2
J(0) +
3
2
(γB0δa)2
J(ω0). (2.9)
Note that
• The ﬁrst term is the adiabatic contribution destroying the coherence.
• The second term is the non-adiabatic contribution, equal to 1
2
R1. The factor of 1
2
reﬂects the
fact that ﬂuctuations in a certain direction inﬂuence only components of magnetic moment
vectors perpendicular to that direction. E.g., ﬂuctuations along the x axis inﬂuence only µy,
but not µx. Therefore, a ﬂuctuation in the x direction that causes some longitudinal relaxation
(described by R1) by altering µz, is only half as eﬀective at causing transverse relaxation
described by R2 (only µy is altered, not µx).
The longitudinal relaxation rate R1, describing the return of Mz to the equilibrium due to the
chemical shift anisotropy in randomly reorienting molecules, and the transverse relaxation rate
R2, describing the decay of magnetization in the xy plane, are given by
R1 =
3
4
b2
J(ω0), (2.10)
R2 =
1
2
b2
J(0) +
3
8
b2
J(ω0), (2.11)
where b = −2γB0δa.
2.4 Internal motions, structural changes
So far, we analyzed only the rigid body motions of molecules, assuming that the structures of
molecules are rigid. What happens if the structure of the molecule changes? Let us ﬁrst assume
that the structural changes are random internal motions which change orientation of the chemical
shift tensor relative to the orientation of the whole molecule, but do not aﬀect the size or shape of
the tensor. Then, R0 can be derived in the same manner as in the absence of molecular motions
(Eq. 2.36 can be still used) and R0 is still given by Eq. 2.2, but the correlation function is not monoexponential
even if the rotational diﬀusion of the molecule is spherically symmetric. The internal
motions contribute to the dynamics together with the rotational diﬀusion, and in a way that is very
diﬃcult to describe exactly. Yet, useful qualitative conclusions can be made.
2.5. BLOCH EQUATIONS 47
• If the internal motions are much faster than rotational diﬀusion, correlation between 3 cos2
ϑ(0)−
1 and 3 cos2
ϑ(t) − 1 is lost much faster. The faster the correlation decays, the lower is the
result of integration. The internal motions faster than rotational diﬀusion always decrease the
value of R0 (make relaxation slower). Amplitude and rate of the fast internal motions can be
estimated using approximative approaches.
• If the internal motions are much slower than rotational diﬀusion, the rate of the decay of
the correlation function is given by the faster contribution, i.e., by the rotational diﬀusion.
The internal motions much slower than rotational diﬀusion do not change the value of R0
signiﬁcantly. Amplitude and rate of the fast internal motions cannot be measured if the motions
do not change size or shape of the diﬀusion tensor.
If the structural changes alter size and/or shape of the chemical shift tensor,6
parameters δi and δa
vary and cannot be treated as constants. E.g., the parameter δi is not absorbed into the constant (average)
precession frequency (removed by introducing the rotating coordinate frame in Section 2.6.1)
and δi(0)δi(t) contributes to R0 even if it decays much slower than (3 cos2 ϑ(0) − 1)(3 cos2 ϑ(t) − 1).
• Internal motions or chemical processes changing size and/or shape of the chemical shift tensor
may have a dramatic eﬀect on relaxation even if their frequency is much slower than the
rotational diﬀusion of the molecule. If the molecule is present in two inter-converting states
(e.g. in two conformations or in a protonated and deprotonated state), the strongest eﬀect is
observed if the diﬀerences between the chemical shift tensors of the states are large and if the
frequency of switching between the states is similar to the diﬀerence in γB0δi of the states.
Such processes are known as chemical or conformational exchange and increase the value of R0
and consequently R2.
2.5 Bloch equations
The eﬀects of relaxation can be included in the equations describing evolution of the bulk magnetization
(Eqs. 1.92–1.94). The obtained set of equations, known as Bloch equations, provides a general
macroscopic description of NMR for proton and similar nuclei.
dMx
dt
= −R2Mx − ΩMy + ω1 sin ϕMz, (2.12)
dMy
dt
= +ΩMx − R2My − ω1 cos ϕMz, (2.13)
dMz
dt
= −ω1 sin ϕMx + ω1 cos ϕMy − R1(Mz − Meq
z ). (2.14)
(2.15)
6
Examples of such changes are internal motions changing torsion angles and therefore distribution of electrons, or
chemical changes (e.g. dissociation of protons) with similar eﬀects.
48
HOMEWORK
Derive the rate constant R0 (Section 2.6.1).
2.6. SUPPORTING INFORMATION 49
2.6 SUPPORTING INFORMATION
2.6.1 Loss of coherence
Motion of a magnetic moment in a magnetic ﬁled is described classically as (cf. Eq. 1.95)
dµ
dt
= ω × µ = −γB × µ, (2.16)
or for individual components:
dµx
dt
= ωyµz − ωzµy, (2.17)
dµy
dt
= ωzµx − ωxµz, (2.18)
dµz
dt
= ωxµy − ωyµx. (2.19)
Solving a set of three equations is not so easy. Therefore, we start with a simpliﬁed case. Remember what we learnt when we tried
to rotate the magnetization away from the z direction by magnetic ﬁelds perpendicular to B0, i.e., by ﬁelds with Bx and By components.
Only Bx and By ﬁelds rotating with the frequency equal to the precession frequency of individual magnetic moments (Larmor frequency)
have the desired eﬀect. Let us start our analysis by assuming that the molecular motions are much slower than the Larmor frequency.
Under such circumstances, the eﬀects of Be,x and Be,y can be neglected and the equations of motion simplify to
dµx
dt
= −ωzµy = γBzµy (2.20)
dµy
dt
= ωzµx = −γBzµx (2.21)
dµz
dt
= 0 (2.22)
Eqs. 2.20–2.22 are very similar to Eqs. 1.96–1.98, so we try the same approach and calculate
dµ+
dt
≡
d(µx + iµy)
dt
= iωz(µx + iµy) = −iγBz(µx + iµy) = −iγBzµ+
(2.23)
According to Eq. 1.85,
Bz = B0 + Be,z = B0(1 + δi + δa(3 cos2
ϑ − 1) + δr(2 cos2
χ − 1) sin2
ϑ). (2.24)
For the sake of simplicity, we assume that the chemical shift tensor is axially symmetric (δr = 0). Then, ωz can be written as
ωz = −γ(B0 + Be,z) = −γB0(1 + δi) − γB0δa(3 cos2
ϑ − 1) = ω0 + bΘ , (2.25)
where
ω0 = −γB0(1 + δi) (2.26)
b = −2γB0δa (2.27)
Θ =
3 cos2 ϑ − 1
2
. (2.28)
This looks ﬁne, but there is a catch here: Eq. 2.23 cannot be solved as easily as we solved 1.96–1.98 because ωz is not constant but
ﬂuctuates in time. The value of ωz is not only changing, it is changing diﬀerently for each molecule in the sample and it is changing in
a random, unpredictable way! Can we solve the equation of motion at all? The answer is ”yes and no”. The equation of motion cannot
be solved for an individual magnetic moment. However, we can take advantage of statistics and solve the equation of motion for the total
magnetization M+, given by the statistical ensemble of magnetic moments.
We start by assuming that for a very short time ∆t, shorter than the time scale of molecular motions, the orientation of the molecule
does not change and Θ remains constant. We try to describe the evolution of µ+ in such small time steps, assuming
∆µ+
∆t
≈
dµ+
dt
≈ i(ω0 + bΘ )µ+
(2.29)
50
µ+
0
ibΘ1∆t
1
µ+
1
ibΘ2∆t
1
µ+
2
ibΘ3∆t
1
µ+
3
ibΘ4∆t
1
µ+
4 · · ·
ibΘk∆t
1
µ+
k
Figure 2.2: Evolution of magnetic moments due to longitudinal (parallel with B0) ﬂuctuations of magnetic ﬁelds. The symbols µ+
0
and µ+
k are connected by 2k possible pathways composed of black and green segments. Each black segment represents multiplication by
one, each green segment represents multiplication by ibΘj ∆t, where j ranges from 1 to k. The product of binomials in Eq. 2.36 is a sum
of 2k terms. In order to obtain one term of the series, we walk along the corresponding pathway and multiply all black and green numbers
written above the individual steps. The pathway composed of the black segments only gives the result of multiplication equal to one, the
pathways containing just one green segment give results of multiplication proportional to ∆t, the pathways containing two green segments
give results of multiplication proportional to (∆t)2, etc. In order to get the complete product in Eq. 2.36, we must walk through all possible
pathways (all possible combinations of the segments) and sum all results of the multiplication.
If the initial value of µ+ is µ+
0 and if the values of ω0, b, Θ during the ﬁrst time step are ω0,1, b1, Θ1, respectively, the value of µ+
after the ﬁrst time step is
µ+
1 = µ+
0 + ∆µ+
1 = µ+
0 + i(ω0,1 + b1Θ1)∆tµ+
0 = [1 + i(ω0,1 + b1Θ1)∆t]µ+
0 . (2.30)
After the second step,
µ+
2 = µ+
1 + ∆µ+
2 = µ+
1 + i(ω0,2 + b2Θ2)∆tµ+
1 = [1 + i(ω0,2 + b2Θ2)∆t][1 + i(ω0,1 + b1Θ1)∆t]µ+
0 . (2.31)
After k steps,
µ+
k = [1 + i(ω0,k + bkΘk)∆t][1 + i(ω0,k−1 + bk−1Θk−1)∆t] · · · [1 + i(ω0,2 + b2Θ2)∆t][1 + i(ω0,1 + b1Θ1)∆t]µ+
0 . (2.32)
If the structure of the molecule does not change, the electron distribution is constant and the size and shape of the chemical shift tensor
described by δi and δa does not change in time. Then, ω0 and b are constant and the only time-dependent parameter is Θ , ﬂuctuating as
the orientation of the molecule (described by ϑ) changes. The parameter ω0 = −γB0(1 + δi) represents a constant frequency of coherent
rotation under such circumstances. The coherent rotation can be removed if we describe the evolution of µ+ in a coordinate frame rotating
with the frequency ω0. The transformation of µ+ to the rotating frame is given by
(µ+
)rot = µ+
e−iω0t
. (2.33)
We also need to express the derivative of (µ+)rot, which is done easily by applying the chain rule:
d(µ+)rot
dt
=
d(µ+e−iω0t)
dt
=
dµ+
dt
e−iω0t
− iω0µ+
e−iω0t
. (2.34)
Substituting dµ+/dt from Eq. 2.29 results in
d(µ+)rot
dt
= i(ω0 + bΘ )µ+
e−iω0t
− iω0µ+
e−iω0t
= ibΘ µ+
e−iω0t
= ibΘ (µ+
)rot. (2.35)
When compared with Eq. 2.29, we see that ω0 disappeared, which simpliﬁes Eq. 2.32 to
(µ+
k )rot = [1 + ibΘk∆t][1 + ibΘk−1∆t] · · · [1 + ibΘ2∆t][1 + ibΘ1∆t](µ+
0 )rot. (2.36)
The process of calculating the product of brackets in Eq. 2.36 is shown schematically in Figure 2.2. The ﬁnal product is
(µ+
k )rot = [1 + ib∆t(Θk + Θk−1 + · · · + Θ1) − b2
∆t2
(Θk(Θk−1 + · · · + Θ2 + Θ1) + · · · + Θ2Θ1) − ib3
∆t3
(. . . ) + · · · ](µ+
0 )rot. (2.37)
We can now return to the question how random ﬂuctuations change µ+. Let us express the diﬀerence between µ+ after k and k − 1
steps:
∆(µ+
k )rot = (µ+
k )rot − (µ+
k−1)rot = [ib∆tΘk − b2
∆t2
Θk(Θk−1 + · · · + Θ1) − ib3
∆t3
(. . . ) + · · · ](µ+
0 )rot. (2.38)
Dividing both sides by ∆t
∆(µ+
k )rot
∆t
= ibΘk − b2
∆tΘk(Θk−1 + · · · + Θ1) − ib3
∆t2
(. . . ) + · · · (µ+
0 )rot =

ibΘk − b2
Θk
k−1
j=1
Θk−j∆t − ib3
∆t2
(. . . ) + · · ·

 (µ+
0 )rot
(2.39)
2.6. SUPPORTING INFORMATION 51
and going back from ∆t to dt (neglecting terms with dt2, dt3, . . . , much smaller than dt), or, more speciﬁcally to dtj (we sum all
terms diﬀering in j, i.e., evaluated at diﬀerent time instants tk − tj, where tk is constant and tj is variable),
d(µ+(tk))rot
dt
=

ibΘ (tk) − b2
tkˆ
0
Θ (tk)Θ (tk − tj)dtj

 (µ+
0 )rot. (2.40)
We see that calculating how ﬂuctuations of Bz aﬀect an individual magnetic moment in time tk requires knowledge of the orientations
of the molecule during the whole evolution (Θ (tk −tj)). However, we are not interested in the evolution of a single magnetic moment, but
in the evolution of the total magnetization M+. The total magnetization is given by the sum of all magnetic moments (magnetic moments
in all molecules). Therefore, we must average orientations of all molecules in the sample. In other words, we should describe Θ using two
indices, k and m, where k describes the time step and m the orientation of the given molecule. Calculation of the evolution of M+ then
should include summation of Θk,m for all k and m, or integration over the angles describing orientations of the molecule in addition to the
time integration. As the magnetic moments move almost independently of the molecular motions, we can average Θ and µ+ separately.
In the case of the axially symmetric chemical shift tensor, the orientations of molecules are given by orientations of the symmetry axes Z
of the chemical shift tensors of the observed nuclei in the molecules, described by the angles ϕ and ϑ. In order to simplify averaging the
orientations, we assume that all orientations are equally probable. This is a very dangerous assumption. It does not introduce any error
in this section, but leads to wrong results when we analyze the eﬀects of ﬂuctuations of magnetic ﬁelds perpendicular to B0!
As the angle ϑ(t) is hidden in the function Θ (t) = (3 cos ϑ2 − 1)/2 in our equation, the ensemble averaging can be written as7
d(M+(tk))rot
dt
=

ib
1
4π
2πˆ
0
dϕ
πˆ
0
Θ (tk) sin ϑdϑ − b2
tkˆ
0
dtj
1
4π
2πˆ
0
dϕ
πˆ
0
Θ (tk)Θ (tk − tj) sin ϑdϑ

 (M+
0 )rot, (2.41)
where ϕ ≡ ϕ(tk) and ϑ ≡ ϑ(tk).
In order to avoid writing too many integration signs, we mark the averaging simply by a horizontal bar above the averaged function:
d(M+(tk))rot
dt
=

ibΘ (tk) − b2
tkˆ
0
Θ (tk)Θ (tk − tj)dtj

 (M+
0 )rot. (2.42)
The average values of Z2
z = cos2 ϑ, of Z2
x = cos2 ϕ sin2 ϑ, and of Z2
y = sin2 ϕ sin2 ϑ must be the same because none of the directions
x, y, z is preferred:
Z2
x = Z2
y = Z2
z . (2.43)
Therefore,
Z2
x + Z2
y + Z2
z = 3Z2
z (2.44)
and
Z2
x + Z2
y + Z2
z = 1 ⇒ Z2
x + Z2
y + Z2
z = 1 ⇒ 3Z2
z − 1 = (3 cos2 ϑ − 1) = 2Θ = 0 ⇒ Θ = 0. (2.45)
It explains why we did not neglect already the b2dt term – we would obtain zero on the right-hand side in the rotating coordinate
frame (this level of simpliﬁcation would neglect the eﬀects of ﬂuctuations and describe just the coherent motions).
We have derived that the equation describing the loss of coherence (resulting in a loss of transverse magnetization) is
d(M+(tk))rot
dt
= −

b2
tkˆ
0
Θ (tk)Θ (tk − tj)dtj

 (M+
0 )rot, (2.46)
where the value of Θ (tk)Θ (tk − tj) is clearly deﬁned statistically (by the averaging described above). Values of Θ (tk)Θ (tk − tj)
can be determined easily for two limit cases:
• tj = 0: If tj = 0, Θ (tk)Θ (tk − tj) = (Θ (tk))2, i.e., Θ (tk) and Θ (tk − tj) are completely correlated.
The average value of Θ (tk)2 is
7
Two integrals in the following equation represent calculation of an average of a function depending on the orientation. Geometrically,
it is summation of the values of the function for individual surface elements (deﬁned by inclination ϑ and azimuth ϕ) of a sphere with the
radius r = 1, divided by the complete surface of the sphere 4π (see Section 1.5.1). Note that the current orientation of each molecule at tk
is described by ϑ(tk) and ϕ(tk), the values ϑ(tj) hidden in the function Θ (tj) describe only history of each molecule. They are somehow
related to ϑ(tk) and ϕ(tk) and therefore treated as an unknown function of ϑ(tk) and ϕ(tk) during the integration.
52
Θ (tk)2 =
1
4
(3 cos2 ϑ − 1)2 =
1
16π
2πˆ
0
dϕ
πˆ
0
dϑ(sin ϑ)(3 cos2
ϑ − 1)2
=
1
8
1ˆ
−1
(3u2
− 1)2
du
=
1ˆ
−1
9u4 − 6u2 + 1
8
du =
9u5 − 10u3 + 5u
40
1
−1
=
1
5
. (2.47)
• tj → ∞: If the changes of orientation (molecular motions) are random, the correlation between Θ (tk) and Θ (tk − tj) is lost
for very long tj and they can be averaged separately: Θ (tk)Θ (tk − tj) = Θ (tk) · Θ (tk − tj). But we know that average
Θ (t) = 3 cos2 ϑ − 1 = 0. Therefore, Θ (tk)Θ (tk − tj) = 0 for tj → ∞.
If the motions are really stochastic, it does not matter when we start to measure time. Therefore, we can start counting time from
tk, i.e., set tk = 0, and integrate from −tk to zero:
tkˆ
0
Θ (tk)Θ (tk − tj)dtj =
0ˆ
−tk
Θ (0)Θ (−tj)dtj. (2.48)
Furthemore, orientations at t = tk are correlated with those at t = 0 exactly like orientations at t = −tk if the molecule moves really
randomly (the ﬂuctations are stationary). As a consequence, the integration from −tk to zero can be replaced by integrating from zero to
+tk:
0ˆ
−tk
Θ (0)Θ (−tj)dtj =
tkˆ
0
Θ (0)Θ (tj)dtj. (2.49)
Finally, extending the upper integration limit from tk to inﬁnity does not change the integral signiﬁcantly if tk was already long
enough to reduce Θ (tk)Θ (tk − tj) almost to zero. Therefore, we can describe the loss of coherence for any suﬃciently long tk as
d(M+)rot
dt
= −

b2
∞ˆ
0
Θ (0)Θ (t)dt

 (M+
)rot, (2.50)
which resembles a ﬁrst-order chemical kinetics with the rate constant
R0 = b2
∞ˆ
0
Θ (0)Θ (t)dt. (2.51)
In order to calculate the value of the rate constant R0, we must be able to evaluate the averaged term Θ (0)Θ (t), known as the
time correlation function. As mentioned above, statistics play the key role here. Although the product Θ (0)Θ (t) changes randomly and
individually, the value of the time correlation function is deﬁned statistically.
2.6.2 Stochastic molecular motions: diﬀusion
Diﬀusion can be viewed as a result of collisions of the observed molecule with other molecules. Collisions change position of the molecule
is space (cause translation) and orientation of the molecule (cause rotation). Rotational diﬀusion is important for NMR relaxation.
Translational diﬀusion inﬂuences NMR experiments only if the magnetic ﬁled is inhomogeneous. Translational diﬀusion can be described
as a random walk in a three-dimensional space, rotational diﬀusion can be described as a random walk on a surface of a sphere. Although
we are primarily interested in relaxation and we do not discuss magnetic ﬁeld inhomogeneity at this moment, we start our discussion with
the random walk in a three-dimensional space because the random walk on a surface of a sphere is just a special case of the general walk
in three directions. In the following section (Section 2.6.3), we continue with the analysis of the simplest example of the random walk
on a spherical surface, i.e., of the isotropic rotational diﬀusion. The analysis shows that the isotropic rotational diﬀusion is described by
a simple exponential time dependence (Eq. 2.59). This relation will serve as a starting point for derivation of the key component of the
theory of NMR relaxation, of the time correlation function, described in Section 2.6.4.
We start with several deﬁnitions. Let us assume that the position of our molecule is described by coordinates x, y, z and its orientation
is described by angles ϕ, ϑ, χ.
2.6. SUPPORTING INFORMATION 53
• Probability that the molecule is inside a cubic box of a volume ∆V = ∆x∆y∆z centered around x, y, z is
P(x, y, z, t, ∆x, ∆y, ∆z) =
x+ ∆x
2ˆ
x− ∆x
2
y+ ∆y
2ˆ
y− ∆y
2
z+ ∆z
2ˆ
z− ∆z
2
ρ(x, y, z, t)dxdydz,
where ρ(x, y, z, t) is probability density at x, y, z, corresponding to local concentration in a macroscopic picture. If the box is small
enough so that ρ(x, y, z, t) does not change signiﬁcantly inside the box, the equation with the triple integral can be simpliﬁed to
P(x, y, z, t, ∆x, ∆y, ∆z) = ρ(x, y, z, t)∆V.
• Probability that the molecule crosses one wall of the box centered around x, y, z and jumps into the box centered around x+∆x, y, z
during a time interval δt is proportional to the area of the wall between boxes centered around x, y, z and around x+∆x, y, z. This
area is equal to ∆y∆z = ∆V/∆x and the probability of jumping from the box centered around x, y, z to the box centered around
x + ∆x, y, z can be written as
P(x → x + ∆x; x, y, z, t, ∆x, ∆y, ∆z, ∆t) = Φx→x+∆x∆y∆z = Φx→x+∆x∆V/∆x,
where Φx→x+∆x is the ﬂux from the box centered around x, y, z to the box centered around x + ∆x, y, z (per unit area). The
corresponding probability density is
ρ(x → x + ∆x; x, y, z, t, ∆t) = P(x → x + ∆x; x, y, z, t, ∆x, ∆y, ∆z, ∆t)/∆V = Φx→x+∆x/∆x.
The probability of jumping to the box centered around x + ∆x, y, z is also proportional to the probability that the molecule is
inside the box centered around x, y, z (equal to ρ(x, y, z, t)∆V if the box is small enough). If the probability of escaping the box is
the same in all directions,
ρ(x → x + ∆x; x, y, z, t, ∆t) = ξρ(x, y, z, t),
ρ(y → y + ∆y; x, y, z, t, ∆t) = ξρ(x, y, z, t),
ρ(z → z + ∆z; x, y, z, t, ∆t) = ξρ(x, y, z, t),
where ξ is a proportionality constant describing frequency of crossing a wall of a box (per unit volume and including the physical
description of the collisions).
• The net ﬂux in the x direction is given by
Φx = Φx→x+∆x − Φx+∆x→x = ξ∆x(ρ(x, y, z, t) − ρ(x + ∆x, y, z, t)) = −ξ∆x∆ρ = −ξ(∆x)2 ∂ρ
∂x
= −Dtr ∂ρ
∂x
,
where Dtr = ξ(∆x)2 is the translational diﬀusion coeﬃcient.
• The net ﬂux in all directions is
Φ = −Dtr
ρ,
which is the ﬁrst Fick’s law.
• The continuity equation ˆ
V
∂ρ
∂t
dV +
‹
S
ΦdS = 0
states that any time change of probability that the molecule is in a volume V is due to the total ﬂux through a surface S enclosing
the volume V (molecules are not created or annihilated). Using the divergence theorem,
0 =
∂ρ
∂t
+ · Φ =
∂ρ
∂t
+ · −Dtr
ρ ⇒
∂ρ
∂t
= Dtr 2
ρ,
which is the second Fick’s law.
• If the diﬀusion is not isotropic, the diﬀusion coeﬃcient is replaced by a diﬀusion tensor. If we deﬁne a coordinate frame so that the
diﬀusion tensor is represented by a diagonal matrix with elements Dtr
xx, Dtr
yy, Dtr
zz, the second Fick’s law has the following form:
∂ρ
∂t
= Dtr
xx
∂
∂x
∂ρ
∂x
+ Dtr
yy
∂
∂y
∂ρ
∂y
+ Dtr
zz
∂
∂z
∂ρ
∂z
= Dtr
xx
∂2
∂x2
+ Dtr
yy
∂2
∂y2
+ Dtr
zz
∂2
∂z2
ρ.
54
2.6.3 Isotropic rotational diﬀusion
Isotropic rotational diﬀusion can be viewed as random motions of a vector describing orientation of the molecule. Such motions are
equivalent to a random wandering of a point particle on a surface of a sphere with a unit diameter. In order to describe such a random
walk on a spherical surface, it is convenient to express the second Fick’s law in spherical coordinates
∂ρ
∂t
=
Drot
r2 sin ϑ
∂
∂r
r2
sin ϑ
∂
∂r
+
∂
∂ϑ
sin ϑ
∂
∂ϑ
+
∂
∂ϕ
1
sin ϑ
∂
∂ϕ
ρ. (2.52)
Since r is constant and equal to unity,
∂ρ
∂t
=
Drot
sin ϑ
∂
∂ϑ
sin ϑ
∂
∂ϑ
+
∂
∂ϕ
1
sin ϑ
∂
∂ϕ
ρ. (2.53)
Using the substitution u = cos ϑ (and ∂u = − sin ϑ∂ϑ),
∂ρ
∂t
= Drot
(1 − u2
)
∂2
∂u2
− 2u
∂
∂u
+
1
1 − u2
∂2
∂ϕ2
ρ. (2.54)
Let us now try if time and space coordinates can be separated, i.e. if ρ can be expressed as a product ρ(ϑ, ϕ, t) = f(ϑ, ϕ)g(t).
f
∂g
∂t
= gDrot
(1 − u2
)
∂2
∂u2
− 2u
∂
∂u
+
1
1 − u2
∂2
∂ϕ2
f. (2.55)
Dividing both sides of the equation by Drotρ = Drotfg,
1
Drot
1
g
∂g
∂t
=
1
f
(1 − u2
)
∂2
∂u2
− 2u
∂
∂u
+
1
1 − u2
∂2
∂ϕ2
f. (2.56)
If the separation of time and space coordinates is possible, i.e., if Eq. 2.56 is true for any t and any ϑ, ϕ independently, both sides of
the equation must be equal to the same constant (called λ bellow).
1
Drot
1
g
∂g
∂t
= λ (2.57)
1
f
(1 − u2
)
∂2
∂u2
− 2u
∂
∂u
+
1
1 − u2
∂2
∂ϕ2
f = λ. (2.58)
Solution of the ﬁrst equation is obviously
g(t) = g(0)eλDrot
t
, (2.59)
where λ is obtained by solving the second equation. We solve a simpliﬁed version of the second equation in Section 2.6.4.
2.6.4 Time correlation function
Analysis of the isotropic rotational diﬀusion in Section 2.6.3 allows us to calculate the time correlation function Θ (0)Θ (t) for this type
of diﬀusion (with a spherical symmetry). The ensemble-averaged product of randomly changing (3 cos2 ϑ(t) − 1)/2, evaluated for a time
diﬀerence t, can be expressed as
3
2
cos2 ϑ(0) −
1
2
3
2
cos2 ϑ(t) −
1
2
(2.60)
=
2πˆ
0
dϕ(0)
πˆ
0
sin ϑ(0)dϑ(0)ρ0
2πˆ
0
dϕ(t)
πˆ
0
sin ϑ(t)dϑ(t)
3
2
cos2
ϑ(0) −
1
2
3
2
cos2
ϑ(t) −
1
2
G(ϑ(0), ϕ(0)|ϑ(t), ϕ(t)), (2.61)
where ρ0 is the probability density of the original orientation described by ϑ(0) and ϕ(0), and G(ϑ(0), ϕ(0)|ϑ(t), ϕ(t)) is the conditional
probability density or propagator (also known as the Green’s function) describing what is the chance to ﬁnd an orientation given by ϑ(t), ϕ(t)
at time t, if the orientation at t = 0 was given by ϑ(0), ϕ(0).
If the molecule is present in an isotropic environment,8 ρ0 plays a role of a normalization constant and can be calculated easily from
the condition that the overall probability of ﬁnding the molecule in any orientation is equal to one:
8
Note that in the isotropic environment, where all orientations of the molecule are equally probable, the diﬀusion can be very anisotropic
if the shape of the molecule greatly diﬀers from a sphere.
2.6. SUPPORTING INFORMATION 55
2πˆ
0
dϕ(0)
πˆ
0
sin ϑ(0)dϑ(0)ρ0 = 4πρ0 = 1 ⇒ ρ0 =
1
4π
. (2.62)
Evaluation of G(ϑ(0), ϕ(0)|ϑ(t), ϕ(t)) requires to solve the diﬀusion equation (Eq. 2.58). We again express G as a product of timedependent
and time-independent functions g(t)P(ϑ). The function g(t) is deﬁned by Eq. 2.57, the function P(ϑ) is a simpliﬁed version of
function f(ϑ, ϕ) from Eq. 2.58. Since our correlation correlation function does not depend on ϕ, ∂P/∂ϕ = 0, and we can further simplify
Eq. 2.58 to
(1 − u2
)
d2
du2
− 2u
d
du
P = λP, (2.63)
(1 − u2
)
d2P
du2
− 2u
dP
du
− λP = 0. (2.64)
We expand P in a Taylor series
P =
∞
k=0
akuk
, ak =
1
k!
dkP(0)
duk
, (2.65)
calculate its ﬁrst and second derivatives
dP
du
=
∞
k=0
kakuk−1
, (2.66)
d2P
du2
=
∞
k=0
k(k − 1)akuk−2
, (2.67)
and substitute them into Eq. 2.64
(1 − u2
)
∞
k=0
k(k − 1)akuk−2
− 2
∞
k=0
kakuk
− λ
∞
k=0
akuk
= 0 (2.68)
∞
k=0
k(k − 1)akuk−2
−
∞
k=0
k(k − 1)akuk
− 2
∞
k=0
kakuk
− λ
∞
k=0
akuk
= 0. (2.69)
Note that the ﬁrst two terms of the ﬁrst sum are equal to zero (the ﬁrst term includes multiplication by k = 0 and the second term
includes multiplication by k − 1 = 0 for k = 1). Therefore, we can start summation from k = 2 in the ﬁrst term
∞
k=2
k(k − 1)akuk−2
−
∞
k=0
k(k − 1)akuk
− 2
∞
k=0
kakuk
− λ
∞
k=0
akuk
= 0. (2.70)
We shift the index in the ﬁrst sum by two to get the ﬁrst sum expressed in the same power of u as the other sums
∞
k=0
(k + 2)(k + 1)ak+2uk
−
∞
k=0
k(k − 1)akuk
− 2
∞
k=0
kakuk
− λ
∞
k=0
akuk
= 0 (2.71)
∞
k=0
((k + 2)(k + 1)ak+2 − (k(k − 1) + 2k + λ)ak) uk
=
∞
k=0
((k + 2)(k + 1)ak+2 − (k(k + 1) + λ)ak)
must be zero
uk
= 0. (2.72)
This equation is true for u = 0 only if the underbraced expression is equal to zero
(k + 2)(k + 1)ak+2 − (k(k + 1) + λ)ak = 0, (2.73)
which gives us a recurrence formula relating ak+2 and ak:
ak+2 =
k(k + 1) + λ
(k + 2)(k + 1)
ak. (2.74)
We can use the recurrence formula to express the Taylor series in terms of a0 and a1:
P = a0 1 +
0 · 1 + λ
1 · 2
u2
+
0 · 1 + λ
1 · 2
·
2 · 3 + λ
3 · 4
u4
+ . . . + a1 u +
1 · 2 + λ
2 · 3
u3
+
1 · 2 + λ
2 · 3
·
3 · 4 + λ
4 · 5
u5
+ . . . = 0. (2.75)
56
What is the value of λ? Note that ak+2 = 0 for each λ = −k(k + 1), which terminates one of the series in large parentheses, while the
other series grows to inﬁnity (for u = 0). To keep P ﬁnite, the coeﬃcient before the large parentheses in the unterminated series must be
set to zero. It tells us that we can ﬁnd a possible solution for each even or odd k if a1 = 0 or a0 = 0, respectively.
k = 0 a1 = 0 G = P0 = 1 λ = −k(k + 1) = 0 (2.76)
k = 1 a0 = 0 G = P1 = u = cos ϑ λ = −k(k + 1) = −2 (2.77)
k = 2 a1 = 0 G = P2 =
3u2 − 1
2
=
3 cos2 ϑ − 1
2
λ = −k(k + 1) = −6 (2.78)
k = 3 a0 = 0 G = P3 =
5u3 − 3u
2
=
5 cos3 ϑ − 3 cos ϑ
2
λ = −k(k + 1) = −12 (2.79)
..
. (2.80)
The value of a0 or a1 preceding the terminated series was chosen so that Pk(u = 1) = Pk(ϑ = 0) = 1.
Which of the possible solutions is the correct one? It can be shown easily that
1ˆ
−1
Pk(u)Pk (u)du =
πˆ
0
Pk(ϑ)Pk (ϑ)dϑ =
2
2k + 1
δkk , (2.81)
where δkk = 1 if k = k and δkk = 0 if k = k , i.e., the integral is equal to zero for each k = k (Pk are orthogonal). As we are going to
use G = g(t)P(ϑ) to calculate a correlation function for functions having the same form as the solutions for k = 2 and as the calculation of
the correlation function includes the same integration as in Eq. 2.81, it is clear that the only solution which gives us a non-zero correlation
function is that for k = 2, i.e. P2. Our function G is therefore given by
G = g0
3 cos2 ϑ − 1
2
e−6Drot
t
. (2.82)
Still, we need to evaluate the factor g0. This value must be chosen so that we fulﬁll the following conditions:
2πˆ
0
dϕ
πˆ
0
sin ϑdϑG = 1 (2.83)
and
G(t = 0) = δ(ϑ − ϑ(0)), (2.84)
where δ(ϑ − ϑ(0)) is a so-called Dirac delta function, deﬁned as
∞ˆ
−∞
f(x)δ(x − x0) = f(x0). (2.85)
The second condition says that ϑ must have its original value for t = 0. This is fulﬁlled for g0 proportional to (3 cos2 ϑ(0) − 1)/2:
g0 = c0
3 cos2 ϑ(0) − 1
2
. (2.86)
We can re-write our original deﬁnition of the correlation function with the evaluated G function and in a somewhat simpliﬁed form
(omitting integration over ϕ and ϕ(0)):
3
2
cos2 ϑ(0) −
1
2
3
2
cos2 ϑ(t) −
1
2
=
1ˆ
−1
du0ρ0c0
1ˆ
−1
du
(3u2
0 − 1)2
4
(3u2 − 1)2
4
e−6Drot
t
, (2.87)
where ρ0 can be evaluated from the normalization condition
1 =
1ˆ
−1
du0ρ0 = 2ρ0 ⇒ ρ0 =
1
2
(2.88)
and
c0 from
2.6. SUPPORTING INFORMATION 57
1 =
1ˆ
−1
du0
1ˆ
−1
c0
3u2
0 − 1
2
3u2 − 1
2
δ(u−u0)du =
1ˆ
−1
du0c0
(3u2
0 − 1)2
4
=
1ˆ
−1
du0c0
9u4
0 − 6u2
0 + 1
4
= c0
9u5
0 − 10u3
0 + 5u0
20
1
−1
=
2
5
c0, (2.89)
showing that c0 = 5/2.
Finally, the correlation function can be calculated
3
2
cos2 ϑ(0) −
1
2
3
2
cos2 ϑ(t) −
1
2
=
5
4
1ˆ
−1
du0
(3u2
0 − 1)2
4
1ˆ
−1
du
(3u2 − 1)2
4
e−6Drot
t
=
5
4
2
5
2
5
e−6Drot
t
=
1
5
e−6Drot
t
. (2.90)
We have derived that the time correlation function for spherically symmetric rotational diﬀusion is a single-exponential function.
2.6.5 Return to equilibrium
After introducing the correlation function, we can repeat the analysis using the same simpliﬁcations (rigid molecule, isotropic liquid), but
taking the transverse (perpendicular) ﬁeld ﬂuctuations into account.
dµx
dt
= ωyµz − ωzµy (2.91)
dµy
dt
= ωzµx − ωxµz (2.92)
dµz
dt
= ωxµy − ωyµx (2.93)
Expressing ωx as bΘ⊥ cos ϕ and ωy as bΘ⊥ sin ϕ, where
b = −2γB0δa (2.94)
Θ⊥
=
3
2
sin ϑ cos ϑ, (2.95)
gives
dµx
dt
= (bΘ⊥
sin ϕ)µz − (ω0 + bΘ )µy (2.96)
dµy
dt
= (ω0 + bΘ )µx − (bΘ⊥
cos ϕ)µz (2.97)
dµz
dt
= (bΘ⊥
cos ϕ)µy − (bΘ⊥
sin ϕ)µx, (2.98)
Introducing µ+ = µx + iµy and µ− = µx − iµy results in
dµ+
dt
= −ibΘ⊥
eiϕ
µz + i(ω0 + bΘ )µ+
(2.99)
dµ−
dt
= ibΘ⊥
e−iϕ
µz − i(ω0 + bΘ )µ−
(2.100)
dµz
dt
=
i
2
bΘ⊥
e−iϕ
µ+
− eiϕ
µ−
, (2.101)
In a coordinate frame rotating with ω0,
d(µ+)rot
dt
= −ibΘ⊥
ei(ϕ−ω0t)
µz + ibΘ (µ+
)rot (2.102)
d(µ−)rot
dt
= ibΘ⊥
e−i(ϕ−ω0t)
µz − ibΘ (µ−
)rot (2.103)
dµz
dt
=
i
2
bΘ⊥
e−i(ϕ−ω0t)
(µ+
)rot − ei(ϕ−ω0t)
(µ−
)rot , (2.104)
58
µ+
0
ibΘ1∆t
1
e
e
e
e
e
e
e
e
e
e
e
e
e
−i
b
2
Θ
⊥
1
∆te
−i(ϕ1−ω0t1)
µ+
1
ibΘ2∆t
1
e
e
e
e
e
e
e
e
e
e
e
e
e
−i
b
2
Θ
⊥
2
∆te
−i(ϕ2−ω0t2)
µ+
2
ibΘ3∆t
1
e
e
e
e
e
e
e
e
e
e
e
e
e
−i
b
2
Θ
⊥
3
∆te
−i(ϕ3−ω0t3)
µ+
3
ibΘ4∆t
1
e
e
e
e
e
e
e
e
e
e
e
e
e
−i
b
2
Θ
⊥
4
∆te
−i(ϕ4−ω0t4)
µ+
4 · · ·
ibΘk∆t
1
e
e
e
e
e
e
e
e
e
e
e
e
e
−i
b
2
Θ
⊥
k
∆te
−i(ϕk
−ω0tk
)
µ+
k
µz,0
1¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
−ibΘ⊥
1∆tei(ϕ1−ω0t1)
e
e
e
e
e
e
e
e
e
e
e
e
e
ibΘ
⊥
1
∆te
−i(ϕ1−ω0t1)
µz,1
1¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
−ibΘ⊥
2∆tei(ϕ2−ω0t2)
e
e
e
e
e
e
e
e
e
e
e
e
e
ibΘ
⊥
2
∆te
−i(ϕ2−ω0t2)
µz,2
1¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
−ibΘ⊥
3∆tei(ϕ3−ω0t3)
e
e
e
e
e
e
e
e
e
e
e
e
e
ibΘ
⊥
3
∆te
−i(ϕ3−ω0t3)
µz,3
1¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
−ibΘ⊥
4∆tei(ϕ4−ω0t4)
e
e
e
e
e
e
e
e
e
e
e
e
e
ibΘ
⊥
4
∆te
−i(ϕ4−ω0t4)
µz,4 · · ·
1¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
−ibΘ⊥
k∆tei(ϕk−ω0tk)
e
e
e
e
e
e
e
e
e
e
e
e
e
ibΘ
⊥
k
∆te
−i(ϕk
−ω0tk
)
µz,k
µ−
0
−ibΘ1∆t
1¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
ib
2Θ⊥
1∆tei(ϕ1−ω0t1)
µ−
1
−ibΘ2∆t
1¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
ib
2Θ⊥
2∆tei(ϕ2−ω0t2)
µ−
2
−ibΘ3∆t
1¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
ib
2Θ⊥
3∆tei(ϕ3−ω0t3)
µ−
3
−ibΘ4∆t
1¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
ib
2Θ⊥
4∆tei(ϕ4−ω0t4)
µ−
4 · · ·
−ibΘk∆t
1¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
¡
ib
2Θ⊥
k∆tei(ϕk−ω0tk)
µ−
k
Figure 2.3: Evolution of magnetic moments due to longitudinal (parallel) and transverse (perpendicular) ﬂuctuations of magnetic ﬁelds.
The meaning of the diagram is the same as in Fig. 2.2, but additional segments (red and blue) interconnect µ+
j , µ−
j , and µz,j, substantially
increasing the number of possible pathways. The pathway composed of the black segments only gives the result of multiplication equal to
one, the pathways containing just one segment of a diﬀerent color give results of multiplication proportional to ∆t, the pathways containing
two segments of a color diﬀerent than black give results of multiplication proportional to (∆t)2, etc.
Note that now the transformation to the rotating frame did not remove ω0 completely, it survived in the exponential terms.
Again, the set of diﬀerential equations cannot be solved because Θ , Θ⊥, and ϕ ﬂuctuate in time, but we can analyze the evolution
in time steps short enough to keep Θ , Θ⊥, and ϕ constant.
µ+
1 = µ+
0 + ∆µ+
1 = [1 + i(ω0 + bΘ1)∆t]µ+
0 − ibΘ⊥
1 ∆tei(ϕ1−ω0t1)
µz,0 (2.105)
µ−
1 = µ−
0 + ∆µ−
1 = [1 − i(ω0 + bΘ1)∆t]µ−
0 + ibΘ⊥
1 ∆te−i(ϕ1−ω0t1)
µz,0 (2.106)
µz,1 = µz,0 + ∆µz,1 = µz,0 −
i
2
bΘ⊥
1 ∆te−i(ϕ1−ω0t1)
µ+
0 +
i
2
bΘ⊥
1 ∆tei(ϕ1−ω0t1)
µ−
0 . (2.107)
The µ+, µ−, and µz,0 are now coupled which makes the step-by-step analysis much more complicated. Instead of writing the equations,
we just draw a picture (Figure 2.3) similar to Fig. 2.2. Derivation of the values of relaxation rates follows the procedure described for
the parallel ﬂuctuations (Eqs. 2.36–2.41). As the number of possible pathways in Fig. 2.3 is very high, already the list of the terms
proportional to ∆t and ∆t2 is very long. Fortunately, we are not interested in evolution of magnetic moments in individual molecules,
described in Fig. 2.3. The values of Θ1, Θ⊥
1 , ϕ1, etc. are diﬀerent for each molecule and we are interested in what we get after averaging
results of multiplications for all molecules (all possible orientations). In order to avoid writing the long expressions for magnetic moments
of individual molecules, we skip steps corresponding to Eqs. 2.36–2.40 and jump directly to the calculation of the evolution of total
magnetization (corresponding to Eq. 2.41).
2.6. SUPPORTING INFORMATION 59
Let us start with the terms proportional to ∆t, which give us the imaginary term proportional to b when calculating dM+/dt (and
dM−/dt, dMz/dt). We have already seen that the average of Θ (the green segment) is zero. The terms containing Θ⊥ (red and blue
segments) contain the exponential expression with the phase including ϕ. If the azimuth ϕ is random,9 the ”red” and ”blue” terms average
to zero.
Let us now turn to the terms proportional to ∆t2, which give us the time integral multiplied by b2 when calculating dM+/dt (and
dM−/dt, dMz/dt). The pathways containing two red segments or two blue segments correspond to ∆t2 terms with a random phase in
the exponent (random sums of ϕj − ω0tj). When averaged for all orientations, such phases tend to zero. The ∆t2 terms do not average
to zero only in two cases: (i) if the pathway contains two green segments (eﬀect of longitudinal ﬂuctuations described above) or (ii) if the
pathway contains a combination of one red and one blue segment. The former case is obvious, but the latter one is more subtle.
We can distinguish two combinations of one red and one blue segment:
1
2
b2
∆t2
Θ⊥
k ei(ϕk−ω0tk)
Θ⊥
j e−i(ϕj −ω0tj )
=
1
2
b2
∆t2
Θ⊥
k Θ⊥
j ei(ϕk−ϕj −ω0(tk−tj ))
(2.108)
(with −ω0(tk − tj) in the exponent) and
1
2
b2
∆t2
Θ⊥
k e−i(ϕk−ω0tk)
Θ⊥
j ei(ϕj −ω0tj )
=
1
2
b2
∆t2
Θ⊥
k Θ⊥
j ei(−ϕk+ϕj +ω0(tk−tj ))
(2.109)
(with +ω0(tk − tj) in the exponent). As discussed in Section 2.6.1, we can replace tk by zero and tj by t because the molecular
motions are random:
1
2
b2
∆t2
Θ⊥
(0)Θ⊥
(t)ei(−(ϕ(t)−ϕ(0))+ω0t))
(2.110)
(with +ω0t in the exponent) and
1
2
b2
∆t2
Θ⊥
(0)Θ⊥
(t)ei(+(ϕ(0)−ϕ(t))−ω0t))
(2.111)
(with −ω0t in the exponent).
In both cases, the phase is not randomly distributed for diﬀerent orientations only if ϕ(0) − ϕ(t) is similar to ω0t. The average value
of Θ⊥(0)2 is 3/10:
Θ⊥(t)2 =
9
4
cos2 ϑ sin2 ϑ =
9
16π
2πˆ
0
dϕ
πˆ
0
dϑ(sin3
ϑ cos2
ϑ) =
9
16π
2πˆ
0
dϕ
1ˆ
−1
du(u2
− u4
) =
9
8
u3
3
−
u5
5
1
−1
=
9
8
2
3
−
2
5
=
3
10
(2.112)
for any t.
The Mz component of magnetization is given by the average of the µz components at tk. In order to get to µz,k through paths giving
terms proportional to ∆t2, we must start at µz,0 and pass one blue segment and one red segment in Figure 2.3. Eqs. 2.110 and 2.111
mathematically describe that orientations of magnetic moments are redistributed if the molecular motions (described by the azimuth ϕ)
accidently resonate for a short time with the frequencies ω0t and −ω0t. Then the magnetic energy of the magnetic moments is exchanged
with the rotational kinetic energy of the molecules. This energy exchange must be taken into account when we average magnetic moments
of individual molecules to calculate Mz. Let us call the total rotational energy of molecules Erot
0 . The exchange of the magnetic energy Eµ
of a magnetic moment µ with a small amount of rotational energy of molecules ∆Erot can be described as
Erot
0 → Erot
0 + ∆Erot
+ Eµ. (2.113)
The molecular motions have much more degrees of freedom (both directions of rotational axes and rates of rotation vary) than the
magnetic moments (size is ﬁxed, only orientation changes). We can therefore assume that the exchange perturbs distribution of the
magnetic moments, but the rotating molecules stay very close to the termodynamic equilibrium. At the equilibrium, the probability to
ﬁnd a molecule with the rotational kinetic energy Erot
0 + ∆Erot is proportional (Boltzmann law) to
e
−∆Erot
kBT
≈ 1 −
∆Erot
kBT
. (2.114)
The conservation of energy requires
Erot
0 + ∆Erot
+ Eµ = Erot
0 , (2.115)
showing that ∆Erot = −Eµ. Consequently, the population of molecules with the given rotational energy is proportional to 1 −
∆Erot/kBT = 1 + Eµ/kBT. According to Eq. 1.19, the probability of ﬁnding a magnetic moment in the orientation described by a given
u = cos ϑµ is
9
Note that this is true even in the presence of B0 and in molecules aligned along the direction of B0, for example in liquid crystals
oriented by the magnetic ﬁeld.
60
Peq
(u) =
w
ew − e−w
euw
≈
w
1 − w − 1 + w
(1 + uw) =
1
2
(1 + uw). (2.116)
Consequently, Eµ/kBT = −uw = 1 − 2Peq(u) and the probability to ﬁnd a molecule with the rotational kinetic energy Erot
0 + ∆Erot
is proportional to
1 −
∆Erot
kBT
= 1 +
Eµ
kBT
= 2 − 2Peq
(u) = 2(1 − Peq
(u)), (2.117)
where the factor of two can be absorbed in the normalization constant.
We have derived that the averaged values of µz are weighted by 1−Peq(u). How does it aﬀect the calculation of Mz? In the expression
µz − Peq(u)µz, µz in the ﬁrst term is not weighted by anything and its average (multiplied by the number of magnetic moments per unit
volume) is equal to Mz. The average value of the second term has been already calculated in Eqs. 1.23–1.26. It represents the equilibrium
value of the magnetization, Meq. Therefore, averaging of µz results in Mz − Meq, usually abbreviated as ∆Mz.
Using the same arguments as in Section 2.6.1,
d∆Mz
dt
= −

1
2
b2
∞ˆ
0
Θ⊥(0)Θ⊥(t)e−i(ϕ(t)−ϕ(0))eiω0t
dt +
1
2
b2
∞ˆ
0
Θ⊥(0)Θ⊥(t)ei(ϕ(t)−ϕ(0))e−iω0t
dt

 ∆Mz (2.118)
The relaxation rate R1 for Mz, known as longitudinal relaxation rate in the literature, is the real part10 of the expression in the
parentheses
R1 = b2



∞ˆ
0
Θ⊥(0)Θ⊥(t)e−i(ϕ(t)−ϕ(0))eiω0t
dt +
∞ˆ
0
Θ⊥(0)Θ⊥(t)ei(ϕ(t)−ϕ(0))e−iω0t
dt



. (2.119)
If the ﬂuctuations are random and their statistical properties do not change in time, they are stationary: the current orientation of the
molecule is correlated with the orientation in the past in the same manner as it is correlated with the orientation in the future. Therefore,
∞ˆ
0
Θ⊥(0)Θ⊥(t)e−i(ϕ(t)−ϕ(0))eiω0t
dt =
1
2


∞ˆ
0
Θ⊥(0)Θ⊥(t)e−i(ϕ(t)−ϕ(0))eiω0t
dt +
0ˆ
−∞
Θ⊥(0)Θ⊥(t)e−i(ϕ(t)−ϕ(0))eiω0t
dt

 (2.120)
=
1
2
∞ˆ
−∞
Θ⊥(0)Θ⊥(t)e−i(ϕ(t)−ϕ(0))eiω0t
dt. (2.121)
∞ˆ
0
Θ⊥(0)Θ⊥(t)ei(ϕ(t)−ϕ(0))e−iω0t
dt =
1
2


∞ˆ
0
Θ⊥(0)Θ⊥(t)ei(ϕ(t)−ϕ(0))e−iω0t
dt +
0ˆ
−∞
Θ⊥(0)Θ⊥(t)ei(ϕ(t)−ϕ(0))e−iω0t
dt

 (2.122)
=
1
2
∞ˆ
−∞
Θ⊥(0)Θ⊥(t)ei(ϕ(t)−ϕ(0))e−iω0t
dt. (2.123)
In isotropic solutions, the motions of molecules are very little aﬀected by magnetic ﬁelds. Therefore, the choice of the z axes is
arbitrary form the point of the view of the molecule (not of the magnetic moment!). Therefore, the terms with Θ⊥ can be replaced by
those with Θ , multiplied by 3/2 to match the diﬀerence between Θ (0)2 = 1/5 and Θ⊥(0)2 = 3/10:
1
2
∞ˆ
−∞
Θ⊥(0)Θ⊥(t)e i(ϕ(t)−ϕ(0))e±iω0t
dt =
3
4
∞ˆ
−∞
Θ (0)Θ (t)e±iω0t
dt. (2.124)
Real parts of the integrals in Eq. 2.124 are known as spectral density functions J(ω). Note that the real part of the integral in the
right-hand side of Eq. 2.124 is
10
Solving Eq. 2.118 gives
∆Mz = ∆Mz(0)e−(R1+iω )t
= ∆Mz(0)e−R1t
eiω t
= ∆Mz(0)e−R1t
cos ω t + i sin ω t ,
where R1 and ω are the real and imaginary parts, respectively, of the expression in the parentheses in Eq. 2.118. Whereas R1 describes
the decay rate of ∆Mz, ω (much smaller than ω0), known as the dynamic frequency shift, describes an oscillation of ∆Mz, and is usually
included into the value of ω0.
2.6. SUPPORTING INFORMATION 61



3
4
∞ˆ
−∞
Θ (0)Θ (t)e±iω0t
dt



=
3
4
∞ˆ
−∞
Θ (0)Θ (t) cos(ω0t)dt. (2.125)
because
e±ix
= cos x ± i sin x. (2.126)
Also note that the integral in Eq. 2.50 in Section 2.6.1 can be also included in the deﬁnition of the spectral density function if we
replace ω0 by zero:
∞ˆ
0
Θ (0)Θ (t)dt =
1
2


∞ˆ
0
Θ (0)Θ (t)dt +
0ˆ
−∞
Θ (0)Θ (t)dt

 =
1
2
∞ˆ
−∞
Θ (0)Θ (t) cos(0)dt =
1
2
J(0). (2.127)
62
Lecture 3
Signal acquisition and processing
Literature: Function of an NMR spectrometer is nicely described in L4, K13, or C3.1. More details
are provided in B23. Experimental setup is discussed in C3.8.2. Signal averaging is described in L5.2,
quadrature detection in L5.7 and LA.5, K13.6, and C3.2.3, Fourier transformation is introduced in
K5.1–K5.3.1 and L5.8.1.–L5.8.3, and treated more thoroughly in B8 and C3.3.1. Phase correction
is described nicely in K5.3.2–K5.3.4 and discussed also in C3.3.2.3 and L5.8.4–L5.8.5, zero ﬁlling is
discussed in C3.3.2.1 and K5.5, and apodization is explained in K5.4 and C3.3.2.2.
3.1 NMR experiment
It is not our aim to discuss practical issues of NMR spectroscopy. However, it is useful to have at
least a basic idea how NMR signal is acquired and processed before discussing theory of magnetic
resonance.
The real NMR experiment closely resembles FM radio broadcast. The mega-hertz radio frequency
ωradio plays the role of the carrier frequency, and is frequency-modulated by the oﬀset, which usually
falls in the range of kilo-hertz audio frequencies. In a similar fashion, the carrier frequency of the FM
broadcast is modulated by the audio frequency of the transmitted signal (voice, music). Like when
listening to the radio, we need to know the carrier frequency to tune the receiver, but its value is
not interesting. The interesting information about the chemical environment is hidden in the audiofrequency
oﬀset. But recall that the numerical value of Ω is arbitrary as it depends on the actual
choice of the carrier frequency. What can be interpreted unambiguously, is the constant δ, given just
by the electron density. As discussed in Section 1.4, the absolute value of δ is extremely diﬃcult to
obtain because the reference δ = 0 represents nuclei with no electrons – deﬁnitely not a sample we
are used to produce in our labs. Therefore, more accessible references (precession frequencies ωref of
stable chemical compounds) are used instead of the vacuum frequency. The value of δ is then deﬁned
as (ω − ωref)/ωref and usually presented in the units of ppm (see Section 1.4).
Setting up the NMR experiment is not a simple task. It includes several steps, listed in Section
3.10.1
63
64
3.2 NMR signal acquisition
Most often, the NMR signal is acquired in a manner called quadrature detection. The procedure
is described in Section 3.10.2, here we only describe its result. Magnetic ﬁeld of the rotating magnetization
induces electromotive voltage in the detector coil. This electric signal, oscillating and
decaying due to the relaxation, is known as the free induction decay (FID). The voltage induced in
the detector coil is split into two channels. The high-frequency (radio) component of the signal (oscillating
with carrier frequency −ωradio) is ﬁltered out (demodulation). The resulting signal contains
only the low-frequency (audio) component (superposition of oscillations with frequency oﬀsets Ωj
of individual nuclei) but with a diﬀerent phase in each channel. The phase diﬀerence between the
channels is π/2, or 90 ◦
. It is convenient to treat the signals in the individual channels (labeled a(t)
and b(t) in this text) as a real and imaginary component of a single complex number, denoted y(t)
in this text. If we ignore relaxation, the complex signal can be described as
y(t) = a(t) + ib(t) =
j
(Aj cos(Ωjt) + iAj sin(Ωjt)) =
j
AjeiΩjt
. (3.1)
The output of the quadrature receiver is converted to a digital form (analog-digital conversion).
Therefore, the information obtained from an NMR experiment is a set of complex numbers describing
the signal intensities at the time points t ∈ {0, ∆t, 2∆t, · · · , (N − 1)∆t}.
The NMR signal induced by precession of the magnetization vector is very weak, comparable
to the noise, generated mostly by random motions of electrons in the receiver coil. Therefore, the
NMR experiments are usually repeating several times, adding the signal together. If the experiment
is repeated in the same manner n-times, the evolution of the magnetization vector is identical in all
repetitions (magnetization is evolving coherently), and the sum of the signals from the individual
measurements, called transients, is simply ny(t). However, the absolute size of the signal is not
important, what really matters is the signal-to-noise ratio. Therefore, it is also important how noise
accumulates when adding signals of separate measurements. The analysis presented in Section 3.10.3
shows that the signal-to-noise ratio is proportional to the square root of the number of summed
transients.
3.3 Fourier transformation
The eﬀect of electrons (chemical shift) makes NMR signal much more interesting but also much
more complicated. Oscillation of the voltage induced in the receiver coil is not described by a cosine
function, but represents a superposition (sum) of several cosine curves (phase-shifted and dumped).
It is practically impossible to get the frequencies of the individual cosine functions just by looking at
the recorded interferograms. Fortunately, the signal acquired as a function of time can be converted
into a frequency dependence using a straightforward mathematical procedure, known as Fourier
transformation.
It might be useful to present the basic idea of the Fourier transformation in a pictorial form before
we describe details of Fourier transformation by mathematical equations. The oscillating red dots
in Figure 3.1 represent an NMR signal deﬁned by one frequency ν. Let us assume that the signal
3.3. FOURIER TRANSFORMATION 65
f1 f2 f3 f4 f5
ν = f3
t t
ν = ?
f5
Σ = 0
f4
Σ = 0
f3
Σ = 50
f2
Σ = 0
f1
Σ = 0
Figure 3.1: The basic idea of Fourier transformation.
oscillates as a cosine function but we do not know the frequency. We generate a testing set of cosine
functions of diﬀerent known frequencies fj (blue curves in Figure 3.1) and we multiply each blue
testing function by the red signal. The resulting product is plotted as magenta dots in Figure 3.1.
Then we sum the values of the magenta points for each testing frequency getting one number (the
sum) for each blue function. Finally, we plot these numbers (the sums) as the function of the testing
frequency. How does the plot looks like? If the testing frequency diﬀers from ν, the magenta dots
oscillate around zero and their sum is close to zero (slightly positive or negative, depending on
how many points were summed). But if we are lucky and the testing frequency matches ν (f3 in
Figure 3.1), the result is always positive (we always multiply two positive numbers or two negative
numbers). The sum is then also positive, the larger the more points are summed. Therefore, the sum
for the matching frequency is much higher than the other sums, making a positive peak in the ﬁnal
green plot (the dependence on fj). The ﬁnal plot represents a frequency spectrum and the position of
the peak immediately identiﬁes the value of the unknown frequency. If the NMR signal is composed
of two frequencies, the red dots oscillate in a wild interference patterns, not allowing to get the
frequency simply by measuring the period of the oscillation. However, the individual components (if
they are suﬃciently diﬀerent) just make several peaks in the ﬁnal green plot and their frequencies
can be easily obtained by reading the positions of the peaks.
Let us now try to describe the Fourier transformation in a bit more mathematical manner (a more
detailed discussion is presented in Section 3.10.4). If a continuous signal y(t) were recorded using
quadrature detection, i.e., stored as complex numbers, it would be appropriate to apply continuous
complex Fourier transformation, deﬁned as
66
Signalinchannel1:{y(t)}
t
Signalinchannel2:{y(t)}
t
{Y(ω)}
ω
Ω
1/R22R2
{Y(ω)}
ω
Ω
Figure 3.2: Ideal signal detected with a quadrature detection (top) and its Fourier transform (bottom).
Y (ω) =
∞ˆ
−∞
y(t)e−iωt
dt. (3.2)
Important properties of continuous complex Fourier transformation are summarized in Section
3.10.6.
Although the actual NMR signal is not recorded and processed in a continuous manner, the idealized
continuous Fourier transformation helps to understand the fundamental relation between the
shapes of FID and frequency spectra and reveals important features of signal processing. Therefore,
we discuss the continuous Fourier transformation before we proceed to the discrete analysis.
An ”ideal signal” (see Figure 3.2) has the form y(t) = 0 for t ≤ 0 and y(t) = Ae−R2t
eiΩt
for t ≥ 0,
where A can be a complex number (complex amplitude), including the real amplitude |A| and the
initial phase φ0:
A = |A|eiφ0
. (3.3)
As derived in Section 3.10.5,
Fourier transform of the ”ideal” signal is
Y (ω) =
∞ˆ
−∞
Ae−R2t
eiΩt
e−iωt
dt = A
R2
R2
2 + (Ω − ω)2
+ iA
Ω − ω
R2
2 + (Ω − ω)2
(3.4)
If φ0 = 0, the blue term, known as the absorption line is a real function ( {Y (ω)}) having a shape
3.3. FOURIER TRANSFORMATION 67
Signalinchannel1:{y(t)}
t
Signalinchannel2:{y(t)}
t
{Y(ω)}
ω
Ω1
Ω2
Ω3
{Y(ω)}
ω
Ω1
Ω2 Ω3
Figure 3.3: Signal (top) and frequency spectrum (bottom) with three precession frequencies.
of the Lorentz curve (see Figure 3.2). The shape of the absorption line is given1
by the relaxation
rate R2:
• Peak height ∝ 1/R2 (Y = Ymax at ω = Ω ⇒ Ymax = Y (Ω) = A/R2)
• Linewidth at the half-height = 2R2 (Y = Ymax/2 at Ω − ω = ±R2)
The red term, the dispersion line, is purely imaginary ( {Y (ω)}) if φ0 = 0. Such shape is less
convenient in real spectra containing several lines because the broad wings of the dispersion line
distort the shape of the neighbouring lines (see Figure 3.2).
Figure 3.3 documents that Fourier transformation allows us to immediately determine several
precession frequencies in spectra even if the signal in the time domain (FID) is very diﬃcult to
interpret, and that the real (absorption) part of the complex spectrum is much better for such
purpose.
Figures 3.4 and 3.5 document the advantage of recording the signal with the quadrature detection,
as a complex number. If we take only the signal from the ﬁrst channel, oscillating as the cosine
function if φ = 0, and stored as the real part if the quadrature detection is used (Figure 3.4), and
perform the Fourier transformation, we get a spectrum with two peaks with the frequency oﬀsets
Ω and −Ω. Such a spectrum does not tell us if the actual Larmor frequency is ω0 = ωradio − Ω or
ω0 = ωradio + Ω. If we use the signal from the second channel only, oscillating as the sine function
if φ = 0 (Figure 3.5), a spectrum with two peaks is obtained again, the only diﬀerence is that the
1
In practice, it is also aﬀected by inhomogeneities of the static magnetic ﬁeld, increasing the apparent value of R2.
This eﬀect is known as inhomogeneous broadening.
68
Signalinchannel1:{y(t)}
t
Signalinchannel2:{y(t)}
t
{Y(ω)}
ω
Ω−Ω
{Y(ω)}
ω
Ω−Ω
Figure 3.4: A signal detected in the ﬁrst (”real”) channel (top) and its Fourier transform (bottom).
peaks have opposite phase (i.e., their phases diﬀer by 180 ◦
). But if we combine both signals, the
false peaks at −Ω disappear because they have opposite signs and cancel each other in the sum of
the spectra.
The discussed transformation of a continuous signal is extremely useful for understanding the
relation between evolution of the magnetization vector and shape of the peaks observed in the
frequency spectra. But in reality, the signal is ﬁnite (tmax < ∞) and discrete (∆t > 0):
t ∈ { 0, ∆t, 2∆t, · · · , (N − 1)∆t }
y(t) ∈ { y0, y1, y2, · · · , yN−1 }
As a consequence, the frequency spectrum is also discrete (∆ω > 0) and ﬁnite (has a deﬁned
spectral width N∆ω):
ω ∈ { 0, ∆ω, 2∆ω, · · · , (N − 1)∆ω }
Y (ω) ∈ { Y0, Y1, Y2, · · · , YN−1 }
The seemingly marginal diﬀerence between ideal and real (ﬁnite and discrete) signal has several
practical consequences, discussed in Sections 3.4 and 3.6.
3.4 Consequence of ﬁnite signal acquisition
In reality, the acquisition of signal stops at a ﬁnite time tmax:
Y (ω) =
tmaxˆ
0
Ae(iΩ−R2)t
e−iωt
dt = A
1 − e−R2tmax
ei(Ω−ω)tmax
R2 − i(Ω − ω)
. (3.5)
3.5. DISCRETE FOURIER TRANSFORMATION 69
Signalinchannel1:{y(t)}
t
Signalinchannel2:{y(t)}
t
{Y(ω)}
ω
Ω
−Ω
{Y(ω)}
ω
Ω−Ω
Figure 3.5: A signal detected in the second (”imaginary”) channel (top) and its Fourier transform (bottom).
It has some undesirable consequences:
Leakage: Part of the signal is lost, peak height Y (Ω) < A/R2.
Truncation artifacts: For R2 → 0,
Y (ω) =
tmaxˆ
0
Ae(i(Ω−ω))t
dt = A
1 − ei(Ω−ω)tmax
−i(Ω − ω)
= A
sin(Ω − ω)tmax
Ω − ω
+ iA
1 − cos(Ω − ω)tmax
Ω − ω
. (3.6)
If the acquisition is stopped before the signal relaxes completely, artifacts (baseline oscillation)
appear. In the limit of no relaxation, the real part of the Fourier-transformed signal does not have a
pure absorption shape (Lorentz curve), but has a shape of the sin(Ω − ω)tmax/(Ω − ω)tmax function
(sinc function).
The ﬁnite nature of the acquired signal has also a subtle eﬀect known as loss of causality. The phenomenon
is discussed in detail Section 3.10.7 and a simple way of avoiding its undesired consequences
is described in Section 3.7.
3.5 Discrete Fourier transformation
As mentioned in Sections 3.2 and 3.3, the digitized acquired signal is ﬁnite (tmax < ∞) and discrete
(∆t > 0):
t ∈ { 0, ∆t, 2∆t, · · · , (N − 1)∆t }
y(t) ∈ { y0, y1, y2, · · · , yN−1 }
70
Signalinchannel1:{y(t)}
t
Signalinchannel2:{y(t)}
t
{Y(ω)}
ω
Ω
{Y(ω)}
ω
Ω
{Y(ω)}
ω
Figure 3.6: Eﬀect of ﬁnite acqusition in the limit R2 → 0.
Therefore, discrete Fourier transformation is typically applied, producing digital spectra with the
spectral width limited to N∆ω
ω ∈ { 0, ∆ω, 2∆ω, · · · , (N − 1)∆ω }
Y (ω) ∈ { Y0, Y1, Y2, · · · , YN−1 }
or, expressing ω as 2πf (in Hertz)
f ∈ { 0, ∆f, 2∆f, · · · , (N − 1)∆f }
Y (ω) ∈ { Y0, Y1, Y2, · · · , YN−1 }
As shown in Section 3.10.8, the values of ∆t, ∆f and N are not independent in the discrete
Fourier transformation, but they are restricted by the relation
∆f∆t = 1/N. (3.7)
The consequences of the requirement ∆f∆t = 1/N are:
• spectral width N∆f = 1/∆t, it is deﬁned by the choice of the time increment
• digital resolution ∆f = 1/N∆t, it is deﬁned by the choice of the maximum acquisition time
Possible deﬁnitions of the discrete Fourier transform with a correct normalization (so that ∆f∆t =
1/N) are
Yk =
N−1
j=0
yje−i 2π
N
kj
yj =
1
N
N−1
k=0
Ykei 2π
N
kj
(3.8)
or
3.6. CONSEQUENCE OF DISCRETE SIGNAL ACQUISITION 71
Yk =
1
√
N
N−1
j=0
yje−i2π
N
kj
yj =
1
√
N
N−1
k=0
Ykei 2π
N
kj
. (3.9)
3.6 Consequence of discrete signal acquisition
As derived in Section 3.10.9, the discrete ”ideal” NMR signal
yj = Ae−R2j∆t
ei2πνj∆t
(3.10)
has a Fourier transform
Yk =
N−1
j=0
Ae−R2j∆t
ei2πνj∆t
e−i 2π
N
kj
∆t = A∆t
1 − e−R2N∆t
eiπ(N−2k)
1 + (1 − R2∆t)e−i2π k
N
. (3.11)
Since the signal is discrete, the spectral width is limited: ∆t > 0 ⇒ N∆f = 1/∆t < ∞. The
consequences of the discrete sampling are, as derived in Section 3.10.9:
Aliasing: A peak of the real frequency ν + N∆f (outside the spectral width) appears at the
apparent frequency ν in the spectrum (Nyquist theorem: frequencies ν and ν + 1/∆t cannot be
distinguished).
Oﬀset: Peak height of the continuous Fourier transform Y (f) = A/R2 and oﬀset of the continuous
Fourier transform Y (±∞) = 0. Peak height of the discrete Fourier transform.
YN
2
= A∆t
1 − e−R2N∆t
R2∆t
→ A/R2 (3.12)
for N∆t → ∞, but oﬀset of the discrete Fourier transform is non-zero. For a suﬃciently long
acquisition time (compared to the relaxation rate, i.e., N∆t 1/R2), the oﬀset is equal to half of
the intensity of the signal at the ﬁrst time point y(0).
3.7 Zero ﬁlling
Routinely, a sequence of NZ zeros is appended to the recorded signal, mimicking data obtained at
time points N∆t to (N + NZ − 1)∆t:
t ∈ { 0, ∆t, 2∆t, · · · , (N − 1)∆t }
y(t) ∈ { y0, y1, y2, · · · , yN−1 }
↓
(3.13)
t ∈ { 0, ∆t, 2∆t, · · · , (N − 1)∆t, N∆t, (N + 1)∆t, · · · , (N + NZ − 1)∆t }
y(t) ∈ { y0, y1, y2, · · · , yN−1, 0, 0, · · · , 0 }
(3.14)
This may look like a completely artiﬁcial procedure, but there are several practical reasons to do
it.
72
Signalinchannel1:{y(t)}
t
Signalinchannel2:{y(t)}
t
{Y(ω)}
ω
Ω1 = Ω2 − 2π/∆t
{Y(ω)}
ω
Ω1 = Ω2 − 2π/∆t
Figure 3.7: Aliasing. If the signal is acquired in discrete time intervals (dots in the top plots), the signals with frequencies
diﬀerent by an integer multiple of 2π/∆t, shown by solid (Ω1) and dotted (Ω2) lines, cannot be distinguished.
Both signals give a peak with the same frequency in the spectrum. This frequency is equal to Ω1 and to Ω2 − 2π/∆t,
where 2π/∆t is the width of the spectrum.
1. The very fast computational algorithm of calculating Fourier transform, known as Cooley–
Tukey FFT, requires the number of time points to be an integer power of 2. If the number of
collected time points N is not a power of 2, NZ zeros are added to the data prior to Fourier
transformation so that N + NZ is an integer power of 2.
2. In order to obtain a spectrum with the full content of information by discrete Fourier transformation,
the collected data must be extended by a factor of 2 by zero-ﬁlling. This operation
reintroduces causality, as it was brieﬂy mentioned in Section 3.4 and is discussed in detail in
Section 3.10.7. The important consequence is that the full information content of N experimental
complex points (i.e., N points of the real part and N points of the imaginary part,
together 2N bits of information) is encoded in the spectrum (i.e., in the real part of the Fourier
transform, which now consists of 2N frequency points because we artiﬁcially increased the
maximum time from (N − 1)∆t to (2N − 1)∆t and therefore narrowed the frequency sampling
step ∆f from 1/N∆t to 1/2N∆t).
3. The digital resolution ∆f, given by 1/(N∆t), can be improved (narrowed) to 1/((N+NZ)∆t) by
zero-ﬁlling. In this manner, the visual appearance of spectra can be improved by interpolation
between data points. Note, however, that adding more than N zeros does not improve the
informational content of the spectrum. Although the digital resolution is improved, the real
resolution is the same, zero-ﬁlling does not help to resolve frequencies that diﬀer less than
1/(N∆t)!
3.8. PHASE CORRECTION 73
Signalinchannel1:{y(t)}
t
Signalinchannel2:{y(t)}
t
{Y(ω)}
ω
{Y(ω)}
ω
Figure 3.8: A signal with the initial phase of 60 ◦
(top) provides distorted spectra (bottom), unless a phase correction
is applied.
3.8 Phase correction
So-far, we ignored the eﬀect of the initial phase φ0 and analyzed Fourier transforms of NMR signals
consisting of a collection of (damped) cosine functions, with zero initial phase. In reality, the signal
has a non-zero phase, diﬃcult to predict
y(t) = Ae−R2(t+t0)
eiΩ(t+t0)
= |A|e−R2(t+t0)
ei(Ω(t+t0)+φ0)
. (3.15)
The phase has a dramatic impact on the result of the Fourier transformation. Real and imaginary
parts are mixtures of absorption and dispersion functions. If we plot the real part as a spectrum, it
looks really ugly for a non-zero phase.
For a single frequency, the phase correction is possible (multiplication by the function e−(iΩt0+φ0)
,
where t0 and φ0 are found empirically):
|A|e−R2(t+t0)
ei(Ω(t+t0)+φ0)
e−i(Ωt0+φ0)
= |A|e−R2(t+t0)
eiΩt
. (3.16)
In practice, phase corrections are applied also to signal with more frequencies, as described in
Section 3.10.10. The signal is multiplied by a function e−i(ϑ0+ϑ1ω)
, where ϑ0 and ϑ1 are zero-order and
ﬁrst-order phase corrections, respectively. We try to ﬁnd ϑ0 and ϑ1 giving the best-looking spectra.
The procedure is a trial-and-error process, but modern computers allow us to vary ϑ0 and ϑ1 and
repeat the Fourier transformation iteratively in a very short time. Note that phase correction is always
necessary, but only approximative corrections are possible for a signal with multiple frequencies!
74
3.9 Apodization
The NMR signal is very often multiplied by a so-called window function prior to Fourier transfor-
mation.2
This process is known as apodization. The goal is to
1. improve sensitivity. Due to the relaxation, signal of data acquired at later time points is lower,
but the noise is the same. Therefore, the late time points decrease the signal-to-noise ratio.
The sensitivity can be improved by discarding or attenuating the late time points.
2. improve resolution. As the resolution is given by 1/(N∆t), resolution is improved if the signal
is multiplied by a window function that ampliﬁes the late data points.
3. suppress truncation artifacts. We have seen that oscillations of the baseline appear if the data
acquisition stops before the signal relaxes to zero (i.e., to the noise level). The desired eﬀect of
relaxation can be mimicked by a window function that smoothly converges to zero at N∆t.
Obviously, the three listed goals are in conﬂict, and only a compromise can been reached. There
is no ”best apodization”. The choice of the optimal window function depends on the actual needs.
The simplest window function is a rectangle: multiplying the signal by a rectangular function
equal to 1 for j∆t ≤ m∆t and to 0 for j∆t > m∆t represents discarding data recorded for times
longer than m∆t. It is a very useful way of improving signal-to-noise ratio if the signal relaxed before
m∆t. Otherwise, it produces severe truncation artifacts.
The highest signal-to-noise ratio is provided by a matched ﬁlter window function. The matched
ﬁlter has the shape of the envelope of the signal. The matched ﬁlter for our ideal signal is e−R2j∆t
.
The price paid for the signal-to-noise improvement is a lower resolution: Multiplying e−R2t
eiΩ∆t
by
e−R2t
obviously doubles the linewidth, given by the decay rate, which is now 2R2. The best balance
between resolution and truncation artifacts for an allowed extra line broadening λ is obtained with
the Dolph–Chebyshev window, deﬁned in Section 3.10.11, which is, however, not used in practice
due to its very complex form. Instead, sine-bell windows sinp 2π−φ
N
j + φ are used routinely, usually
with the phase φ = π/2 (i.e., cosine function) and with the power p = 1 or p = 2.
HOMEWORK
Derive equations describing continuous and discrete Fourier transformation of an ideal NMR signal
(Sections 3.10.5 and 3.10.9, respectively), and describe the consequences of using discrete Fourier
transformation (Section 3.10.9).
2
The mathematical expression describing the Fourier-transformed product of two functions, signal and window in
our case, is given by the convolution theorem, presented in Section 3.10.6.
3.10. SUPPORTING INFORMATION 75
3.10 SUPPORTING INFORMATION
3.10.1 Setting up NMR experiment
• Temperature control and calibration. Temperature aﬀects molecular motions and chemical shifts, it should be controlled carefully to
obtain reproducible spectra and to analyze them quantitatively. The sample temperature is controlled by a ﬂow of pre-heated/cooled
air or nitrogen gas. The exact temperature inside the sample is not so easy to measure. Usually, spectra of compounds with known
temperature dependence of chemical shifts are recorded (e.g. methanol). The temperature is obtained by comparing a diﬀerence
of two well deﬁned chemical shifts (of methyl and hydroxyl protons in the case of methanol) with its values reported for various
temperatures. Purity of the standard samples is a critical issue.
• Field-frequency lock. The external magnetic ﬁeld should be stationary. It is achieved by a feedback system known as ﬁeld-frequency
lock. A deuterated compound (usually heavy water or other deuterated solvent) is added to the sample and the deuterium frequency
is measured continually and kept constant by adjusting electric current in an auxiliary electromagnet. The lock parameters for the
particular deuterium compound used are selected and the deuterium spectrometer is switched on before the measurement.
• Shimming. The external magnetic ﬁeld should be also homogeneous. The inhomogeneities caused e.g. by the presence of the
sample are compensated by adjusting electric current in a set of correction coils called shims. This is usually at least partially
automated.
• Tuning. Each radio-frequency circuit in the probe consists of a receiver coil and two adjustable capacitors. The capacitors should be
adjusted for each sample. The tuning capacitor of the capacitance CT and the coil of the inductance L make an LC circuit, acting
as a resonator. Adjusting the value of CT deﬁnes the resonant frequency, which should be equal to the precession frequency of the
measured nucleus ω0. If we neglect the second capacitor, the resonant frequency is ω = 1/
√
LCT. The second, matching capacitor
of the capacitance CM is used to adjust the impedance of the resonator. The radio waves do not travel from the transmitter to
the coil through air but through co-axial cables. In order to have minimum of the wave reﬂected back to the transmitter, the
impedance of the resonator (coil circuit) Zc, given by
Zc =
1
1
ZM
+ 1
ZT+ZL+R
=
1
iωCM + 1
1
iωCT
+ iωL + R
,
should match the input impedance Zin. In order to tune the circuit, CT and CM must be adjusted simultaneously to get (i)
Zc = Zin and (ii) ω = ω0.
• Calibration of pulse duration. The magnitude of B1 cannot be set directly. Therefore, the duration of irradiation rotating M by
360 ◦ at the given strength of radio waves is searched for empirically. This duration is equal to 2π/ω1 and can be used to calculate
ω1 or |B1| = ω1/γ. As |B1| is proportional to the square root of power P, durations of pulses of radio waves of other strengths need
not be calibrated, but can be recalculated. Power is measured in the units of Watt, but the relative power is usually expressed on
a logarithmic scale in decibells (dB). One Bell represents a ten-fold attenuation of power
log10
P2
P1
= attenuation/B.
Consequently,
10 log10
P2
P1
= attenuation/dB,
and
20 log10
|B1|2
|B1|1
= 10 log10
|B1|2
2
|B1|2
1
= 10 log10
P2
2
P2
1
= attenuation/dB. (3.17)
3.10.2 Quadrature detection and demodulation
Precession of the magnetization vector in the sample induces an electromotive force (voltage) oscillating with the same frequency ω0 in the
coil of the NMR probe.
The signal generated in the coil and ampliﬁed in the preampliﬁer is split into two channels, labeled a and b here. The signal in each
channel is mixed with a reference wave supplied by the radio-frequency synthesizer. The reference waves have the same frequency −ωradio
in both channels, but their phases are shifted by 90 ◦. Let us assume that the signal oscillates as a cosine function cos(ω0t) and that
76
the reference wave in the ﬁrst channel is a cosine wave cos(−ωradiot) and that the reference wave in the second channel is a sine wave
sin(−ωradiot). Mathematically, splitting the signal and mixing it with the reference wave can be described as
cos(ω0t) →
1
2
cos(ω0t) → 1
2
cos(ω0t) cos(−ωradiot) channel a
1
2
cos(ω0t) → 1
2
cos(ω0t) sin(−ωradiot) channel b
(3.18)
Basic trigonometric identities show that the result of mixing in the ﬁrst channel is a sum of a high-frequency cosine wave cos((ω0 −
ωradio)t) and a low-frequency cosine wave cos((ω0 + ωradio)t) = cos(Ωt), while the result of mixing in the second channel is a diﬀerence of
the corresponding sine waves:
1
2
cos(ω0t) cos(−ωradiot) =
1
4
cos((ω0 − ωradio)t) +
1
4
cos((ω0 + ωradio)t), (3.19)
1
2
cos(ω0t) sin(−ωradiot) =
1
4
sin((ω0 − ωradio)t) −
1
4
sin((ω0 + ωradio)t). (3.20)
The high-frequency waves are ﬁltered out by a low-pass ﬁlter, resulting in signals oscillating with a low frequency ω0 + ωradio = Ω.
The procedure, similar to the demodulation in an ordinary radio receiver, thus produces audio signals in both channels
cos(ω0t) →
1
2
cos(ω0t) → 1
2
cos(ω0t) cos(−ωradiot) → 1
4
cos(Ωt) channel a
1
2
cos(ω0t) → 1
2
cos(ω0t) sin(−ωradiot) → 1
4
sin(Ωt) channel b
(3.21)
The signal also has some amplitude, therefore, we replace the factor of 1/4 by an amplitude A and write
a = A cos(Ωt) b = A sin(Ωt). (3.22)
The described manipulation is called quadrature detection and the unit performing it is called the receiver. The outputs of the receiver
are converted to digital data (series of numbers describing values of the signal at discrete, equally spaced time points). It is convenient to
treat the outputs of the individual channels as a real and imaginary component of a single complex number, but physically they are stored
just as series of two numbers in the computer.
A very useful trick is to play with the order of the stored numbers. The four basic options are
data storing option: a, b conventionally labeled: x
b,−a y
−a,−b −x
−b, a −y
The given storage option is described as the receiver phase in the literature. It is not an accident that the same symbols are used for
the phase of the radio wave transmitted during the pulse and for the receiver phase. Choosing the right storage option (setting the receiver
phase) allows us to remove the eﬀect of changing the pulse phase. For example, a signal recorded immediately after an ideal 90◦ pulse of
phase x (by deﬁnition) oscillates as a = A sin(Ωt), b = −A cos(Ωt) (the magnetization starts to rotate from the −y direction). If we run
the same experiment but with the y phase of the ﬁrst pulse, the signal oscillates as a = A cos(Ωt), b = A sin(Ωt). However, if we use the
option y to store the data, the record is the same as in the previous experiment: b = A sin(Ωt), −a = −A cos(Ωt). We see that the same
signal is obtained if the receiver phase matches the transmitter phase.
3.10.3 Noise accumulation
Here we analyze accumulation of the noise in repeated signal acquisition. The related physics is discussed later in Section 7.10.4. The noise
Unoise(t) is random and so its average3 Unoise(t) = 0. The size of the noise is typically deﬁned by the root-mean-square Unoise(t)2 .
Sum of the noise from n independent experiments is
Unoise,1(t) + Unoise,2(t) + · · · + Unoise,n(t)
2
. (3.23)
All terms like 2Unoise,1(t)Unoise,2(t) are equal to zero because the random motions of electrons in the individual experiments are not
correlated (are independent). Therefore, calculation of the square in Eq. 3.23 simpliﬁes to
Unoise,1(t) + Unoise,2(t) + · · · + Unoise,n(t)
2
= Unoise,1(t)2 + Unoise,2(t)2 + · · · + Unoise,n(t)2 . (3.24)
We can also assume that the root-mean-square is the same in all experiments, and write it as Unoise(t)2 . The sum of the noise
can be then calculated as
n Unoise(t)2 =
√
n Unoise(t)2 . (3.25)
3
To avoid writing the integrals deﬁning averaging, we indicate the time average by the angled brackets.
3.10. SUPPORTING INFORMATION 77
We can now calculate the signal-to-noise ratio as
nUmeasured(t)
√
n Unoise(t)2
=
√
n
Umeasured(t)
Unoise(t)2
. (3.26)
3.10.4 Mathematical description of Fourier transformation
We start with a special case of a signal which can be described by a sum of cosine functions with frequencies that are integer multiples
of some small frequency increment ∆ω. All such cosine functions must have the same value at time t and t + 2π/∆ω: the whole signal is
periodic with the period 2π/∆ω. If we record such a signal using quadrature detection, we obtain
y(t) =
∞
j=−∞
Ajeiωj t
=
∞
j=−∞
Ajeij∆ωt
. (3.27)
The mentioned periodicity allows us to determine Ak by calculating the integrals
2π
∆ωˆ
0
y(t)e−iωkt
dt =
∞
j=−∞
Aj
2π
∆ωˆ
0
ei(j−k)∆ωt
dt =
2π
∆ω
Ak (3.28)
(All integrated functions are periodic and their integrals are therefore equal to zero with the exception of the case when k = j, which
is a constant function).
The same result is obtained for any integration limits which diﬀer by 2π/∆ω, e.g.
+ π
∆ωˆ
− π
∆ω
y(t)e−iωkt
dt =
∞
j=−∞
Aj
+ π
∆ωˆ
− π
∆ω
ei(j−k)∆ωt
dt =
2π
∆ω
Ak (3.29)
We can now continue in two diﬀerent directions. We can describe the signal as it is actually measured, not as a continuous function of
time, but as a discrete series of points sampled in time increments ∆t. Then, the integral in Eq. 3.28 is replaced by summation of a ﬁnite
number of measured signal points:
Yk =
N−1
j=0
yje−ik∆ωj∆t
∆t, (3.30)
where Yk = 2π
∆ω
Ak. As the time and frequency are treated in the same manner, we can also deﬁne the inverse operation
yj =
N−1
k=0
Ykeik∆ωj∆t
∆ω. (3.31)
This way of the signal analysis, discussed in more details in Section 3.5, handles the signal as it is measured in reality. It is also
instructive to follow the other direction and to increase the period 2π/∆ω by decreasing ∆ω. The series of ωk becomes a continuous
variable ω and π/∆ω → ∞ if ∆ω → 0. The sum in Eq. 3.27 is replaced by the integral
y(t) =
1
2π
∞ˆ
−∞
Y (ω)eiωt
dω (3.32)
and the integral in Eq. 3.29 becomes
Y (ω) =
∞ˆ
−∞
y(t)e−iωt
dt. (3.33)
If we apply Eq. 3.33 to a function y(t) and Eq. 3.32 to the obtained result, we should get back the function y(t). Such a double
transformation can be written as
y(t) =
1
2π
∞ˆ
−∞
Y (ω)eiωt
dω =
1
2π
∞ˆ
−∞
eiωt
dω
∞ˆ
−∞
y(t )e−iωt
dt =
∞ˆ
−∞
y(t )dt
1
2π
∞ˆ
−∞
eiω(t−t )
dω. (3.34)
This requires the second integral to be equal to 2π for t = t and to zero for t = t. Therefore, the integral can be used to deﬁne the
delta function
78
δ(t − t ) =
1
2π
∞ˆ
−∞
eiω(t−t )
dω. (3.35)
An alternative deﬁnition including a factor of 1/
√
2π in Aj
y(t) =
∞
j=−∞
Aj
√
2π
eiωj t
=
∞
j=−∞
Aj
√
2π
eij∆ωt
, (3.36)
Y (ω) =
1
√
2π
∞ˆ
−∞
y(t)e−iωt
dt, (3.37)
y(t) =
1
√
2π
∞ˆ
−∞
Y (ω)eiωt
dω. (3.38)
is equally acceptable.
3.10.5 Fourier transformation of an ideal NMR signal
Let us assume that an ideal NMR signal has the form Ae(i(Ω−ω)−R2)t. Its Fourier transformation can be calculated easily as
Y (ω) =
∞ˆ
−∞
y(t)e−iωt
dt =
∞ˆ
0
Ae(i(Ω−ω)−R2)t
dt =
−A
i(Ω − ω) − R2
= A
1
R2 − i(Ω − ω)
R2 + i(Ω − ω)
R2 + i(Ω − ω)
= A
R2 + i(Ω − ω)
R2
2 + (Ω − ω)2
(3.39)
3.10.6 Properties of continuous Fourier transformation
The continuous Fourier transformation has several important properties:
• Parseval’s theorem
∞´
−∞
|y(t)|2dt = 1
2π
∞´
−∞
|Y (ω)|2dω
A conservation law, documents that the signal energy (information content) is preserved by the Fourier transformation.
• Linearity
∞´
−∞
(y(t) + z(t))e−iωtdt = Y (ω) + Z(ω)
It documents that a sum of periodic functions (diﬃcult to be distinguished in the time domain) can be converted to a sum of
resonance peaks (easily distinguishable in the frequency domain if the resonance frequencies diﬀer).
• Convolution
∞´
−∞
(y(t) · z(t))e−iωtdt =
∞´
−∞
Y (ω)Z(ω − ω )dω
It provides mathematical description of apodization (Section 3.9)
• Time shift
∞´
−∞
y(t − t0)e−iωtdt = Y (ω)e−iωt0
It shows that time delays result in frequency-dependent phase shifts in the frequency domain (Section 3.8)
• Frequency modulation
∞´
−∞
y(t)eiω0te−iωtdt = Y (ω − ω0)
It shows that the apparent frequencies can be shifted after acquisition.
• Causality
∞´
−∞
y(t)e−iωtdt =
∞´
0
y(t)e−iωtdt
It says that no signal is present before the radio-wave pulse (this is why we can start integration at t = 0 or t = −∞, y(t) = 0 for
t < 0). This provides an extra piece of information allowing us to reconstruct the imaginary part of the signal from the real one
and vice versa (Figure 3.9 and Section 3.10.7).
3.10. SUPPORTING INFORMATION 79
3.10.7 Causality and reconstruction of imaginary signal
The consequence of causality mentioned at the end of Section 3.10.6 is rather subtle. As mentioned above, the NMR signal is recorded in
two channels, as a real and imaginary part of a complex number. It is because Fourier transformation of a cosine (or sine) function gives
a symmetric (or antisymmetric) spectrum with two frequency peaks and thus does not allow us to distinguish frequencies higher than the
carrier frequency from those lower than the carrier frequency. Once we have the transformed complex signal in the frequency domain, we
can ask whether we need both its parts (real and imaginary). It looks like we do because the inverse Fourier transformation of just the real
(imaginary) part produces a symmetric (antisymmetric) picture in the time domain (the second row in Figure 3.9). But the causality tells
us that this is not a problem because we know that there is no signal left from the zero time – the symmetry does not bother us because we
know that we can reconstruct the time signal simply by discarding the left half of the inverse Fourier image (the third row in Figure 3.9).
The time signal reconstructed from the real part of the frequency spectrum only can be then Fourier transformed to provide the missing
imaginary part of the frequency spectrum. The time signal can be reconstructed from the imaginary part of the frequency spectrum in the
same manner (although this is not done typically). This discussion shows that the real and imaginary parts of the frequency spectrum are
not independent but carry the same information.
It should be emphasized, however that the causality principle does not apply to the NMR signal as it is acquired. In reality, the signal
is ﬁnite, i.e., it is acquisition is stopped at tmax < ∞. Therefore, our generalization to Eq. 3.32 does not correspond to the reality: when
the Fourier transformation of really acquired signal is performed, the upper limit of integration is not inﬁnity but tmax. The transformation
is mathematically equivalent to Eq. 3.28, with tmax = 2π/∆ω = 1/∆f. Therefore, the signal can be classiﬁed as periodic4 with the period
tmax = 2π/∆ω = 1/∆f. The real and imaginary parts of the integral are independent series of cosine and sine functions (Fourier series,
see Eq. 3.27), which contradicts the causality principle (the real and imaginary parts of a causal signal are dependent, as discussed above).
In order to introduce causality, zero ﬁlling has to be applied to the acquired signal, as described in Section 3.7. We take the signal as
a function y(t) deﬁned in the range 0 ≤ t ≤ tmax. We extend y(t) to a function yZF(t) deﬁned in a range −tmax ≤ t ≤ tmax by setting
yZF
(t) =
0 for − tmax ≤ t ≤ 0
y(t) for 0 ≤ t ≤ tmax
(3.40)
The extended function yZF(t) fulﬁlls the requirement of causality (there is no signal before applying the radio-wave pulse at t = 0).
As a consequence, the {Y ZF(t)} and {Y ZF(t)} are not independent, and each of them carries the full information. In practice, yZF(t)
is constructed in a slightly diﬀerent manner (see Section 3.7):
yZF
(t) =
y(t) for 0 ≤ t ≤ tmax
0 for tmax ≤ t ≤ 2tmax
(3.41)
Both variants (zeros before t = and zeros after tmax) are mathematically equivalent because y(t) deﬁned in the range 0 ≤ t ≤ tmax
has the periodic nature. Increasing 2tmax Eq. 3.41 to a higher number does not have any eﬀect on the information content of {Y ZF(t)}
and {Y ZF(t)}.
3.10.8 Spectral width, resolution, and sampling
We may try to deﬁne the discrete Fourier transform as
Yk =
N−1
j=0
yje−ik∆ωj∆t
∆t =
N−1
j=0
yje−i2π∆f∆tkj
∆t, (3.42)
yj =
N−1
k=0
Ykeik∆ωj∆t
∆t =
N−1
k=0
Ykei2π∆f∆tkj
∆f. (3.43)
However, there is a catch here. It turns out that ∆t and ∆f are not independent, but closely related. The transformation can be
written in a matrix form as







Y0
Y1
Y2
..
.
YN−1







=







F0,0 F0,1 F0,2 . . . F0,N−1
F1,0 F1,1 F1,2 . . . F1,N−1
F2,0 F2,1 F2,2 . . . F2,N−1
..
.
...
...
...
...
FN−1,0 FN−1,1 FN−1,2 . . . FN−1,N−1







ˆF







y0
y1
y2
...
yN−1







∆t, (3.44)
where the elements of the matrix ˆF are Fjk = e−i2π∆f∆t·k·j.
Let us now try to transform Yk back to the time domain:
4
Note that the signal is really acquired in a periodic manner in practice. The experiment is repeated several times in order to accumulate
data and improve the signal-to-noise ratio. The repeated experiments start from some steady state, not from equilibrium because we do
not let the relaxation to act for an inﬁnite time.
80
{Y(ω)}
ω
Ω
{Y(ω)}
ω
1
2π
∞´
−∞
{Y(ω)}eiωt
dω
t
1
2π
∞´
−∞
{Y(ω)}eiωt
dω
t
Signalinchannel1:{y(t)}
t
Signalinchannel2:{y(t)}
t
Figure 3.9: Causality of NMR signal. If we take a frequency spectrum, discard its imaginary part (the ﬁrst row), and perform the
inverse Fourier transformation, we do not get the original signal (starting at t = 0), but a set of symmetric (real part) and antisymmetric
(imaginary part) functions predicting non-zero signal before t = 0 (the second row). However, we can apply our knowledge that no signal
was present before t = 0 and multiply the left half of the predicted signal by zero. This recovers the actual signal (the third row). Fourier
transformation of this signal provides both real and imaginary parts of the spectrum, as shown in Figure 3.2.
3.10. SUPPORTING INFORMATION 81







y0
y1
y2
...
yN−1







=









F−1
0,0 F−1
0,1 F−1
0,2 . . . F−1
0,N−1
F−1
1,0 F−1
1,1 F−1
1,2 . . . F−1
1,N−1
F−1
2,0 F−1
2,1 F−1
2,2 . . . F−1
2,N−1
...
...
...
...
...
F−1
N−1,0 F−1
N−1,1 F−1
N−1,2 . . . F−1
N−1,N−1









ˆF −1







Y0
Y1
Y2
...
YN−1







∆f, (3.45)
where the elements of the matrix ˆF−1 are F−1
jk = e+i2π∆f∆t·k·j. Substituting from Eq. 3.44,







y0
y1
y2
..
.
yN−1







=









F−1
0,0 F−1
0,1 F−1
0,2 . . . F−1
0,N−1
F−1
1,0 F−1
1,1 F−1
1,2 . . . F−1
1,N−1
F−1
2,0 F−1
2,1 F−1
2,2 . . . F−1
2,N−1
...
...
...
...
...
F−1
N−1,0 F−1
N−1,1 F−1
N−1,2 . . . F−1
N−1,N−1
















F0,0 F0,1 F0,2 . . . F0,N−1
F1,0 F1,1 F1,2 . . . F1,N−1
F2,0 F2,1 F2,2 . . . F2,N−1
...
...
...
...
...
FN−1,0 FN−1,1 FN−1,2 . . . FN−1,N−1














y0
y1
y2
...
yN−1







∆f∆t. (3.46)
In order to get the original signal, the product of the transformation matrices, ˆF−1 ˆF multiplied by ∆f∆t, must be a unit matrix:









F−1
0,0 F−1
0,1 F−1
0,2 . . . F−1
0,N−1
F−1
1,0 F−1
1,1 F−1
1,2 . . . F−1
1,N−1
F−1
2,0 F−1
2,1 F−1
2,2 . . . F−1
2,N−1
...
...
...
...
...
F−1
N−1,0 F−1
N−1,1 F−1
N−1,2 . . . F−1
N−1,N−1
















F0,0 F0,1 F0,2 . . . F0,N−1
F1,0 F1,1 F1,2 . . . F1,N−1
F2,0 F2,1 F2,2 . . . F2,N−1
...
...
...
...
...
FN−1,0 FN−1,1 FN−1,2 . . . FN−1,N−1







∆f∆t =







1 0 0 . . . 0
0 1 0 . . . 0
0 0 1 . . . 0
...
...
...
...
...
0 0 0 . . . 1







. (3.47)
According to the matrix multiplication rule, the jl-element of the product ˆF−1 ˆF is given by
N−1
k=0
e−i2π∆f∆t(jk−kl)
∆t. (3.48)
Clearly, the exponential terms in the sums representing the diagonal elements (j = l) are equal to e−i2π∆f∆t(jk−kj)∆t = e0 = 1.
Therefore, the diagonal elements (sums of N terms e0 = 1) are equal to N. Obviously, we need to set N∆f∆t = 1 to get the elements of
the product ˆF−1 ˆF equal to one.
What about the oﬀ-diagonal elements? For N∆f∆t = 1, the elements of ˆF−1 ˆF are equal to
N−1
k=0
e−i 2π
N
(j−l)k
∆t. (3.49)
The complex numbers in the sum can be visualized as points in the Gauss plane (plane of complex numbers) with the phase of
2πk(l − j)/N. Let us assume that N is an integer power of two (N = 2n, a typical choice in discrete Fourier transform). Then all numbers
in the series are symmetrically distributed in the Gauss plane. As a consequence, their sum is equal to zero (they cancel each other). We
can therefore conclude that setting N∆f∆t = 1 ensures that the product ˆF−1 ˆF is a unit matrix.
3.10.9 Discrete ideal signal
The ideal NMR signal converted to the digital form has a Fourier transform
Yk =
N−1
j=0
Ae−R2j∆t
ei2πνj∆t
e−i 2π
N
kj
∆t. (3.50)
The summation formula5
5
The summation formula can be derived easily. Write the sum
z0
+ z1
+ z2
+ · · · + zN−1
=
N−1
j=0
zj
82
N−1
j=0
zj
=
1 − zN
1 − z
(3.51)
helps us to evaluate the sum. For the sake of simplicity, let us assume that the carrier frequency is chosen so that the peak is in the
middle of the spectrum
ν =
1
2
N∆f =
1
2∆t
. (3.52)
Then, z and zN in the summation formula are
z = e−R2∆t
ei2π( 1
2
− k
N ) = e−R2∆t
1−R2∆t
eiπ
−1
e−i2π k
N = −(1 − R2∆t)e−i2π k
N , (3.53)
zN
= e−R2N∆t
eiπ(N−2k)
. (3.54)
Note that we replaced e−R2∆t by the Taylor series in Eq. 3.53, and neglected terms higher than linear in R2∆t because ∆t is usually
much shorter than 1/R2. Therefore,
Yk = A∆t
1 − e−R2N∆teiπ(N−2k)
1 + (1 − R2∆t)e−i2π k
N
. (3.55)
The consequences of the discrete nature of the signal are:
Aliasing: If we add a value of N∆f to the frequency which was originally in the middle of the frequency spectrum ( 1
2
N∆f = 1
2∆t
),
i.e. add N to k = N/2 in Eq. 3.11, the last exponent in the sum in Eq. 3.11 changes from iπj to i3πj, i.e. by one period (2π), and the
transformed signal (the spectrum) does not change. In general, a peak of the real frequency ν + N∆f (outside the spectral width) appears
at the apparent frequency ν in the spectrum (Nyquist theorem: frequencies ν and ν + 1/∆t cannot be distinguished).
Oﬀset: Peak height of the continuous Fourier transform Y (f) = A/R2 and oﬀset of the continuous Fourier transform Y (±∞) = 0.
Peak height of the discrete Fourier transform.
Y N
2
= A∆t
1 − e−R2N∆t
R2∆t
→ A/R2 (3.56)
for N∆t → ∞, but oﬀset of the discrete Fourier transform
Y0 = A∆t
1 − e−R2N∆teiNπ
2 − R2∆t
→
1
2
A∆t =
1
2
y0∆t (3.57)
for N∆t → ∞ and ∆t → 0. The oﬀset of discrete Fourier transform is non-zero, equal to half of the intensity of the signal at the ﬁrst
time point y(0) if the signal was acquired suﬃciently long to relax completely (N∆t 1/R2).
3.10.10 Zero- and ﬁrst-order phase corrections
Let us assume that the acquired signal was created by a set of rotating magnetization vectors that diﬀer in Ω and that started to rotate
at some unspeciﬁed time −t0 (i.e., before the acquisition started at t = 0) with the same unspeciﬁed phase φ0. For the sake of simplicity,
we assume that all magnetization vectors relax exponentially with the same rate constant R2. Each such magnetization vector produces a
signal
yn(t) = |An|e−R2(t+t0)
ei(Ωn(t+t0)+φ0)
. (3.58)
We multiply the whole signal, i.e., each yn(t), by the correction function e−i(ϑ0+ϑ1ω)
yn(t)e−i(ϑ0+ϑ1ω)
= |An|e−R2(t+t0)
ei(Ωn(t+t0)+φ0)
e−i(ϑ0+ϑ1ω)
= |An|e−R2(t+t0)
ei(Ωn(t+t0)+φ0−ϑ0−ϑ1ω)
. (3.59)
The Fourier transform of such modiﬁed signal is
and multiply it by (1 − z):
(1 − z)(z0
+ z1
+ z2
+ · · · + zN−1
) = z0
− z1
+ z1
− z2
+ z2
+ · · · − zN−1
+ zN−1
− zN
= 1 − zN
= (1 − z)
N−1
j=0
zj
.
Divide the last equation on the previous line by (1 − z) to obtain the summation formula.
3.10. SUPPORTING INFORMATION 83
|An|
∞ˆ
−∞
e−R2(t+t0)
ei(Ωn(t+t0)+φ0−ϑ0−ϑ1ω)
e−iωt
dt = |An|
∞ˆ
−∞
e−R2(t+t0)
ei(Ωn(t+t0)+φ0−ϑ0−ϑ1ω−ωt)
dt. (3.60)
This expression does not change if we multiply it by 1 = ei(ωt0−ωt0).
|An|
∞ˆ
−∞
e−R2(t+t0)
ei(Ωn(t+t0)+φ0−ϑ0−ϑ1ω−ωt)
dt = |An|
∞ˆ
−∞
e−R2(t+t0)
ei(Ωn(t+t0)+φ0−ϑ0−ϑ1ω+ωt0−ω(t+t0))
dt. (3.61)
Note that changing the variable from t to t + t0 does not change the Fourier transform (the integral) because dt = d(t + t0) and the
integration limits are −∞ and ∞ in both cases
|An|
∞ˆ
−∞
e−R2(t+t0)
ei(Ωn(t+t0)+φ0−ϑ0−ϑ1ω+ωt0−ω(t+t0))
d(t+t0) = |An|ei(φ0−ϑ0+ω(t0−ϑ1))
=1
if φ0=ϑ0 and t0=ϑ1
∞ˆ
−∞
e−R2(t+t0)
ei(Ωn(t+t0)
e−ω(t+t0))
d(t + t0)
Yn(ω)
.
(3.62)
As we can see, the eﬀects of the initial phase and time shift are removed by multiplying the signal with e−i(ϑ0+ϑ1ω) if we succeed to
ﬁnd the ﬁrst-order phase correction ϑ0 = φ0 and the second-order phase correction ϑ1 = t0.
3.10.11 Dolph–Chebyshev window
The Dolph–Chebyshev window function is deﬁned as
1
√
N
N−1
k=0
cos 2(N − 1) arccos
cos(πk/N)
cos(πλ∆t/2)
cosh 2(N − 1)arccosh 1
cos(πλ∆t/2)
ei 2π
N
kj
. (3.63)
84
Part II
Quantum description
85
Lecture 4
Review of quantum mechanics
Literature: This chapter starts with a brief review of quantum mechanics. Textbooks covering
this topic represent the best source of information. Brown presents in B9 a useful review of classical
mechanics, usually missing in the quantum mechanics textbooks (assuming that students learnt the
classical mechanics earlier, which is true in the case of students of physics, but not so often in the case
of chemistry or biology students), and reviews quantum mechanics in B13, B15, and B16. B1–B5
provides overview of the relevant mathematical tools. NMR books also provide some introduction.
Keeler reviews quantum mechanics in very understandable fashion, using the concept of spin from
the very beginning (K3.2 and K6). Levitt proceeds more like us (L6–7). A condensed summary is
presented in C2.1 (short, rigorous, but not a good start for a novice).
4.1 Wave function and state of the system
This course should not provide explanation of principles of quantum mechanics, it should build on
an already acquired knowledge. Nevertheless, we brieﬂy review basics of quantum mechanics in this
lecture because in the following lectures, we use the quantum mechanical approach to describe NMR.
Quantum mechanics was introduced because Newton mechanics did not described experiments
correctly. Yet, knowledge of classical mechanics is very helpful in discussions of various ideas and
approaches of quantum mechanics. Those who did not have chance to study Hamiltonian and
Lagrangian mechanics in other courses may ﬁnd a short summary of issues related to our topic
(NMR) in Sections 4.9.1–4.9.3.
Quantum mechanics is postulated, not derived. It can be only tested experimentally. The basic
diﬀerences between Newton and quantum mechanics are listed below.
• Newton mechanics: coordinates x, y, z and moments p of all particles describe all properties of
the current state and all future states
• Quantum mechanics: wave function Ψ describes all properties of the current state and all future
states
We postulate that the state of the system is completely described by a wave function.
The two-slit (Young) experiment may serve as an example of motivation to use quantum mechanics
to describe experimental results. The experiment (presumably known to the reader) asks
87
88
the question whether the studied microscopic objects (e.g. electrons) are particles or waves. The
answer is ”Particles, but with probabilities combined like waves”.1
The wave function used to describe
the studied object can be interpreted as a (complex) probability amplitude Ψ = CeiΦ
. The
(real) probability density is then ρ = Ψ∗
Ψ = |Ψ|2
= |C|2
and the probability of ﬁnding single particle
in volume L3
is
L´
0
L´
0
L´
0
Ψ∗
Ψdxdydz. We see that calculating a probability includes a calculation of
square of the complex probability amplitude. Deﬁnitions of square values of diﬀerent mathematical
objects and the notation used in quantum mechanics are listed in Section 4.9.4. In particular, the
quantum-mechanical notation includes a convention to write
|Ψ ≡ Ψ, Ψ| ≡ Ψ∗
, Ψ|Ψ ≡
Lˆ
0
Lˆ
0
Lˆ
0
Ψ∗
Ψdxdydz. (4.1)
Wave function of a free particle moving in direction x (coordinate frame can be always chosen so
that x is the direction of motion of a free particle) can be written as
Ψ = Cei2π( x
λ
− t
T
)
= Ce
i
(px−Et)
, (4.2)
where h = 2π is the Planck’s constant, p = mv is momentum (along x), and E is (kinetic)
energy. Note that Ψ corresponds to a monochromatic wave with period equal to h/E, wavelength
equal to h/p, and a complex amplitude C (it may contain a phase factor eiφ
).
4.2 Superposition and localization in space
Note that a monochromatic wave function describes exactly what is p of the particle (Figure 4.1A,B),
but does not say anything about position of the particle because ρ = Ψ∗
Ψ = |C| is the same for any
x (distribution of probability is constant from x = −∞ to x = ∞, (Figure 4.1C). Wave function
describing a particle (more) localized in space can be obtained by superposition of monochromatic
waves (Figure 4.2).
Ψ(x, t) = c1 Ae
i
(p1x−E1t)
ψ1
+c2 Ae
i
(p2x−E2t)
ψ2
+ · · · (4.3)
We postulate that if possible states of our system are described by wave functions ψ1, ψ2, . . . ,
their linear combination also describes a possible state of the system.
Note that monochromatic waves are orthogonal and can be normalized (Section 4.9.5).
4.3 Operators and possible results of measurement
We postulated that the wave function contains a complete information about the system, but how
can we extract this information from the wave function?
1
Quantum ﬁeld theory provides more elegant description of fundamental ”particles” than presented in this text.
However, the relations presented in this text can be recovered from the quantum ﬁeld approach.
4.3. OPERATORS AND POSSIBLE RESULTS OF MEASUREMENT 89
A /p
{Ψ}
B /p
{Ψ}
C
Ψ∗
Ψ
x
Figure 4.1: Free particle described by a monochromatic wave function Ψ. The real and imaginary parts of the wave
function are plotted in Panels A and B, respectively, the probability density ρ = Ψ∗
Ψ is plotted in Panel C. Note
that the wavelength and consequently the value of the momentum p is sharply deﬁned (A,B), but the position of the
particle is completely undeﬁned (C).
90
A
{Ψ}
B
{Ψ}
C
Ψ∗
Ψ
x
Figure 4.2: Free particle described by a superposition of ten monochromatic waves functions of the same amplitude.
The real and imaginary parts of the monochromatic wave functions (thin lines) and of the ﬁnal wave function Ψ
(normalized to ahve the same amplitude as the monochromatic wave functions, thick line) are plotted in Panels A and
B, respectively, the probability density ρ = Ψ∗
Ψ is plotted in Panel C. Note that the position of the particle starts
to be deﬁned by the maximum of ρ = Ψ∗
Ψ, but the wavelength and consequently the value of the momentum p is no
longer well deﬁned (A,B).
4.4. EXPECTED RESULT OF MEASUREMENT 91
We postulate that any measurable property is represented by an Hermitian operator (acting on
the wave function) and that result of a measurement must be one of eigenvalues of the operator.
The term eigenvalue and a related term eigenfunction are explained and an example is given is
Section 4.9.6. An operator ˆA is called Hermitian if
Ψ| ˆAΨ = ˆAΨ|Ψ ≡ Ψ| ˆA|Ψ . (4.4)
In this text, we usually write operators with ”hats”, like ˆA. Writing ˆAΨ means ”take function Ψ
and modify it as described by ˆA”. It is not a multiplication: ˆAΨ = ˆA · Ψ, ˆA is not a number but an
instruction what to do with Ψ!
A recipe to calculate possible results of a measurement is:
1. Identify the operator representing what you measure ( ˆA)
2. Find all eigenfunctions |ψ1 , |ψ2 , . . . of the operator and use them as an orthonormal basis2
for Ψ: |Ψ = c1|ψ1 + c2|ψ2 + . . .
3. Calculate individual eigenvalues Aj as
ψj| ˆAψj = ψj|Aj · ψj = Aj ψj|ψj
=1
= Aj. (4.5)
The ﬁrst equation in 4.5 follows from the deﬁnition of eigenfunctions. Aj is just a number and
can be factored out of the brackets (representing integration or summation) as described by the
second equation. The number Aj is the eigenvalue of ˆA for |ψj . The last equation in 4.5 reﬂects
orthonormality of |ψj . If ˆA is Hermitian, the same result must be obtained by calculating
ˆAψj|ψj = A∗
j · ψj|ψj = A∗
j ψj|ψj
=1
= A∗
j , (4.6)
i.e. Aj must be equal to its complex conjugate A∗
j . This is true only for real numbers. As a
result of measurement is always a real number, the eigenvalues must be real numbers. This is why
operators representing a measurable quantity mus be Hermitian.
4.4 Expected result of measurement
Eq. 4.5 tells us what are the possible results of a measurement, but it does not say which value is
actually measured. We can only calculate probabilities of getting individual eigenvalues and predict
the expected result of the measurement.
We postulate that the expected result of measuring a quantity A represented by an operator ˆA
in a state of the system described by a wave function Ψ is
A = Ψ| ˆA|Ψ . (4.7)
2
The term ”orthonormal basis” is described in Section 4.9.5.
92
There are three ways how to do the calculation described by Eq. 4.7:
1. Express Ψ, calculate its complex conjugate Ψ∗
≡ Ψ|, calculate ˆAΨ ≡ | ˆAΨ , and in the manner
of Eq. 4.79
A = Ψ| ˆA|Ψ ≡ Ψ|( ˆAΨ) =
∞ˆ
−∞
Ψ∗
(x, . . . ) ˆAΨ(x, . . . )dx . . . . (4.8)
Three dots in Eq. 4.8 tell us that for anything else than a single free particle (with zero spin)
we integrate over all degrees of freedom, not just over x.
2. Find eigenfunctions ψ1, ψ2, . . . of ˆA and write Ψ as their linear combination Ψ = c1ψ1+c2ψ2+· · ·
(use the eigenfunctions as an orthonormal basis for Ψ). Due to the orthonormality of the basis
functions, the result of Eq. 4.8 is A = c∗
1c1A1 +c∗
2c2A2 +· · · , where A1, A2, . . . are eigenvalues
of ˆA. We see that A is a weighted average of eigenvalues Aj with the weights equal to the
squares of the coeﬃcients (c∗
j cj = |cj|2
). The same result is obtained if we calculate
A = c∗
1 c∗
2 · · ·



A1 0 · · ·
0 A2 · · ·
...
...
...






c1
c2
...


 . (4.9)
We see that we can replace (i) operators by two-dimensional diagonal matrices, with eigenvalues
forming the diagonal, and (ii) wave functions by one-dimensional matrices (known as state
vectors) composed of the coeﬃcients cj. Eq. 4.9 shows calculation of the expected results of
the measurement of A using matrix representation of operators and wave functions. Matrix
representation is a big simpliﬁcation because it allows us to calculate A without knowing how
the operator ˆA and its eigenfunctions look like! We just need the eigenvalues and coeﬃcients
cj. This simpliﬁcation is possible because the right coeﬃcients are deﬁned by the right choice
of the basis.
3. Write Ψ as a linear combination of basis functions ψ1, ψ2, . . . (not necessarily eigenfunctions of
ˆA)
Ψ = c1ψ1 + c2ψ2 + · · · (4.10)
Build a two-dimensional matrix ˆP from the products of coeﬃcients c ∗
j ck:
ˆP =



c1c ∗
1 c1c ∗
2 · · ·
c2c ∗
1 c2c ∗
2 · · ·
...
...
...


 . (4.11)
4.5. OPERATORS OF POSITION AND MOMENTUM, COMMUTATORS 93
Multiply the matrix ˆP by a matrix3 ˆA representing the operator ˆA in the basis ψ1, ψ1, . . . .
The sum of the diagonal elements (called trace) of the resulting matrix ˆP ˆA is equal4
to the
expected value A
A = Tr{ ˆP ˆA }. (4.12)
Why should we use such a bizarre way of calculating the expected value of A when it can be
calculated easily from Eq. 4.9? The answer is that Eq. 4.12 is more general. We can use the
same basis for operators with diﬀerent sets of eigenfunctions.
For the sake of completeness, we should point out that quantum mechanics must also somehow
describe result of a measurement that has been already done and that gave us one value of A. We
need an operator that identiﬁes the wave function describing the state corresponding to the measured
value Am. Such operator is called projection operator ˆPm and its form is derived in Section 4.9.9.
We postulate that if Am is the result of measuring A in the state described by |Ψ , then the
state immediately after the measurement is described by ˆPm|Ψ / Ψ| ˆPm|Ψ , where ˆPm is the
projection operator associated with Am.
4.5 Operators of position and momentum, commutators
We need to ﬁnd operators in order to describe measurable quantities. Let us start with the most
fundamental quantities, position of a particle x and momentum p = mv, that completely describe
studied systems in Newton mechanics. Their operators are deﬁned in terms of their mutual relation.
Therefore, we ﬁrst discuss a general relation of two operators. If we apply two operators subsequently
to the same wave function, order of the operators sometimes does not matter
3
How can we get a matrix representation of an operator with eigenfunctions diﬀerent from the basis? The complete
set of N functions deﬁnes an abstract N-dimensional space (N = ∞ for free particles!). The wave function Ψ is
represented by a vector in this space built from coeﬃcients c1, c2, . . . , as described by Eq. 4.10, and a change of the
basis is described as a rotation in this space. The same rotation describes how the matrix representing the operator
ˆA changes upon changing the basis. Note that the matrix is not diagonal if the basis functions are not eigenfunctions
of ˆA.
4
The trace of the product ˆP ˆA is equal to
c ∗
1 c ∗
2 · · ·



A11 A12 · · ·
A21 A22 · · ·
...
...
...






c1
c2
...


 ,
but it must be also equal to
c ∗
1 c ∗
2 · · ·



A ∗
11 A ∗
12 · · ·
A ∗
21 A ∗
22 · · ·
...
...
...






c1
c2
...



because ˆA is Hermitian. This shows that the diagonal elements of matrices representing Hermitian operators must be
real numbers A ∗
jj = Ajj and the oﬀ-diagonal elements must be such complex numbers that A ∗
jk = Akj.
94
ˆA ˆBf = ˆB ˆAf ⇒ ˆA ˆBf − ˆB ˆAf = 0. (4.13)
However, sometimes the order of operators makes a diﬀerence
ˆA ˆBf = ˆB ˆAf ⇒ ˆA ˆBf − ˆB ˆAf = 0. (4.14)
The diﬀerence of the operators can be viewed as a new operator ( ˆA ˆB − ˆB ˆA) known as the
commutator and written as
ˆA ˆBf − ˆB ˆAf = ( ˆA ˆB − ˆB ˆA)f = [ ˆA, ˆB]f. (4.15)
A non-zero commutator tells us that the quantities represented by ˆA and ˆB are not independent
and cannot be measured exactly at the same time.
We postulate that operators of position and momentum obey the relations
[ˆrj, ˆpk] = i δjk [ˆrj, ˆrk] = [ˆpj, ˆpk] = 0. (4.16)
Note that we only postulate relations between operators. Various choices of expressing the operators
are possible and correct as long as Eq. 4.16 holds. A frequently used choice is described
below.
The wave function Ψ(x, t) deﬁned by Eq. 4.3 is a function of the position of the particle, not of
the momentum (it is a sum of contributions of all possible momenta). If we deﬁne basis as a set of
functions ψj = Ψ(xj, t) for all possible positions xj, the operator of position is simply multiplication
by the value of the coordinate describing the given position (see Section 4.9.7). Operators of the
positions in the y and z directions are deﬁned in the same manner.
ˆx ≡ x · ˆy ≡ y · ˆz ≡ z · (4.17)
In Section 4.9.6, an operator of momentum of a particle moving in the x direction is obtained by
calculating ∂Ψ/∂x (Eq. 4.87). If a particle moves in a general direction, operators of components of
the momentum tensor are derived in the same manner.
ˆpx ≡ −i
∂
∂x
, (4.18)
ˆpy ≡ −i
∂
∂y
, (4.19)
ˆpz ≡ −i
∂
∂z
. (4.20)
It is shown in Section 4.9.8 that such a choice is compatible with the postulate described by
Eq. 4.16. Note that the commutator relations described in Section 4.9.8 follow from the way how we
deﬁned Ψ in Eq. 4.3. However, we can also use Eq. 4.16 as the fundamental deﬁnition and Eq. 4.3
as its consequence. This is how we postulate the deﬁnition of the position and momentum operators
here.
4.6. OPERATOR OF ENERGY AND EQUATION OF MOTION 95
4.6 Operator of energy and equation of motion
The arguments presented in Section 4.9.10 show that the eigenvalues of the total (kinetic and potential)
energy of a free particle can be obtained by calculating ∂Ψ/∂t. If the particles experience forces
that depend only on the coordinates (and can be calculated as gradients of the potential energy),
the sum of kinetic and potential energy is equal to the Hamiltonian H in the classical mechanics
(Section 4.9.1). The same term is used for the corresponding quantum mechanical operator, labeled
ˆH.
The association of Hamiltonian (energy operator) with the time derivative makes it essential for
analysis of dynamics of systems in quantum mechanics:
We postulate that evolution of a system in time is given by the Hamiltonian:
i
∂Ψ
∂t
= ˆHΨ. (4.21)
Note that our ﬁrst postulate (the wave function completely describes the system, including its
future) requires that the wave equation contains only the ﬁrst time derivative (not e.g. the second
time derivative). The explanation is provided in Section 4.9.11.
Eq. 4.21 can be also written for matrix representation of Ψ and ˆH. If eigenfunctions of ˆH are
used as a basis (Ψ = c1(t)ψ1 +c2(t)ψ2 +· · · ), the time-independent eigenfunctions ψj can be factored
out from ∂Ψ/∂t (left-hand side) and Ψ (right-hand side), and canceled, giving
i
d
dt



c1
c2
...


 =



E1 0 · · ·
0 E2 · · ·
...
...
...






c1
c2
...


 , (4.22)
which is simply a set of independent diﬀerential equations
dcj
dt
= −i
Ej
cj ⇒ cj = aje−i
Ej
t
, (4.23)
where the (possibly complex) integration constant aj is given by the value of cj at t = 0.
Note that the coeﬃcients cj evolve, but the products c∗
j cj = |aj|2
do not change in time. Each
product c∗
j cj describes the probability that the system is in the state with the energy equal to the
eigenvalue Ej, described by an eigenfunction ψj.
• States corresponding to the eigenfunctions of the Hamiltonian are stationary (do not vary in
time).
• Only stationary states can be described by the energy level diagram.
Since our goal is quantum description of NMR, it is useful to see how is the evolution of a wave
function inﬂuenced by the magnetic ﬁelds. Therefore, we list the equations of motions for wave
functions describing a free particle, a particle in an electric ﬁeld, and a particle in an electric and
magnetic ﬁeld. All three variants are known as the Schr¨odinger equation.
96
• Free particle. As shown in Section 4.9.10, a wave function describing a free particle evolves as
i
∂Ψ
∂t
= −
2
2m
∂2
∂x2
+
∂2
∂x2
+
∂2
∂x2
ˆH
Ψ. (4.24)
• Charged particle in an electric ﬁeld. Electric forces depend only on the position of the charge
in the electrical ﬁeld. Therefore, the electric potential energy can be described as QV (x, y, z),
where Q is the electric charge and V (x, y, z) is an electrostatic potential. As follows from the
classical mechanics (Section 4.9.1), and is also shown in Section 4.9.10, the eﬀect of en electric
ﬁeld is accounted for simply by adding the electric potential energy Epot(x, y, z) = QV (x, y, z)
to the Hamiltonian
i
∂Ψ
∂t
= −
2
2m
∂2
∂x2
+
∂2
∂y2
+
∂2
∂z2
+ QV (x, y, z)
ˆH
Ψ. (4.25)
• Charged particle in an electromagnetic ﬁeld. The real challenge is to describe the eﬀect of
the magnetic ﬁeld on the evolution in time. The problem is that the magnetic force does not
depend solely on the position in the ﬁeld, but also on the velocity of the charge (Eq. 4.53).
This case is analyzed in detail in Section 4.9.2, showing that the eﬀect of the magnetic ﬁeld can
be described by the vector potential, a vector quantity that can be used to deﬁne the magnetic
induction B = × A = ∂Az
∂y
− ∂Ay
∂z
, ∂Ax
∂z
− ∂Az
∂x
, ∂Ay
∂x
− ∂Ax
∂y
. As shown in Section 4.9.2, the
vector potential modiﬁes the momentum p → p − QA and the resulting wave equation is
i
∂Ψ
∂t
=
1
2m
i
∂
∂x
+ QAx
2
+ i
∂
∂y
+ QAy
2
+ i
∂
∂z
+ QAz
2
+ QV (x, y, z)
ˆH
Ψ.
(4.26)
4.7 Operator of angular momentum
In order to understand NMR experiments, we also need to describe rotation in space. The fundamental
quantity related to the rotation is the angular momentum. In a search for its operator, we start
from what we know, position and momentum operators. We use classical physics and just replace
the values of coordinates and momentum components by their operators.
Classical deﬁnition of the vector of angular momentum L is
L = r × p. (4.27)
4.7. OPERATOR OF ANGULAR MOMENTUM 97
The vector product represents the following set of equations:
Lx = rypz − rzpy, (4.28)
Ly = rzpx − rxpz, (4.29)
Lz = rxpy − rypx. (4.30)
Going to the operators
ˆLx = ˆry ˆpz − ˆrz ˆpy = −i y
∂
∂z
+ i z
∂
∂y
, (4.31)
ˆLy = ˆrz ˆpx − ˆrx ˆpz = −i z
∂
∂x
+ i x
∂
∂z
, (4.32)
ˆLz = ˆrx ˆpy − ˆry ˆpx = −i x
∂
∂y
+ i y
∂
∂x
, (4.33)
ˆL2
= ˆL2
x + ˆL2
y + ˆL2
z. (4.34)
As shown in Section 4.9.12
[ˆLx, ˆLy] = i ˆLz, (4.35)
[ˆLy, ˆLz] = i ˆLx, (4.36)
[ˆLz, ˆLx] = i ˆLy, (4.37)
but
[ˆL2
, ˆLx] = [ˆL2
, ˆLy] = [ˆL2
, ˆLz] = 0. (4.38)
Note that
• Two components of angular momentum cannot be measured exactly at the same time.
• Eqs. 4.35–4.38 can be used as a deﬁnition of angular momentum operators if the position and
momentum operators are not available.5
The relationship between the angular momentum and rotation is discussed in Sections 4.9.13 and
4.9.14. Eigenvalues and eigenfunctions of the commuting operators ˆL2
and ˆLz are derived in Sections
4.9.15 and 4.9.16, respectively.
5
Eqs. 4.35–4.38 are sometimes written in a condensed form as [ˆLj, ˆLk] = i jkl
ˆLl and [ˆL2
, ˆLj] = 0, where j, k, l ∈
{x, y, z} and jkl = 1 for jkl = xyz or any even permutation of x, y, z in xyz (even number of exchanges of subscripts
x, y, z in xyz, e.g. yzx is obtained by two exchanges: ﬁrst x ↔ y and subsequently x ↔ z), jkl = −1 for any odd
permutation of x, y, z in xyz, jkl = 0 for two or three identical subscripts (e.g. xyy).
98
4.8 Operator of orbital magnetic moment
Knowing the operator of the angular momentum, we can easily deﬁne the operators of the orbital
magnetic moment.
A moving charged particle can be viewed as an electric current. Classical deﬁnition of the magnetic
moment of a charged particle travelling in a circular path (orbit) is (Section 0.6.2)
µ =
Q
2
(r × v) =
Q
2m
(r × p) =
Q
2m
L = γL, (4.39)
where Q is the charge of the particle, m is the mass of the particle, v is the velocity of the particle,
and γ is known as the magnetogyric ratio (constant).6
Therefore, we can write the operators
ˆµx = γ ˆLx ˆµy = γ ˆLy ˆµz = γ ˆLz ˆµ2
= γ2 ˆL2
. (4.40)
Finally, we can deﬁne the operator of energy (Hamiltonian) of a magnetic moment in a magnetic
ﬁeld. Classically, the energy of a magnetic moment µ in a magnetic ﬁeld of induction B is E = −µ·B.
Accordingly, the Hamiltonian of the interactions of an orbital magnetic moment with a magnetic ﬁeld
is
ˆH = −Bx ˆµx − By ˆµy − Bz ˆµz = −γ Bx
ˆLx + By
ˆLy + Bz
ˆLz = −
Q
2m
Bx
ˆLx + By
ˆLy + Bz
ˆLz . (4.41)
In contrast to the operators of orbital angular momentum and magnetic moment, derivation of
intrinsic angular momentum, known as the spin, and of the associated magnetic moment, requires
a more fundamental (and much more demanding) approach. We discuss such approach in the next
Lecture.
HOMEWORK
As a preparation for the next lecture, derive the Dirac equation (Section 5.7.3), and check if you
understand why the ˆγ matrices in Dirac equation (Eq. 5.2) can have the required properties, whereas
numbers cannot (Section 5.7.4).
6
The term gyromagnetic ratio is also used.
4.9. SUPPORTING INFORMATION 99
4.9 SUPPORTING INFORMATION
4.9.1 Classical mechanics: Newton, Lagrange, Hamilton
Newton’s laws describe mechanics using forces. In the presence of a force F, motion of a particle of a mass m is described by the second
Newton’s law
F = ma =
dp
dt
. (4.42)
As an alternative, the Newton mechanics can be reformulated in terms of energies. The total kinetic energy of a body consisting of
N particles is
Ekin =
1
2
m
N
k=1
vk · vk (4.43)
and depends only on the velocities of the particles vk, not on their positions rk. The total kinetic energy can be related to the
accelerations as follows
∂Ekin
∂vkl
=
1
2
m(2vkl) = mvkl = pkl, (4.44)
makl =
d
dt
(mvkl) =
dpkl
dt
=
d
dt
∂Ekin
∂vkl
, (4.45)
where k is the particle number and l is the direction (x, y, or z). In the presence of forces that depend only on the coordinates (x, y,
or z) and can be calculated as gradients of potential energy, the formulation of the second Newton’s law is straightforward
dpkl
dt
=
d
dt
∂Ekin
∂vkl
=
∂Epot
∂rkl
= Fkl. (4.46)
Since our Ekin depends only on velocities and not on position in space, and Epot depends only on position in space and not on velocities,
Ekin and Epot can be combined into one variable called Lagrangian L:
0 =
dpkl
dt
− Fkl =
d
dt
∂Ekin
∂vkl
−
∂Epot
∂rkl
=
d
dt
∂(Ekin − Epot)
∂vkl
−
∂(Ekin − Epot)
∂rkl
≡
d
dt
∂L
∂vkl
−
∂L
∂rkl
. (4.47)
A set of Eq. 4.47 for all values of k and l (3N combinations) describes well a set of N free particles, which has 3N degrees of freedom.
If the mutual positions of particles are constrained by C constrains (e.g. atoms in a molecule), the number of degrees of freedom is lower
(3N − C) and the number of equations can be reduced. It is therefore desirable to replace the 3N values of rkl by 3N − C values of
generalized coordinates qj. Each value of rkl is then a combination of qj values, and
drkl =
3N−C
j=1
∂rkl
∂qj
dqj, (4.48)
and (if the constraints do not depend on time)
vkl =
drkl
dt
=
3N−C
j=1
∂rkl
∂qj
dqj
dt
≡
3N−C
j=1
∂rkl
∂qj
˙qj, (4.49)
where the dot represents time derivative. The equation of motion can be thus rewritten as
d
dt
∂L
∂ ˙qj
=
∂L
∂qj
. (4.50)
We obtained Eq. 4.50 starting from the second Newton’s law. However, mechanics can be also built in the opposite direction, starting
from the following statement. Equation of motion describing a physical process that starts at time t1 and ends at time t2 must be such
that the integral
´ t2
t1
Ldt is stationary, in other words, that the variation of the integral is zero. This statement is known as the least action
principle and, using calculus of variation (as nicely described in The Feynman Lectures on Physics, Vol. 2, Chapter 19), Eq. 4.50 can be
derived from it.7 There is, however, no general rule how to express the Lagrangian as an explicit function of generalized coordinates and
velocities. Finding the Lagrangian may be a demanding task, requiring experience and physical intuition.
7
Richard Feynman showed that quantum mechanics can be reformulated by using
e
i
´ t2
t1
L/ dt
as a probability amplitude (path integral approach).
100
Lagrangian can be converted to yet another energy-related function, known as Hamiltonian. Lagrangian and Hamiltonian are related
by the Legendre transformation (see Section 4.9.3).
H(qj, pj) + L(qj, ˙qj) =
j
(pj · ˙qj), (4.51)
where
pj =
∂L
∂ ˙qj
. (4.52)
For our set of N unconstrained particles exposed to forces that do not depend on the particle velocities, qj = rkl and pj = ∂L
∂ ˙qj
is the
linear momentum of the k-th particle in the direction l (cf. Eq. 4.44) and the Hamiltonian is simply the sum of total kinetic and potential
energy (H = Ekin + Epot). In general, pj is called the canonical momentum.
The introduction of Lagrangian and Hamiltonian approaches may seem to be an unnecessarily complication of the description of
classical mechanics. However, Hamiltonians and Lagrangians become essential when we search for quantum mechanical description of
particles observed in magnetic resonance experiments because Hamiltonian describes evolution of quantum states in time.8
4.9.2 Lagrangian and Hamiltonian including magnetism
Derivation of the Hamiltonian (classical or quantum) for magnetic particles in magnetic ﬁelds is much more demanding because the magnetic
force depends on the velocity of moving charged particles. Therefore, velocity enters the Lagrangian not only through the kinetic energy
and the canonical momentum is no longer identical with the linear momentum. We start our analysis by searching for a classical Lagrangian
describing motion of a charged particle in a magnetic ﬁeld, and then convert it to the Hamiltonian using Legendre transformation.
We know that the Lagrangian should give us the Lorentz force
F = Q(E + v × B). (4.53)
We know that a velocity-independent force is a gradient of the corresponding potential energy. For the electric force,
F = Eel = Q V, (4.54)
where the electric potential energy Eel and the electric potential V are scalar quantities. Intuitively, we expect the magnetic force to
be also a gradient of some scalar quantity (some sort of magnetic potential energy or magnetic potential). The magnetic force is given by
Qv × B, so the magnetic energy should be proportional to the velocity. But the velocity is a vector quantity, not a scalar. We may guess
that the scalar quantity resembling the electric potential may be a scalar product of velocity with another vector. This tells us that the
search for the electromagnetic Lagrangian is a search for a vector that, when included in the Lagrangian, correctly reproduces the Lorentz
force, expressed in terms of E and B in Eq. 4.53. The information about E and B can be extracted from the following Maxwell equations
· B = 0 (4.55)
× E = −
∂B
∂t
, (4.56)
but we have to employ our knowledge of vector algebra to handle the divergence in Eq. 4.55 and the curl in Eq. 4.56.
First, note that we look for a scalar product, but Eq. 4.53 contains a vector product. The useful identity a × (b × c) = b(a · c) − (a · b)c
tells us that it would be nice to replace B with a curl of another vector because it would give us, after inserting in Eq. 4.53, the desired
gradient of scalar product:
v × ( × A) = (v · A) − (v · )A). (4.57)
The vector A is a so-called vector potential.
Another identity says that a · (a × b) = 0 for any vectors a and b because a × b ⊥ a. As a consequence, we can really replace B by a
curl (rotation) of some vector A because · ( × A) = 0 as required by Eq. 4.55. The ﬁrst step thus gives us a new deﬁnition of B
B = × A (4.58)
which can be inserted into Eq. 4.53
F = Q(E + v × B) = Q(E + v × ( × A)), (4.59)
and using the aforementioned identity a × (b × c) = b(a · c) − (a · b)c,
8
The Hamiltonian can be also used to describe time evolution in classical mechanics.
4.9. SUPPORTING INFORMATION 101
F = Q(E + v × B) = Q(E + v × ( × A)) = Q(E + (v · A) − (v · )A). (4.60)
Second, we use our new deﬁnition of B and rewrite Eq. 4.56 as
0 =
∂B
∂t
+ × E = ×
∂A
∂t
+ × E = ×
∂A
∂t
+ E . (4.61)
Third, we notice that for any vector a and constant c, a × (ca) = 0 because a ca. As a consequence, we can replace (∂A/∂t + E)
by a gradient of some scalar V because × ( (∂A/∂t + E)) = × (− V ) = 0 as required by Eq. 4.56. The scalar V is the well-known
electric potential and allows us to express E as w combine
E = −
∂A
∂t
− V. (4.62)
which can be also inserted into Eq. 4.53
F = Q(E + v × B) = Q −
∂A
∂t
− V + (v · A) − (v · )A . (4.63)
Finally, we notice that
dA
dt
=
∂A
∂t
+
∂A
∂x
dx
dt
+
∂A
∂y
dy
dt
+
∂A
∂z
dz
dt
=
∂A
∂t
+ v · A ⇒
∂A
∂t
=
dA
dt
− v · A, (4.64)
which shows that v · A in Eq. 4.63 can be can be included into dA/dt
F = Q(E + v × B) = Q −
∂A
∂t
− V + (v · A) − (v · )A = Q −
dA
dt
− V + (v · A) . (4.65)
Let us now try to write L as
L = Ekin − Eel + Emagn =
1
2
mv2
− QV + Emagn, (4.66)
where Eel is a typical potential energy dependent on position but not on speed, and Emagn can depend on both position and speed.
For this Lagrangian,
∂L
∂x
=
∂Eel
∂x
+
∂Emagn
∂x
= −Q
∂V
∂x
+
∂Emagn
∂x
(4.67)
d
dt
∂L
∂vx
=
d
dt
∂Ekin
∂vx
+
∂Emagn
∂vx
= max +
d
dt
∂Emagn
∂vx
. (4.68)
If we use Emagn = Qv · A, Eqs. 4.67 and 4.68 with Eq. 4.50 for q = x give us
max = −Q
dAx
dt
−
∂V
∂x
+
∂(v · A)
∂x
(4.69)
and a sum with similar y- and z-components is equal to the Lorentz force
ma = F = Q −
dA
dt
− V + (v · A) = Q(E + v × B). (4.70)
We have found that our (classical and non-relativistic) Lagrangian has the form
L =
1
2
mv2
− QV + Q(v · A). (4.71)
According to Eq. 4.52, the canonical momentum has the following components
px =
∂L
∂vx
= mvx + QAx py =
∂L
∂vy
= mvy + QAy pz =
∂L
∂vz
= mvy + QAz. (4.72)
The Hamiltonian can be obtained as usually as the Legendre transform
102
H =
j=x,y,z
pjvj − L = p · v − L = p · v −
1
2
mv2
+ QV − Q(v · A). (4.73)
In order to express H as a function of p, we express v as (p − QA)/m:
H =
p · (p − QA)
m
−
(p − QA)2
2m
+ QV −
Q(p − QA) · A
m
=
2p · (p − QA) − (p − QA)2 − 2Q(p − QA) · A
2m
+ QV
=
2p2 − 2Qp · A − p2 + 2Qp · A − Q2A2 − 2Qp · A + 2Q2A2
2m
+ QV =
p2 − 2Qp · A + Q2A2
2m
+ QV =
(p − QA)2
2m
+ QV. (4.74)
We use Eq. 4.74 in Section 5.7.7 as a starting point of quantum mechanical description of the spin magnetic moment.
4.9.3 Legendre transformation
The Legendre transformation has a simple graphical representation (Figure 4.3). If we plot (Figure 4.3A) a function of a variable x, e.g.
f(x), slope at a certain value of x = ξ is equal to s(ξ) = (∂f/∂x)ξ. A tangent line y(ξ) touching the plotted f for x = ξ is described by
the slope s(ξ) and intercept g(ξ) as y = g + s(ξ)x. The value of the intercept for all possible values of ξ can be expressed as a function of
the slope g(s) = y(ξ) − s(ξ)ξ = f(ξ) − s(ξ)ξ (y and f are equal at x = ξ because they touch each other). If we identify x with ˙q, f with
L, and −g with H (Figure 4.3B), we get Eq. 4.51 for a one-dimensional case (j = 1). The inverse Legendre transformation is deﬁned in a
similar manner for the function g(s) and its slope t at s = σ (Figure 4.3C), or for −g = H and f = L (Figure 4.3D).
4.9.4 Calculating square
Recall how ”square” is calculated for various mathematical objects: for a real number c2 = cc, for a complex number |c|2 = cc∗, for vector
v composed of N real numbers v1, v2, . . . , which can be written in a matrix form as a row or column of the numbers v1, v2, . . . ,
|v|2
= v · v = v1v1 + v2v2 + · · · =
N
j=1
vjvj = v1 v2 · · ·



v1
v2
...


 , (4.75)
for a vector v composed of N complex numbers c1 = a1 + ib1, c2 = a2 + ib2, . . .
|v|2
= v†
· v = c∗
1c1 + c∗
2c2 + · · · =
N
j=1
c∗
j cj =
N
j=1
(aj − ibj)(aj + ibj) = c∗
1 c∗
2 · · ·



c1
c2
...


 = a1 − ib1 a2 − ib2 · · ·



a1 + ib1
a2 + ib2
.
..


 ,
(4.76)
for a (continuous and possibly complex) function
∞ˆ
−∞
f∗
(x)f(x)dx (4.77)
(function can be viewed as a vector of inﬁnite number of inﬁnitely ”dense” elements, summation is therefore replaced by integration).
Paul Dirac introduced the following notation: |v , |f is a vector v or function f, respectively, and
v|v = v†
· v =
N
j=1
v∗
j vj, (4.78)
f|f =
∞ˆ
−∞
f∗
(x)f(x)dx. (4.79)
4.9. SUPPORTING INFORMATION 103
xξ
g
f
y = g(ξ) + s(ξ)x
A
s
σ
f
g
t = f(σ) − x(σ)s
C
˙q
−Epot
−H
L
y = −H + p ˙q
B
p
Epot
−L
H
t = −L + p ˙q
D
Figure 4.3: Legendre transformation of a general function f(x) (A) and of one-dimensional Lagrangian L (B), and inverse Legendre
transformation of a general function g(s) of one-dimensional Lagrangian L and Hamiltonian H (D). The transformation is presented for a
Lagrangian L and a Hamiltonian H describing forces independent of the velocity.
104
4.9.5 Orthogonality and normalization of monochromatic waves
Note that monochromatic waves are orthogonal, i.e., a scalar product of two waves diﬀering in p is equal to zero:
ψ1|ψ2 =
∞ˆ
−∞
ψ∗
1 ψ2dx =
∞ˆ
−∞
A∗
e− i (p1x−E1t)
Ae
i (p2x−E2t)
dx = |A|2
e
i (E1−E2)t
∞ˆ
−∞
e
i (p1−p2)x
dx =
|A|2
e
i (E1−E2)t
∞ˆ
−∞
cos
(p1 − p2)x
dx + i|A|2
e
i (E1−E2)t
∞ˆ
−∞
sin
(p1 − p2)x
dx = 0 (4.80)
unless p1 = p2 (positive and negative parts of sine and cosine functions cancel each other during integration, with the exception of
cos 0 = 1).
Values of A can be also normalized to give the result of Eq. 4.80 equal to 1 if p1 = p2 and E1 = E2. The requirement ψ1|ψ2 = 0 for
p1 = p2, E1 = E2 and ψ1|ψ2 = 1 for p1 = p2, E1 = E2 can be written using the delta function (see Section 3.10.4):
|A|2
∞ˆ
−∞
e
i (p1−p2)x
dx = δ(p1 − p2), (4.81)
taken into account the fact that e
i (E1−E2)t
= 1 for E1 = E2. Repeating the analysis presented in Section 3.10.4 (replacing ω by p/ ,
and t by x) shows that
|A|2
∞ˆ
−∞
e
i (p1−p2)x
dx =
1
√
2π
∞ˆ
−∞
e
i (p1−p2)x
dx = h− 1
2
∞ˆ
−∞
e
i (p1−p2)x
dx = δ(p1 − p2) (4.82)
(cf. Eqs. 3.35 and 3.37). The procedure can be extended to the three-dimensional case, where all three coordinates of the momentum
vectors p1 and p2 must be equal to get non-zero ψ1|ψ2 . This can be written as
ψ1|ψ2 = h− 3
2
∞ˆ
−∞
∞ˆ
−∞
∞ˆ
−∞
e
i (p1−p2)·r
dr = h− 3
2
∞ˆ
−∞
e
i (p1,x−p2,x)x
dx
∞ˆ
−∞
e
i (p1,y−p2,y)y
dy
∞ˆ
−∞
e
i (p1,z−p2,z)z
dz
= δ(p1 − p2) = δ(p1,x − p2,x) · δ(p1,y − p2,y) · δ(p1,z − p2,z). (4.83)
In the language of algebra, the complete set of normalized monochromatic waves constitutes an orthonormal basis for wave functions,
in a similar way as unit vectors ı, , k are the orthonormal basis for all vectors in the Cartesian coordinate system x, y, z.
Also, Ψ (linear combination of ψ1, ψ2, . . . ) can be normalized based on the condition
∞ˆ
−∞
Ψ∗
Ψdx = P = 1 (4.84)
(if a particle exists, it must be somewhere). It requires
∞ˆ
−∞
(c∗
1c1 + c∗
2c2 + · · · )dx = 1. (4.85)
4.9.6 Eigenfunctions and eigenvalues, operator of momentum
In order to understand what quantum mechanics says about measurable properties of the studied system, let us ask a question: How can
we get the value of a momentum of a free particle described by Eq. 4.3? What operation should be applied to Ψ(x) (a function of x) in
order to get the value of the momentum? Calculation of ∂Ψ/∂x gives us a clue:
∂Ψ
∂x
= c1
∂
∂x
e
i (p1x−E1t)
+ c2
∂
∂x
e
i (p2x−E2t)
+ · · · =
i
p1c1e
i (p1x−E1t)
+
i
p2c2e
i (p2x−E2t)
+ · · · (4.86)
It implies that
− i
∂
∂x
e
i (p1x−E1t)
= p1e
i (p1x−E1t)
, −i
∂
∂x
e
i (p2x−E2t)
= p2e
i (p2x−E2t)
, . . . (4.87)
We see that
4.9. SUPPORTING INFORMATION 105
1. Calculation of the partial derivative of any monochromatic wave and multiplying the result by −i gives us the same wave just
multiplied by a constant. The instruction to calculate the partial derivative and multiply the result by −i is an example of an
operator. If application of the operator to a function gives the same function, only multiplied by a constant, the function is called
eigenfunction of the operator and the constant is called eigenvalue of the operator.
2. The eigenvalues are well-deﬁned, measurable physical quantities – possible values of the momentum along x.
3. The eigenvalues can be obtained by applying the operator to the eigenfunctions and multiplying the results by the complex
conjugates of the eigenfunctions, e.g.
p1 = e− i (p1x−E1t)
−i
∂
∂x
e
i (p1x−E1t)
= e− i (p1x−E1t)
p1e
i (p1x−E1t)
= p1 e− i (p1x−E1t)
e
i (p1x−E1t)
=1
. (4.88)
4.9.7 Operator of position
The question we ask now is: What operation should I apply to Ψ (a function of x) in order to get the value of its coordinate? When
−i ∂/∂x is used as an operator of momentum (in the x direction), applied to Ψ(x), multiplication by the coordinate x is an operator of
the position of the particle (in the x direction). To see how the operator acts, let us write Ψ(x, t) as a series of the values Ψ(xj, t) for all
possible positions xj.9 Then, the product xΨ(x, t) can be written as
x · Ψ(x, t) =







x1c1e
i (p1x1−E1t)
+ x1c2e
i (p2x1−E2t)
+ x1c3e
i (p3x1−E3t)
+ · · ·
x2c1e
i (p1x2−E1t)
+ x2c2e
i (p2x2−E2t)
+ x2c3e
i (p3x2−E3t)
+ · · ·
x3c1e
i (p1x3−E1t)
+ x3c2e
i (p2x3−E2t)
+ x3c3e
i (p3x3−E3t)
+ · · ·
...







=





x1 · Ψ(x1)
x2 · Ψ(x2)
x3 · Ψ(x3)
...





. (4.89)
If the position of the particle is e.g. x2,
Ψ(x2, t) =






0
c1e
i (p1x2−E1t)
+ c2e
i (p2x2−E2t)
+ c3e
i (p3x2−E3t)
+ · · ·
0
...






=





0
Ψ(x2)
0
...





(4.90)
and x · Ψ(x, t) for x = x2 is
x2 · Ψ(x2, t) =






0
x2 c1e
i (p1x2−E1t)
+ c2e
i (p2x2−E2t)
+ c3e
i (p3x2−E3t)
+ · · ·
0
...






=





0
x2 · Ψ(x2)
0
..
.





. (4.91)
We see that multiplication of Ψ(x2, t) by x2 results in x2Ψ(x2), i.e., Ψ(x2) is an eigenfunction of the operator ˆx = x· and x2 is the
corresponding eigenvalue.
Note that multiplication by pj does not work in the same way! We could multiply Ψ(x2) by x2 because Ψ(x2) does not depend on any
other value of the x coordinate. However, Ψ(x2) depends on all possible values of p. On the other hand, the partial derivative ∂Ψ/∂x in
Eq 4.86 gave us each monochromatic wave multiplied by its value of p and ensured that the monochromatic waves acted as eigenfunctions.
4.9.8 Commutation relations of the position and momentum operators
It is easy to check that subsequently applied operators related to diﬀerent coordinates commute. For example
ˆxˆyΨ = xyΨ = yxΨ = ˆyˆxΨ, (4.92)
ˆpx ˆpyΨ = − 2 ∂2Ψ
∂x∂y
= − 2 ∂2Ψ
∂y∂x
= ˆpy ˆpxΨ, (4.93)
or
9
We write the continuous function Ψ(x) as a vector formally containing distinct elements Ψ(x1), Ψ(x2), . . . . In a similar fashion, we
write x as a vector containing a series of all values of the coordinate x: x1, x2, . . . ..
106
ˆxˆpyΨ = −i x
∂Ψ
∂y
= −i
∂(xΨ)
∂y
= ˆpy ˆxΨ. (4.94)
However,
ˆxˆpxΨ = −i x
∂Ψ
∂x
(4.95)
but
ˆpx ˆxΨ = −i
∂(xΨ)
∂x
= −i Ψ − i x
∂Ψ
∂x
. (4.96)
We see that
• commutators of operators of a coordinate and the momentum component in the same direction are equal to i (i.e., multiplication
of Ψ by the factor i ),
• all other position and coordinate operators commute,
in agreement with Eq. 4.16.
4.9.9 Projection operator
Let us assume that the state of the studied system immediately before measuring a quantity A was described by the wave function |Ψ ,
expressed in the basis of orthogonal eigenfunctions of ˆA as
|Ψ = c1|ψ1 + c2|ψ2 + · · · = |c1|eiφ1 |ψ1 + |c2|eiφ2 |ψ2 + · · · (4.97)
The measurement provided a value Am = cmc∗
m = |cm|2, one of eigenvalues of ˆA. Let us also assume that all eigenvalues are diﬀerent.
Application of the operator |ψm ψm|/|cm|2 to |Ψ gives
1
|cm|
|ψm ψm|Ψ =
1
|cm|
|ψm ψm|(c1|ψ1 + · · · + cm|ψm + · · · ) =
1
|cm|
cm|ψm =
1
|cm|
|cm|eiφm |ψm = eiφm |ψm , (4.98)
where eiφm |ψm describes the state immediately after the measurement.
If n diﬀerent eigenfunctions |ψm,1 , |ψm,2 , . . . |ψm,n have the same eigenvalue |cm|, Eq. 4.98 is modiﬁed to
=
n
j=1
|ψm,j ψm,j|
√
n|cm|

c1|ψ1 + · · · +
n
j=1
cm,j|ψm,j + · · ·

 =
1
√
n|cm|
|cm|
n
j=1
eiφm,j |ψm,j =
n
j=1
eiφm,j
√
n
|ψm,j , (4.99)
where the ﬁnal sum (linear combination of eigenfunctions |ψm,j ) describes the state immediately after the measurement.
The operator ˆPm =
n
j=1
|ψm,j ψm,j| is known as the projection operator, and the normalization constant can be deﬁned using the
relation
Ψ| ˆPm|Ψ =

c∗
1 ψ1| + · · · +
n
j=1
c∗
m,j ψm,j| + · · ·


n
j=1
|ψm,j ψm,j|

c1|ψ1 + · · · +
n
j=1
cm,j|ψm,j + · · ·

 = n|cm|2
. (4.100)
4.9. SUPPORTING INFORMATION 107
4.9.10 Schr¨odinger equation
We obtained the operator of momentum by calculating ∂Ψ/∂x. What happens if we calculate ∂Ψ/∂t?
∂Ψ
∂t
= c1
∂
∂t
e
i (p1x−E1t)
+ c2
∂
∂t
e
i (p2x−E2t)
+ · · · = −
i
E1c1e
i (p1x−E1t)
−
i
E2c2e
i (p2x−E2t)
− · · · (4.101)
and consequently
i
∂
∂t
e
i (p1x−E1t)
= E1e
i (p1x−E1t)
, i
∂
∂t
e
i (p2x−E2t)
= E2e
i (p2x−E2t)
, . . . (4.102)
1. First, we obtain the operator of energy from Eq. 4.102, in analogy to Eq. 4.87.
2. The second achievement is Eq. 4.101 itself. Energy of free particles is just the kinetic energy (by deﬁnition, ”free” particles do not
experience any forces). Therefore, all energies Ej in the right-hand side of Eq. 4.101 can be written as
Ej =
mv2
j
2
=
p2
j
2m
, (4.103)
resulting in
∂Ψ
∂t
= −
i p2
1
2m
c1e
i (p1x−E1t)
+
p2
2
2m
c2e
i (p2x−E2t)
+ · · · . (4.104)
But an equation with the p2
j terms can be also obtained by calculating
1
2m
∂2Ψ
∂x2
=
1
2m
∂
∂x
∂Ψ
∂x
= −
1
2
p2
1
2m
c1e
i (p1x−E1t)
+
p2
2
2m
c2e
i (p2x−E2t)
+ · · · . (4.105)
Comparison of Eqs. 4.104 and 4.105 gives us the equation of motion
i
∂Ψ
∂t
= −
2
2m
∂2Ψ
∂x2
. (4.106)
If we extend our analysis to particles experiencing a time-independent potential energy Epot(x, y, z), the energy will be given by
Ej =
p2
j
2m
+ Epot (4.107)
where pj is now the absolute value of a momentum vector pj (we have to consider all three direction x, y, z because particles change
direction of motion in the presence of a potential). The time derivative of Ψ is now
∂Ψ
∂t
= −
i p2
1
2m
c1e
i (p1r−E1t)
+
p2
2
2m
c2e
i (p2r−E2t)
+ · · · −
i
Epot(r)Ψ (4.108)
and
p2
1
2m
c1e
i (p1r−E1t)
+
p2
2
2m
c2e
i (p2r−E2t)
+ · · · = −
2
2m
∂2Ψ
∂x2
+
∂2Ψ
∂x2
+
∂2Ψ
∂x2
. (4.109)
Substituting Eq. 4.109 into Eq. 4.108 gives us the famous Schr¨odinger equation
i
∂Ψ
∂t
= −
2
2m
∂2
∂x2
+
∂2
∂x2
+
∂2
∂x2
+ Epot(x, y, z)
ˆH
Ψ. (4.110)
In our case, the Hamiltonian is expressed in terms of the linear momentum p = mv. This is suﬃcient to describe action of forces
that depend only on the position in space and can be therefore calculated as the gradients of the potential energy (e.g. electric forces).
However, using the linear momentum does not allow us to describe forces that depend on velocities of the particles (e.g., magnetic forces).
Therefore, the canonical (or generalized) momentum should be used in general. The canonical momentum is deﬁned by the Lagrange
mechanics, reviewed in Section 4.9.1. We return to the description of a particle in a magnetic ﬁeld in Section 5.7.7.
108
4.9.11 Limitation of wave equation to ﬁrst time derivative
Before saying what a wave equation must fulﬁll in order to describe evolution of a quantum state in time, let us review similar requirements
for the equation of motion in Newton mechanics. In the classical Newton mechanics, the state of the system is fully described by the
coordinates x, y, z and momenta mvx, mvy, mvz of the particles. Therefore, the solution of the equation of motion must depend only on
the starting values of the coordinates and momenta, not on any additional parameter. What does it say about the equation of motion
itself? It can contain only ﬁrst and second derivatives in time. Why? Because:
• Solutions of equation containing only ∂x/∂t require the knowledge of x(t = 0) = x(0).
For example, solution of
∂x
∂t
+ kx = 0 (4.111)
is x = x(0)e−kt, i.e., it depends only on x(0).
• Solutions of equation containing only ∂x/∂t and ∂2x/∂t2 require the knowledge of x(0) and ∂x/∂t(t = 0) = vx(0).
For example, let us look at the wave equation
∂2x
∂t2
+ ω2
x = 0. (4.112)
Note that this equation corresponds to the second Newton’s law, with −mω2x being the force (for the sake of simplicity assumed not
to change in time). The solution is well known, but we can derive it easily because we know how to play with operators:
∂2x
∂t2
+ ω2
x =
∂
∂t
∂
∂t
x + ω2
x =
∂
∂t
2
+ ω2
x =
∂
∂t
+ iω
∂
∂t
− iω x = 0. (4.113)
Obviously, there are two solutions of the equation
∂
∂t
− iω x+ = 0 ⇒ x+ = C+eiωt
= C+(cos(ωt)+i sin(ωt))
∂
∂t
+ iω x− = 0 ⇒ x− = C−e−iωt
= C−(cos(ωt)−i sin(ωt)),
(4.114)
but the solution must be also any linear combination of x+ and x− because 0 + 0 = 0:
x = A+x+ + A−x− = (A+C+ + A−C−)
C1
cos(ωt) + i(A+C+ − A−C−)
C2
sin(ωt) = C1 cos(ωt) + C2 sin(ωt). (4.115)
Consequently, the velocity
vx =
∂x
∂t
= C1
∂ cos(ωt)
∂t
+ C2
∂ sin(ωt)
∂t
= −ωC1 sin(ωt) + ωC2 cos(ωt). (4.116)
It is clear that the so-far unknown parameters C1 and C2 can be obtained by calculating x and vx at t = 0
cos(0) = 1, sin(0) = 0 ⇒ x(0) = C1 vx(0) = ωC2 (4.117)
and that the evolution of x and vx depends only on x(0) and vx(0), as required in Newton mechanics:
x(t) = x(0) cos(ωt) +
vx(0)
ω
sin(ωt) vx(t) = vx(0) cos(ωt) − ω · x(0) sin(ωt). (4.118)
• Solutions of equations containing higher than second time derivative of x require knowledge of the initial values of higher than ﬁrst
time derivatives of x.
For example, let us inspect
∂3x
∂t3
+ ω3
x = 0 (4.119)
Following the same strategy as in Eq. 4.113
∂3x
∂t3
+ λ3
x =
∂
∂t
+ λ
∂2
∂t2
−
∂
∂t
λ + λ2
x =
∂
∂t
+ λ
∂2
∂t2
− 2
∂
∂t
λ
2
+
1
4
λ2
+
3
4
λ2
x =
∂
∂t
+ λ
∂
∂t
−
λ
2
2
+
3
4
λ2
x =
4.9. SUPPORTING INFORMATION 109
∂
∂t
+ λ

 ∂
∂t
−
λ
2
2
− i
√
3
2
λ
2

 x =
∂
∂t
+ λ
∂
∂t
−
1 + i
√
3
2
λ
∂
∂t
−
1 − i
√
3
2
λ x = 0, (4.120)
which has three solutions
x0 = C0e−λt
, x+ = C+e
1+i
√
3λ
2
t
, x− = C−e
1−i
√
3λ
2
t
(4.121)
and any of their linear combinations is also a valid solution
x = A0x0 + A+x+ + A−x− = C1e−λt
+ C2e
1+i
√
3λ
2
t
+ C3e
1−i
√
3λ
2
t
(4.122)
where C1 = A0C0, C2 = A+C+, C3 = A−C−. In order to determine C1, C2, and C3, we need three initial conditions, not only x(0)
and vx(0), but also the initial acceleration a(0) = ∂2x/∂t2. However, the acceleration should not represent an additional degree of freedom.
In Newton mechanics, the acceleration should be completely deﬁned by the initial coordinates and velocities, and by forces that are already
incorporated in the constants in the equation. Therefore, the equation containing the third time derivative is not a Newton’s equation of
motion.
After making sure that we understand the Newton mechanics, we can return to the quantum mechanics. We have postulated that the
wave function Ψ contains the complete information about the studied particle (or system in general). In contrast to the Newton mechanics,
we must require that the wave equation describing the evolution of the system must depend only on Ψ at t = 0. Therefore, our wave
function must contain only ﬁrst derivative in time. If it contained e.g. also ∂2Ψ/∂t2, the evolution in time would depend also on ∂Ψ/∂t
at t = 0, which is against our ﬁrst postulate.
Another problem of an equation containing second time derivative is related to our interpretation of the wave function. We interpret
Ψ(x, y, z)∗Ψ(x, y, z) as a distribution of the probability that the particle’s coordinates are x, y, z. How is this related to the wave equation?
The Schr¨odinger’s equation Eq. 4.24 and its complex conjugate are
i
∂Ψ
∂t
= ˆHΨ − i
∂Ψ∗
∂t
= ˆH∗
Ψ∗
. (4.123)
When we multiply the equations by Ψ∗ and Ψ, respectively, subtract them, and divide the result by i , we obtain
Ψ∗ ∂Ψ
∂t
+ Ψ
∂Ψ∗
∂t
=
1
i
(Ψ∗ ˆHΨ − Ψ ˆH∗
Ψ∗
)
∂(Ψ∗Ψ)
∂t
=
1
i
(Ψ∗ ˆHΨ − Ψ ˆH∗
Ψ∗
). (4.124)
If we assume that a free particle does not move (has a zero momentum and therefore zero Hamiltonian), we ﬁnd that
∂(Ψ∗Ψ)
∂t
= 0. (4.125)
The result is expected, if the particle does not move, ρ = Ψ∗Ψ does not change in time. But if we repeat the procedure with the
equations containing the second time derivative (i.e., when the operator i ∂/∂t is applied twice)
− 2 ∂2Ψ
∂t2
= ˆHΨ − 2 ∂2Ψ∗
∂t2
= ˆH∗
Ψ∗
, (4.126)
we get
−Ψ∗ ∂2Ψ
∂t2
+ Ψ
∂2Ψ∗
∂t2
=
1
2
(Ψ∗ ˆHΨ − Ψ ˆH∗
Ψ∗
)
∂
∂t
Ψ
∂Ψ∗
∂t
−
∂
∂t
Ψ∗ ∂Ψ
∂t
=
1
2
(Ψ∗ ˆHΨ − Ψ ˆH∗
Ψ∗
)
∂
∂t
Ψ
∂Ψ∗
∂t
− Ψ∗ ∂Ψ
∂t
=
1
2
(Ψ∗ ˆHΨ − Ψ ˆH∗
Ψ∗
). (4.127)
If we now assume that a free particle does not move (has a zero momentum and therefore zero Hamiltonian), the conserved quantity
is not Ψ∗Ψ, but Ψ ∂Ψ∗
∂t
− Ψ∗ ∂Ψ
∂t
, containing both Ψ and its time derivative. This contradicts our interpretation of the wave function as a
probability amplitude.
110
4.9.12 Commutators of angular momentum operators
The operators of angular momentum components are
ˆLx = ˆry ˆpz − ˆrz ˆpy = −i y
∂
∂z
+ i z
∂
∂y
, (4.128)
ˆLy = ˆrz ˆpx − ˆrx ˆpz = −i z
∂
∂x
+ i x
∂
∂z
, (4.129)
ˆLz = ˆrx ˆpy − ˆry ˆpx = −i x
∂
∂y
+ i y
∂
∂x
, (4.130)
ˆL2
= ˆL2
x + ˆL2
y + ˆL2
z. (4.131)
Therefore,
[ˆLx, ˆLy] = (ˆry ˆpz − ˆrz ˆpy)(ˆrz ˆpx − ˆrx ˆpz) − (ˆrz ˆpx − ˆrx ˆpz)(ˆry ˆpz − ˆrz ˆpy)
= ˆry ˆpz ˆrz ˆpx − ˆrz ˆpy ˆrz ˆpx − ˆry ˆrx ˆpz ˆpz + ˆrz ˆpy ˆrx ˆpz − ˆrz ˆpxˆry ˆpz + ˆrx ˆpz ˆry ˆpz + ˆrz ˆpxˆrz ˆpy − ˆrx ˆpz ˆrz ˆpy (4.132)
The commutation relations postulated in Eq. 4.16 allow us to exchange some of the operators and write ﬁrst the operators that
commute
[ˆLx, ˆLy] = ˆry ˆpx ˆpz ˆrz−ˆrz ˆrz ˆpx ˆpy − ˆry ˆpxˆrz ˆpz + ˆrx ˆpy ˆrz ˆpz − ˆry ˆpxˆrz ˆpz+ˆrxˆry ˆpz ˆpz + ˆrz ˆrz ˆpx ˆpy − ˆrx ˆpy ˆpz ˆrz (4.133)
The red terms cancel each other and using Eq. 4.16
[ˆLx, ˆLy] = (ˆry ˆpx − ˆrx ˆpy)(ˆpz ˆrz − ˆrz ˆpz) = (−ˆLz)(−i ) = i ˆLz. (4.134)
The other commutators can be derived in the same manner.
It is also useful to calculate commutators of the following combinations of operators
ˆLx + iˆLy = ˆL+
ˆLx − iˆLy = ˆL− : (4.135)
[ˆL+, ˆL−] = [ˆLx + iˆLy, ˆLx − iˆLy] = [ˆLx, ˆLx] + [ˆLy, ˆLy] + i[ˆLy, ˆLx] − i[ˆLx, ˆLy] = −2i[ˆLx, ˆLy] = −2i(i ˆLz) = 2 ˆLz (4.136)
[ˆLz, ˆL+] = [ˆLz, ˆLx + iˆLy] = [ˆLz, ˆLx] + i[ˆLz, ˆLy] = i ˆLy + i(−i ˆLx) = ˆLx + i ˆLy = ˆL+ (4.137)
[ˆLz, ˆL−] = [ˆLz, ˆLx − iˆLy] = [ˆLz, ˆLx] − i[ˆLz, ˆLy] = i ˆLy − i(−i ˆLx) = ˆLx − i ˆLy = − ˆL− (4.138)
[ˆL2
, ˆL±] = [ˆL2
, ˆLx ± iˆLy] = [ˆL2
, ˆLx] ± i[ˆL2
, ˆLy] = 0. (4.139)
4.9.13 Angular momentum and rotation
To see the relation between angular momentum and rotation in space, we ﬁrst ﬁnd eigenvalues Lz,k and eigenfunctions ψk of ˆLz. As
described in B15.3 (and in textbooks discussing quantum mechanics), the operator ˆLz written in the spherical coordinates (r, ϑ, ϕ) is
ˆLz = −i
∂
∂ϕ
(4.140)
and we can assume that the part of its eigenfunctions dependent on the coordinate ϕ (azimuth) can be separated: ψk = Q(r, ϑ)Rk(ϕ).
Eigenvalues and eigenfunctions of ˆLz are deﬁned by
ˆLzψk = Lz,kψk, (4.141)
−i
∂(QRk)
∂ϕ
= Lz,k(QRk), (4.142)
−i Q
dRk
dϕ
= Lz,kQRk, (4.143)
−i
d ln Rk
dϕ
= Lz,k, (4.144)
Rk = ei
Lz,k
ϕ
. (4.145)
4.9. SUPPORTING INFORMATION 111
Note that ψk(ϕ) and ψk(ϕ + 2πk) are equal for any integer k:
ei
Lz,k
(ϕ+2π)
= ei
Lz,k
ϕ
· ei2π
Lz,k
= 1
if
Lz,k
is integer
(4.146)
Therefore,
• value of the z-component of the angular momentum must be an integer multiple of .
There is a close relation between the angular momentum operators and description of rotation in quantum mechanics. Rotation of a
point deﬁned by the position vector r about an axis given by the angular frequency vector ω can be described as
dr
dt
= ω × r, (4.147)
or more explicitly
drx
dt
= ωyrz − ωzry, (4.148)
dry
dt
= ωzrx − ωxrz, (4.149)
drz
dt
= ωxry − ωyrx. (4.150)
If a coordinate frame is chosen so that ω = (0, 0, ω)
drx
dt
= −ωry, (4.151)
dry
dt
= ωrx, (4.152)
drz
dt
= 0. (4.153)
We already know (see Section 1.5.8) that such a set of equation can be solved easily: multiply the second equation by i and add it to
the ﬁrst equation or subtract it from the ﬁrst equation.
d(rx + iry)
dt
= ω(−ry + irx) = +iω(rx + iry), (4.154)
d(rx − iry)
dt
= ω(−ry − irx) = −iω(rx − iry), (4.155)
rx + iry = C+e+iωt
, (4.156)
rx − iry = C−e−iωt
, (4.157)
where the integration constants C+ = rx(0) + iry(0) = reiφ0 and C− = rx(0) − iry(0) = re−iφ0 are given by the initial phase φ0 of r
in the coordinate system:
rx + iry = re+i(ωt+φ0)
= r(cos(ωt + φ0) + i(sin(ωt + φ0)), (4.158)
rx − iry = re−i(ωt+φ0)
= r(cos(ωt + φ0) − i(sin(ωt + φ0)). (4.159)
The angle of rotation ϕ is obviously given by ωt.
rx + iry = re+iφ0 e+iϕ
= (rx(0) + iry(0))e+iϕ
, (4.160)
rx − iry = re−iφ0 e−iϕ
= (rx(0) − iry(0))e−iϕ
. (4.161)
Comparison with Eq. 4.145 documents the relation between ˆLz and rotation:
112
• The eigenfunction of ˆLz with the eigenvalue ˆLz = describes counterclockwise rotation of a vector about z.
This conclusion of course applies to any vector a, and can be used to describe rotation in a diﬀerent, and often more useful manner
than in in Section 1.5.3. We express a vector a in a diﬀerent basis, namely we decompose a not into its three Cartesian components, but
to the z component plus the linear combinations ax ± iay used above. With the proper normalization (to keep a∗a in the new basis equal
to a2 = a2
x + a2
y + a2
z), a can be written as a row vector10
a =
ax−iay
√
2
az −
ax+iay
√
2
= a−1 a0 a+1 . (4.162)
Then, rotation of a about z is described by a transformation matrix (applied from right to row vectors) which is diagonal:
a = a−1 a0 a+1 = a−1(0) a0(0) a+1(0)


e−iϕ 0 0
0 1 0
0 0 eiϕ

 = a−1(0) a0(0) a+1(0)


ei·(−1)·ϕ 0 0
0 ei·0·ϕ 0
0 0 ei·1·ϕ

 . (4.163)
4.9.14 Rotation described by Wigner matrices
So far, we have described only rotation about one axis (z). The discussion can be extended to a general rotation is space, relating two
diﬀerent coordinate systems. As described in Section 1.5.3, a Cartesian coordinate system can be transformed to another one by three
successive rotations. Two of them are rotations about the z axis (about the ”new” z axis by the angle −ϕ and about the ”original” z axis
by the angle −χ and ). Such rotations are, according to Eq. 4.163, described by the functions eikϕ and eik χ, where k and k are −1, 0,
and +1. The middle rotation about the y axis is more diﬃcult to describe because the operator ˆLy has a more complicated form in the
spherical coordinates (cf B15.3)
ˆLy = i − cos ϕ
∂
∂ϑ
+ cot θ sin ϕ
∂
∂ϑ
. (4.164)
The middle rotation is deﬁned by a matrix with components traditionally labeled by d1
k,k
(ϑ).
ˆd1
(ϑ) =



1
2
(1 + cos ϑ) 1√
2
sin ϑ 1
2
(1 − cos ϑ)
− 1√
2
sin ϑ cos ϑ 1√
2
sin ϑ
1
2
(1 − cos ϑ) − 1√
2
sin ϑ 1
2
(1 + cos ϑ)


 . (4.165)
The complete matrix, traditionally denoted ˆD1, has the following form
ˆD1
(ϕ, ϑ, χ) =


eiϕ 0 0
0 1 0
0 0 e−iϕ





1
2
(1 + cos ϑ) 1√
2
sin ϑ 1
2
(1 − cos ϑ)
− 1√
2
sin ϑ cos ϑ 1√
2
sin ϑ
1
2
(1 − cos ϑ) − 1√
2
sin ϑ 1
2
(1 + cos ϑ)





eiχ 0 0
0 1 0
0 0 e−iχ


=



1
2
eiϕeiχ(1 + cos ϑ) eiϕ 1√
2
sin ϑ 1
2
eiϕe−iχ(1 − cos ϑ)
− 1√
2
eiχ sin ϑ cos ϑ 1√
2
e−iχ sin ϑ
1
2
e−iϕeiχ(1 − cos ϑ) −e−iϕ 1√
2
sin ϑ 1
2
e−iϕe−iχ(1 + cos ϑ)


 . (4.166)
This matrix describes active rotation of an object in space by the angles (ϕ, ϑ, χ) or passive rotation by the angles (−ϕ, −ϑ, −χ)
describing how a is seen from diﬀerent coordinate frames (cf. Section 1.5.3). Transformation of a coordinate frame can be therefore
described as
a = ak =−1 ak =0 ak =+1 = ak=−1 ak=0 ak=+1



1
2
eiϕeiχ(1 + cos ϑ) eiϕ 1√
2
sin ϑ 1
2
eiϕe−iχ(1 − cos ϑ)
− 1√
2
eiχ sin ϑ cos ϑ 1√
2
e−iχ sin ϑ
1
2
e−iϕeiχ(1 − cos ϑ) −e−iϕ 1√
2
sin ϑ 1
2
e−iϕe−iχ(1 + cos ϑ)



(4.167)
or simply11
10
We present a as a row vector here because it is a convention to use row vectors (and the corresponding representations of tensors)
when describing rotation in the manner discussed in Section 4.9.14.
11
Here we keep notation introduced in Section 1.5.3 to describe a passive rotation corresponding to the transformation from an ”original”
coordinate system to a new, ”primed” frame. The elements of the transformation matrix are typically marked with indices m m (summing
over m ) in the literature.
4.9. SUPPORTING INFORMATION 113
ak =
1
k=−1
D1
k,k (ϕ, ϑ, χ)ak. (4.168)
Rotation of tensors discussed in Section 1.5.3 can be also described in a similar manner as presented above for vectors. The tensors
of our interest can be expressed in the following basis
T0,0 = −
1
√
3
(Txx + Tyy + Tzz) (4.169)
T1,−1 = −
1
2
(Txz − Tzx − i(Tyz − Tzy)) (4.170)
T1,0 = i
1
√
2
(Txy − Tyx) (4.171)
T1,+1 = −
1
2
(Txz − Tzx + i(Tyz − Tzy)) (4.172)
T2,−2 = +
1
2
(Txx − Tyy − i(Txy − Tyx)) (4.173)
T2,−1 = +
1
2
(Txz + Tzx − i(Tyz + Tzy)) (4.174)
T2,0 = −
1
√
6
(Txx + Tyy − 2Tzz) (4.175)
T2,+1 = −
1
2
(Txz + Tzx + i(Tyz + Tzy)) (4.176)
T2,+2 = +
1
2
(Txx − Tyy + i(Txy − Tyx)) (4.177)
The ﬁrst subscript deﬁnes rank of the component and also the transformation matrix. The zero-rank component T0,0 is a scalar
and does not change under rotation. Therefore, its transformation ”matrix” is simply D0 = 1. An example is the atomic orbital 1s.
The ﬁrst-rank components T1,k transform as vectors12 and their rotation is described by our familiar matrix ˆD1. Examples of ﬁrst-rank
tensors are the functions describing angular dependence of atomic orbitals 2p. The matrix describing transformation of the second-rank
components T2,k is
ˆD2
(ϕ, ϑ, χ) =






e2iϕ 0 0 0 0
0 eiϕ 0 0 0
0 0 1 0 0
0 0 0 e−iϕ 0
0 0 0 0 e−2iϕ






ˆd2
(ϑ)






e2iχ 0 0 0 0
0 eiχ 0 0 0
0 0 1 0 0
0 0 0 e−iχ 0
0 0 0 0 e−2iχ






, (4.178)
where
ˆd2
(ϑ) =











1
4
(1 + cos ϑ)2 1
2
sin ϑ(1 + cos ϑ) 3
8
sin2 ϑ 1
2
sin ϑ(1 − cos ϑ) 1
4
(1 − cos ϑ)2
−1
2
sin ϑ(1 + cos ϑ) 1
2
(2 cos ϑ − 1)(1 + cos ϑ) 3
2
sin ϑ cos ϑ 1
2
(2 cos ϑ + 1)(1 − cos ϑ) 1
2
sin ϑ(1 − cos ϑ)
3
8
sin2 ϑ − 3
2
sin ϑ cos ϑ 1
2
(3 cos2 −1) 3
2
sin ϑ cos ϑ 3
8
sin2 ϑ
1
2
sin ϑ(1 − cos ϑ) 1
2
(2 cos ϑ + 1)(1 − cos ϑ) − 3
2
sin ϑ cos ϑ 1
2
(2 cos ϑ − 1)(1 + cos ϑ) 1
2
sin ϑ(1 + cos ϑ)
1
4
(1 − cos ϑ)2 −1
2
sin ϑ(1 − cos ϑ) 3
8
sin2 ϑ −1
2
sin ϑ(1 + cos ϑ) 1
4
(1 + cos ϑ)2











. (4.179)
Examples of second-rank tensors are the functions describing angular dependence of the atomic orbitals 3d.
In summary, components of tensors transform as
Tj,k =
j
k=−j
Dj
k,k
(ϕ, ϑ, χ)Tj,k =
j
k=−j
e−ikϕ
dj
kk
(ϑ)e−ik χ
(4.180)
and the elements of the transformation matrices are listed in Table 4.1. Description of rotation described in this section was introduced
in 1927 by Wigner and the transformation matrices are usually called Wigner matrices.
12
If the Cartesian components of the tensor T are products of Cartesian components of vectors a and b (Txy = axbx etc.), then
T1,k = i√
2
ck, where ck are elements of c = a × b in the basis deﬁned by Eq. 4.162.
114
Table 4.1: Elements of Wigner matrices describing rotation of vectors and tensors.
j k k e−ikϕ dj
k,k
(ϑ) e−ik χ Dj
k,k
(ϕ, ϑ, χ)
0 0 0 1 1 1 D0
0,0 = 1
1 −1 −1 e+iϕ 1
2
(1 + cos ϑ) e+iχ D1
−1,−1 = 1
2
ei(ϕ+χ)(1 + cos ϑ)
1 −1 0 e−iϕ 1√
2
sin ϑ cos ϑ 1 D1
−1,0 = 1√
2
sin ϑ cos ϑ
1 −1 +1 e+iϕ 1
2
(1 − cos ϑ) e−iχ D1
−1,+1 = 1
2
ei(ϕ−χ)eiϕ(1 − cos ϑ)
1 0 −1 1 1√
2
sin ϑ cos ϑ e+iχ D1
0,−1 = − 1√
2
eiχ sin ϑ cos ϑ
1 0 0 1 cos ϑ 1 D1
0,0 = cos ϑ
1 0 +1 1 1√
2
sin ϑ cos ϑ e−iχ D1
0,+1 = 1√
2
e−iχ sin ϑ cos ϑ
1 +1 −1 e−iϕ 1
2
(1 − cos ϑ) e+iχ D1
+1,−1 = 1
2
e−i(ϕ−χ)(1 − cos ϑ)
1 +1 0 e−iϕ 1√
2
sin ϑ cos ϑ 1 D1
+1,0 = − 1√
2
e−iϕ sin ϑ cos ϑ
1 +1 +1 e−iϕ 1
2
(1 + cos ϑ) e−iχ D1
+1,+1 = 1
2
e−i(ϕ+χ)(1 + cos ϑ)
2 −2 −2 e+2iϕ 1
4
(1 + cos ϑ)2 e+2iχ D2
−2,−2 = 1
4
e2i(ϕ+χ)(1 + cos ϑ)2
2 −2 −1 e+2iϕ 1
2
sin ϑ(1 + cos ϑ) e+iχ D2
−2,−1 = 1
2
ei(2ϕ+χ) sin ϑ(1 + cos ϑ)
2 −2 0 e+2iϕ 3
8
sin2 ϑ 1 D2
−2,0 = 3
8
e2iϕ sin2 ϑ
2 −2 +1 e+2iϕ 1
2
sin ϑ(1 − cos ϑ) e−iχ D2
−2,+1 = 1
2
ei(2ϕ−χ) sin ϑ(1 − cos ϑ)
2 −2 +2 e+2iϕ 1
4
(1 − cos ϑ)2 e−2iχ D2
−2,+2 = 1
4
e2i(ϕ−χ)(1 − cos ϑ)2
2 −1 −2 e+iϕ −1
2
sin ϑ(1 + cos ϑ) e+2iχ D2
−1,−2 = −1
2
ei(ϕ+2χ) sin ϑ(1 + cos ϑ)
2 −1 −1 e+iϕ 1
2
(2 cos ϑ − 1)(1 + cos ϑ) e+iχ D2
−1,−1 = 1
2
ei(ϕ+χ)(2 cos ϑ − 1)(1 + cos ϑ)
2 −1 0 e+iϕ 3
2
sin ϑ cos ϑ 1 D2
−1,0 = 3
2
eiϕ sin ϑ cos ϑ
2 −1 +1 e+iϕ 1
2
(2 cos ϑ + 1)(1 − cos ϑ) e−iχ D2
−1,+1 = 1
2
ei(ϕ−χ)(2 cos ϑ + 1)(1 − cos ϑ)
2 −1 +2 e+iϕ 1
2
sin ϑ(1 − cos ϑ) e−2iχ D2
−1,+2 = 1
2
ei(ϕ−2χ) sin ϑ(1 − cos ϑ)
2 0 −2 1 3
8
sin2 ϑ e+2iχ D2
0,−2 = 3
8
e2iχ sin2 ϑ
2 0 −1 1 − 3
2
sin ϑ cos ϑ e+iχ D2
0,−1 = − 3
2
eiχ sin ϑ cos ϑ
2 0 0 1 1
2
(3 cos2 ϑ − 1) 1 D2
0,0 = 1
2
(3 cos2 ϑ − 1)
2 0 +1 1 3
2
sin ϑ cos ϑ e−iχ D2
0,+1 = 3
2
e−iχ sin ϑ cos ϑ
2 0 +2 1 3
8
sin2 ϑ e−2iχ D2
0,+2 = 3
8
e−2iχ sin2 ϑ
2 +1 −2 e−iϕ −1
2
sin ϑ(1 − cos ϑ) e+2iχ D2
+1,−2 = −1
2
e−i(2ϕ−χ) sin ϑ(1 − cos ϑ)
2 +1 −1 e−iϕ 1
2
(2 cos ϑ + 1)(1 − cos ϑ) e+iχ D2
+1,−1 = 1
2
e−i(ϕ−χ)(2 cos ϑ + 1)(1 − cos ϑ)
2 +1 0 e−iϕ − 3
2
sin ϑ cos ϑ 1 D2
+1,0 = − 3
2
e−iϕ sin ϑ cos ϑ
2 +1 +1 e−iϕ 1
2
(2 cos ϑ − 1)(1 + cos ϑ e−iχ D2
+1,+1 = 1
2
e−i(ϕ+χ)(2 cos ϑ − 1)(1 + cos ϑ)
2 +1 +2 e−iϕ 1
2
sin ϑ(1 + cos ϑ) e−2iχ D2
+1,+2 = 1
2
e−i(2ϕ+χ) sin ϑ(1 + cos ϑ)
2 +2 −2 e−2iϕ 1
4
(1 − cos ϑ)2 e+2iχ D2
+2,−2 = 1
4
e−2i(ϕ−χ)(1 − cos ϑ)2
2 +2 −1 e−2iϕ −1
2
sin ϑ(1 − cos ϑ) e+iχ D2
+2,−1 = −1
2
e−i(2ϕ−χ) sin ϑ(1 − cos ϑ)
2 +2 0 e−2iϕ 3
8
sin2 ϑ 1 D2
+2,0 = 3
8
e−2iϕ sin2 ϑ
2 +2 +1 e−2iϕ −1
2
sin ϑ(1 + cos ϑ) e−iχ D2
+2,+1 = −1
2
e−i(2ϕ+χ) sin ϑ(1 + cos ϑ)
2 +2 +2 e−2iϕ 1
4
(1 + cos ϑ)2 e−2iχ D2
+2,+2 = 1
4
e−2i(ϕ+χ)(1 + cos ϑ)2
4.9. SUPPORTING INFORMATION 115
4.9.15 Eigenvalues of angular momentum operators
In the preceding section, we have found that eigenfunctions of ˆLz describe rotation about z and that eigenvalues of ˆLz are integer multiples
of . In this section, we complete the description of the eigenvalues of ˆLz and of ˆL2. Determination of eigenvalues of angular momentum
operators is very important for describing electron conﬁguration of atoms (atomic orbitals) and solving issues related to molecular rotation
(e.g. infrared spectroscopy). The motivation to include its discussion here is to explain notation that is also used in some areas of NMR
spectroscopy (NMR of nuclei with spin number higher than 1/2). A reader who is not interested in such issues (that are not directly
discussed in this course) may skip this section and Section 4.9.16 without loosing information important for the following lectures.
A set of eigenvalues L2
j of the operator ˆL2 is given by a set of equations
ˆL2
ψj = L2
j ψj, (4.181)
where ψj are individual eigenfunctions of ˆL2. The same applies to ˆLz and its eigenfunctions ψk:
ˆLzψk = Lz,kψk. (4.182)
As ˆL2 and ˆLz commute ([ˆL2, ˆLz] = 0), their eigenvalues can be evaluated simultaneously, using the same eigenfunction. The j-th
eigenfunction of ˆL2 can be also the k-th eigenfunction of ˆLz. Here we denote such simultaneous eigenfunctions as ψj,k. To relate the
eigenvalues, we use the Pythagorean theorem
ˆL2
= ˆL2
x + ˆL2
y + ˆL2
z ⇒ (ˆL2
x + ˆL2
y)ψj,k = (ˆL2
− ˆL2
z)ψj,k = ˆL2
ψj,k − ˆLz(ˆLzψj,k) = (L2
j − L2
z,k)ψj,k. (4.183)
Because L2
j − L2
z,k are eigenvalues of a square operator ˆL2
x + ˆL2
y, they cannot be negative. Therefore, L2
z,k cannot exceed L2
j .
In the next step, we take advantage of the operators ˆL+ and ˆL− introduced in Eq. 4.135. As ˆL2 and ˆL± commute,
ˆL2 ˆL+ψj,k = ˆL+
ˆL2
ψj,k = ˆL+(L2
j ψj,k) = L2
j (ˆL+ψj,k) (4.184)
ˆL2 ˆL−ψj,k = ˆL−
ˆL2
ψj,k = ˆL−(L2
j ψj,k) = L2
j (ˆL−ψj,k) (4.185)
According to Eqs. 4.137 and 4.138
[ˆLz, ˆL+] = ˆLz
ˆL+ − ˆL+
ˆLz = + ˆL+ ⇒ ˆLz
ˆL+ = ˆL+
ˆLz + ˆL+ (4.186)
[ˆLz, ˆL−] = ˆLz
ˆL− − ˆL−
ˆLz = − ˆL− ⇒ ˆLz
ˆL− = ˆL−
ˆLz − ˆL− (4.187)
and therefore
ˆLz
ˆL+ψj,k = ˆL+
ˆLzψj,k + ˆL+ψj,k = ˆL+(Lz,kψj,k) + ˆL+ψj,k = (Lz,k + )(ˆL+ψj,k) (4.188)
ˆLz
ˆL−ψj,k = ˆL−
ˆLzψj,k − ˆL−ψj,k = ˆL−(Lz,kψj,k) − ˆL−ψj,k = (Lz,k − )(ˆL−ψj,k). (4.189)
This tells us that the operator ˆL+ converts ψj,k to another eigenfunction of ˆLz, to an eigenfunction associated with the eigenvalue
Lz,k + , i.e. with the eigenvalue associated with ψj,k increased by . But we already know (Eq. 4.146), that is the diﬀerence between
two successive eigenvalues of ˆLz. Therefore, the eigenfunction created by application of ˆL+ to ψj,k can be called C+ψj,k+1:
ˆL+ψj,k = C+ψj,k+1, (4.190)
where C+ is a so-far unknown coeﬃcient that cancels out in Eq. 4.188. If we insert ψj,k+1 into Eq. 4.188, we obtain ψj,k+2 and so
on. But we cannot play this game forever because we know that L2
z,k cannot exceed L2
j . There must be some maximum value kmax which
cannot be increased any further:
ˆL+ψj,kmax = 0 · ψj,kmax . (4.191)
In a very similar way, ˆL− decreases k down to kmin:
ˆL−ψj,kmin
= 0 · ψj,kmin
. (4.192)
We now apply ˆL− to ˆL+ψj,kmax and ˆL+ to ˆL−ψj,kmin
:
ˆL−
ˆL+ψj,kmax = (ˆLx − iˆLy)(ˆLx + iˆLy)ψj,kmax = (ˆL2
x + ˆL2
y + i(ˆLx
ˆLy − ˆLy
ˆLx))ψj,kmax = (ˆL2
− ˆL2
z + i([ˆLx, ˆLy]))ψj,kmax
= (ˆL2
− ˆL2
z + i(i ˆLz))ψj,kmax = (ˆL2
− ˆL2
z − ˆLz)ψj,kmax = (L2
j − L2
z,kmax
− Lz,kmax )ψj,kmax (4.193)
ˆL+
ˆL−ψj,kmin
= (ˆLx + iˆLy)(ˆLx − iˆLy)ψj,kmin
= (ˆL2
x + ˆL2
y − i(ˆLx
ˆLy − ˆLy
ˆLx))ψj,kmin
= (ˆL2
− ˆL2
z − i([ˆLx, ˆLy]))ψj,kmin
= (ˆL2
− ˆL2
z − i(i ˆLz))ψj,kmin
= (ˆL2
− ˆL2
z + ˆLz)ψj,kmin
= (L2
j − L2
z,kmin
+ Lz,kmin
)ψj,kmin
. (4.194)
116
Comparison with Eqs. 4.191 and 4.192 requires
L2
j − L2
z,kmax
− Lz,kmax = 0 (4.195)
L2
j − L2
z,kmin
+ Lz,kmin
= 0. (4.196)
Subtracting the Eq. 4.195 from Eq. 4.196,
L2
z,kmax
− L2
z,kmin
+ Lz,kmax + Lz,kmin
= (Lz,kmax − Lz,kmin
)(Lz,kmax + Lz,kmin
) + (Lz,kmax + Lz,kmin
) =
(Lz,kmax + Lz,kmin
)(Lz,kmax − Lz,kmin
+ ) = 0. (4.197)
Obviously, (Lz,kmax +Lz,kmin
) or (Lz,kmax −Lz,kmin
+ ) must be equal to zero. Because Lz,kmax ≥ Lz,kmin
, the only possible solution
is Lz,kmax + Lz,kmin
= 0. But the diﬀerence Lz,kmax − Lz,kmin
is also restricted. As successive values of Lz,k diﬀer by (Eq. 4.146),
Lz,kmax − Lz,kmin
must be also a multiple of . Both conditions are fulﬁlled for Lz,kmax = +j and Lz,kmin
= −j , where j is integer
or half-integer. Considering what we learned about angular momentum and rotation in Section 4.9.13, the half-integer values seem to be
allowed mathematically but not physically (terms with half-integer values of the rotation angles do not appear in matrices ˆDj describing
rotations of vectors and tensors). However, we ﬁnd a meaningful physical interpretation of j = 1/2 in the following lecture.
Substituting Lz,kmax = +j into Eq. 4.195 deﬁnes
L2
j = j2 2
+ j 2
= j(j + 1) 2
. (4.198)
For such an eigenvalue of ˆL2, the possible eigenvalues of ˆLz are
− j , −(j − 1) , −(j − 2) , . . . , (j − 2) , (j − 1) , j . (4.199)
Our last task is to evaluate C+ in Eq. 4.190 and a similar coeﬃcient C− for ˆL−ψj,k = C−ψj,k−1. We start by evaluating |C+|2,
which requires calculation of the complex conjugate of the (in general complex) coeﬃcient C+. We express the complex conjugate taking
advantage of the fact that operators and eigenfunctions can be represented by vectors and matrices.
(C+ψj,k)∗
= (ˆL+|ψj,k )†
= ψj,k|ˆL†
+ = ψj,k|ˆL−, (4.200)
where |ψj,k and ψj,k| are treated as a column and row vector, respectively, and ˆL+ and ˆL− as mutually transposed square matrices.
Then,
|C+ψj,k|2
= ψj,k|ˆL−
ˆL+|ψj,k = ψj,k|ˆL2
− ˆL2
z − ˆLz|ψj,k = j(j + 1) 2
− m(m + 1) 2
. (4.201)
In a similar manner,
|C−ψj,k|2
= ψj,k|ˆL+
ˆL−|ψj,k = ψj,k|ˆL2
− ˆL2
z + ˆLz|ψj,k = j(j + 1) 2
− m(m − 1) 2
. (4.202)
The absolute values of C+ and C− are
|C+| = j(j + 1) 2 − m(m + 1), (4.203)
|C−| = j(j + 1) 2 − m(m − 1). (4.204)
The phases of C+ and C− are not restricted. We can therefore set them to zero and deﬁne C+ and C− as real numbers
C+ = j(j + 1) 2 − m(m + 1), (4.205)
C− = j(j + 1) 2 − m(m − 1). (4.206)
4.9. SUPPORTING INFORMATION 117
4.9.16 Eigenfunctions of angular momentum operators
After evaluation of eigenvalues of ˆL2 and ˆLz, we can proceed to the determination of the eigenfunctions ψj,k. We have already found
eigenfunctions of ˆLz in Section 4.9.13 (Eq. 4.145). In order to ﬁnd wave functions that are simultaneously eigenfunctions of ˆL2 and ˆLz,
we have to express both operators in spherical coordinates. For ˆLz, it has been done already in Eq. 4.140, for ˆL2, the desired algebraic
expression is (see e.g. B15.3)
ˆL2
= −
2
sin ϑ
∂
∂ϑ
sin ϑ
∂
∂ϑ
+
∂
∂ϕ
1
sin ϑ
∂
∂ϕ
. (4.207)
We have therefore to solve a set of equations
ˆLzψj,k = −i
∂
∂ϕ
ψj,k = Lz,kψj,k = k ψj,k, (4.208)
ˆL2
ψj,k = −
2
sin ϑ
∂
∂ϑ
sin ϑ
∂
∂ϑ
+
∂
∂ϕ
1
sin ϑ
∂
∂ϕ
ψj,k = Ljψj,k = j(j + 1) 2
ψj,k. (4.209)
If ψj,k = Qj,k(r, ϑ)Rj,k(ϕ),
−iQj,k
∂
∂ϕ
Rj,k = kQj,kRj,k, (4.210)
−
Rj,k
sin ϑ
∂
∂ϑ
sin ϑ
∂Qj,k
∂ϑ
+
Qj,k
sin ϑ
∂2Rj,k
∂ϕ2
= j(j + 1)Qj,kRj,k. (4.211)
The ﬁrst equation has been already solved in Section 4.9.13, yielding ((Eq. 4.145)
Rj,k = Rk = eikϕ
. (4.212)
We can use this solution to calculate ∂2Rk/∂ϕ2
∂2Rk
∂ϕ2
=
∂2eikϕ
∂ϕ2
= −k2
eikϕ
= −k2
Rk (4.213)
and insert it into Eq. 4.211:
−
Rk
sin ϑ
d
dϑ
sin ϑ
dQj,k
dϑ
−
Qj,k
sin ϑ
k2
Rk = j(j + 1)Qj,kRk (4.214)
1
sin ϑ
d
dϑ
sin ϑ
dQj,k
dϑ
+ j(j + 1) −
k2
sin2 ϑ
Qj,k = 0 (4.215)
(1 − u2
)
d2Qj,k
du2
− 2u
dQj,k
du
+
j(j + 1)(1 − u2) − k2
1 − u2
Qj,k = 0, (4.216)
where the substitution u = cos ϑ (and ∂u = − sin ϑ∂ϑ) was used on the last line. This equation has the same form as Eq. 2.58 derived
in Section 2.6.3. A simpliﬁed version of Eq. 2.58 was solved in Section 2.6.4, where the solution was searched for in a form of a series
of powers of u. Solving the complete equation is more diﬃcult due to the presence of the (1 − u2)−1 factor in the term proportional to
Qj,k. In order to cope with the (1 − u2)−1 factor, we notice that each diﬀerentiation of a function multiplied by a so-far undeﬁned power
of (1 − u2) produces terms with the power decreased by one (in addition to other terms). We may hope that this compensates for the
opposite trend in Eq. 4.216: Qj,k is associated with (1 − u2)−1, the ﬁrst derivative of Qj,k with (1 − u2)0, the second derivative of Qj,k
with (1 − u2)1. This motivates us to look for solutions in a form
Qj,k = (1 − u2
)s
∞
l=0
alul
, (4.217)
with the ﬁrst derivative
dQj,k
du
= (1 − u2
)s d
du
∞
l=0
alul
− 2us(1 − u2
)s−1
∞
l=0
alul
= (1 − u2
)s
∞
l=0
lalul−1
− 2s(1 − u2
)s−1
∞
l=0
alul+1
(4.218)
and the second derivative
118
d2Qj,k
du2
= (1 − u2
)s d
du
∞
l=0
lalul−1
− 2s(1 − u2
)s−1 d
du
∞
l=0
alul+1
− 2us(1 − u2
)s−1
∞
l=0
lalul−1
+ 4us(s − 1)(1 − u2
)s−2
∞
l=0
alul+1
= (1 − u2
)s
∞
l=0
l(l − 1)alul−2
− 2s(1 − u2
)s−1
∞
l=0
(2l + 1)alul
+ 4s(s − 1)(1 − u2
)s−2
∞
l=0
alul+2
(4.219)
Substituting Qj,k,
dQj,k
du
, and
d2
Qj,k
du2 into Eq. 4.216,
(1 − u2
) (1 − u2
)s
∞
l=0
l(l − 1)alul−2
− 2s(1 − u2
)s−1
∞
l=0
(2l + 1)alul
+ 4s(s − 1)(1 − u2
)s−2
∞
l=0
alul+2
−2u (1 − u2
)s
∞
l=0
lalul−1
− 2s(1 − u2
)s−1
∞
l=0
alul+1
+
j(j + 1)(1 − u2) − k2
1 − u2
(1 − u2
)s
∞
l=0
alul
= 0. (4.220)
Now we use the u and 1 − u2 factors to adjust the exponents in the sums to ul
(1 − u2
)s
∞
l=0
l(l − 1)alul−2
− (1 − u2
)s
∞
l=0
l(l − 1)alul
− 2s(1 − u2
)s
∞
l=0
(2l + 1)alul
+ 4s(s − 1)(1 − u2
)s−1
u2
∞
l=0
alul
−2(1 − u2
)s
∞
l=0
lalul
+ 4s(1 − u2
)s−1
u2
∞
l=0
alul
+j(j + 1)(1 − u2
)s
∞
l=0
alul
− k2
(1 − u2
)s−1
∞
l=0
alul
= 0.(4.221)
We succeeded except for the ﬁrst sum. We factor out (1 − u2)s and combine all terms into one sum
(1 − u2
)s
∞
l=0
l(l − 1)alul−2
+ −l(l − 1) − 2s(2l + 1) − 2l + j(j + 1) +
4s(s − 1)u2 + 4su2 − k2
1 − u2
alul
= 0 (4.222)
(1 − u2
)s
∞
l=0
ul
l(l − 1)alul−2
+ −l2
− 4sl − 2s − l + j(j + 1) +
4s2u2 − k2
1 − u2
alul
= 0. (4.223)
This equation is satisﬁed if (1 − u2)s or the sum is equal to zero for every value of l. Let us ﬁrst inspect solutions for l = 0
(1 − u2
)s
0 + −2s + j(j + 1) +
4s2u2 − k2
1 − u2
a0 = 0. (4.224)
This must be true for any u, including u = 1 or u = −1 (solutions for u = cos ϑ = 1, i.e., θ = 0, and u = cos ϑ = −1, i.e., θ = π). But
u = ±1 makes the denominator in the last term to tend to zero. Such singularities must be checked carefully. If u = ±1, all terms in the
sum multiplied by (1 − u2)s are zero, except for the last one. The last term approaches inﬁnity unless the numerator (equal to 4s2 − k2
for u = ±1) is equal to zero. Therefore, the conditions
4s2
= k2
⇒ s =
|k|
2
(4.225)
must be fulﬁlled in order to satisfy Eq. 4.224 for a0 = 0. The inspection of the boundary conditions (the singularities at θ = 0 and
θ = π) thus provided the so-far unknown value of s .
We now return to Eq. 4.223 and move the exponents in the ﬁrst sum to ul. We proceed as in Section 2.6.4. The ﬁrst two terms of the
ﬁrst sum are equal to zero because the ﬁrst term includes multiplication by l = 0 and the second term includes multiplication by l − 1 = 0
for l = 1. Therefore, starting summation from l = 2 does not change anything.
∞
l=0
l(l − 1)alul−2
=
∞
l=2
l(l − 1)alul−2
. (4.226)
4.9. SUPPORTING INFORMATION 119
Decreasing the index l in this sum by two moves ul−2 to ul, as desired.
∞
l=2
l(l − 1)alul−2
=
∞
l=0
(l + 2)(l + 1)al+2ul
. (4.227)
We factor out ul
(1 − u2
)|k|/2
∞
l=0
ul
(l + 2)(l + 1)al+2 + −l2
− 2|k|l − |k| − l + j(j + 1) +
k2u2 − k2
1 − u2
al = 0 (4.228)
(1 − u2
)|k|/2
∞
l=0
ul
(l + 2)(l + 1)al+2 − l2
+ 2|k|l + |k| + l + k2
− j(j + 1) al = 0 (4.229)
(1 − u2
)|k|/2
∞
l=0
ul
{(l + 2)(l + 1)al+2 − [(|k| + l)(|k| + l + 1) − j(j + 1)] al} = 0 (4.230)
and obtain a recurrence formula very similar to Eq. 2.74
al+2 =
(l + |k|)(l + |k| + 1) − j(j + 1)
(l + 2)(l + 1)
al. (4.231)
Therefore, we can express the series
∞
l=0
alul as
∞
l=0
alul
= a0 1 +
(|k|)(|k| + 1) − j(j + 1)
1 · 2
u2
+
(|k|)(|k| + 1) − j(j + 1)
1 · 2
·
(|k| + 2)(|k| + 3) − j(j + 1)
3 · 4
u4
+ . . .
+ a1 u +
(|k| + 1)(|k| + 2) − j(j + 1)
2 · 3
u3
+
(|k| + 1)(|k| + 2) − j(j + 1)
2 · 3
·
(|k| + 3)(|k| + 4) − j(j + 1)
4 · 5
u5
+ . . . (4.232)
The recurrence formula also tells us that every value of l satisfying the condition l + |k| = j, i.e., l = j − |k|, terminates one of the
series in the parenthesis because
aj−|k|+2 =
(j − |k| + |k|)(j − |k| + |k| + 1) − j(j + 1)
(j − |k| + 2)(j − |k| + 1)
aj−|k| =
0
(j − |k| + 2)(j − |k| + 1)
aj−|k| = 0. (4.233)
We can therefore express all solutions Qj,k using the same approach as in Section 2.6.4. For each combination of j and k, one of the
series in Eq. 4.232 is terminated at l = j − |k| and the other one grows to inﬁnity. To keep the whole sum ﬁnite, the so-far undetermined
coeﬃcient multiplying the unterminated series is set to zero. For example, if l = j − |k| occurs in the series following a0, then a1 is set to
zero, and vice versa. The coeﬃcient multiplying the terminated series is determined by the normalization condition, as discussed below.
In this manner, we can ﬁnd, step-by-step, all solutions of Qj,k as possible ﬁnite sums multiplied by (1 − u2)|k|/2 = sin|k| ϑ with the
corresponding value of |k|. The ﬁrst solutions are listed in Table 4.2. The complete eigenfunctions are the products Rj,kQj,k = Yj,k, called
spherical harmonics. They are orthogonal, often normalized (by setting the values of a0 and a1) so that the integral of the square of Yj,k
over all possible orientations is unity:
ˆ 2π
0
dϕ
ˆ π
0
sin ϑdϑ Y ∗
j,k(ϕ, ϑ)Yj ,k (ϕ, ϑ) =
ˆ 2π
0
dϕ
ˆ 1
−1
du Y ∗
j,k(ϕ, u)Yj ,k (ϕ, u) = δjj δkk . (4.234)
The derived eigenfunctions deserve some remarks
• Spherical harmonics Yj,k(ϑ, ϕ) describe simultaneous eigenfunctions of ˆL2 and ˆLz for all integer eigenvalues j , but not for the
half-integer eigenvalues j . Spherical harmonics are eigenfunctions of orbital angular momentum. The half-integer eigenvalues j
and the corresponding eigenfunctions are discussed in the next lecture when the spin angular momentum is introduced.
• Spherical harmonics describe the angular dependence of the familiar atomic orbitals (derived for hydrogen).
• As expected for eigenfunctions of operator ˆLz, the square Y ∗
j,k(ϕ, ϑ)Yj,k(ϕ, ϑ) depends only on ϑ, not on ϕ (note that ˆLz represents
projection of L on the z axis).
• Comparison of Tables 4.1 and 4.2 reveals a close relation between the spherical harmonics and Wigner matrix elements: except for
the normalization factor, Yj,k(ϑ, ϕ) is equal to Dj
k,0
120
Table 4.2: Eigenfunctions of the operators ˆL2 and ˆLz.
j k l recurrence Qj,k ψj,k
0 0 0 = j − |k| a2 = 0 · a0 ⇒ a1 = 0 a0 Y0,0 = 1
4π
1 0 0 a2 = −2
1·2
· a0
1 = j − |k| a3 = 0 · a1 ⇒ a0 = 0 a1u = a1 cos ϑ Y1,0 = 3
4π
cos ϑ
1 ±1 1 = j − |k| a2 = 0 · a0 ⇒ a1 = 0 a0
√
1 − u2 = a0 sin ϑ Y1,±1 = 3
8π
e±iϕ sin ϑ
2 0 0 a2 = −6
1·2
· a0
1 a3 = −4
2·3
· a1
2 = j − |k| a4 = 0 · a2 ⇒ a1 = 0 a0 1 − 3u2 = a0 1 − 3 cos2 ϑ Y2,0 = 5
16π
3 cos2 ϑ − 1
2 ±1 0 a2 = −4
1·2
· a0
2 ±1 1 = j − |k| a3 = 0 · a1 ⇒ a0 = 0 a1
√
1 − u2u = a1 sin ϑ cos ϑ Y2,±1 = 15
8π
e±iϕ sin ϑ cos ϑ
2 ±2 0 = j − |k| a2 = 0 · a0 ⇒ a1 = 0 a0(1 − u2) = a0 sin2 ϑ Y2,±2 = 15
32π
e±2iϕ sin2 ϑ
3 0 0 a2 = −12
1·2
· a0
1 a3 = −10
2·3
· a1
2 a4 = −6
3·4
· a2
3 = j − |k| a5 = 0 · a3 ⇒ a0 = 0 a1
3u−5u3
3
= a1
3 cos ϑ−5 cos3
ϑ
3
Y3,0 = 7
16π
5 cos3 ϑ − 3 cos ϑ
3 ±1 0 a2 = −4
1·2
· a0
3 ±1 1 a3 = −6
2·3
· a1
3 ±1 2 = j − |k| a4 = 0 · a2 ⇒ a1 = 0 a0
√
1 − u2(1 − 5u2) = a0 sin ϑ 1 − 5 cos2 ϑ Y3,±1 = 21
64π
e±iϕ sin ϑ 5 cos2 ϑ − 1
3 ±2 0 a2 = −6
1·2
· a0
3 ±2 1 = j − |k| a3 = 0 · a1 ⇒ a0 = 0 a1(1 − u2)u = a1 sin2 ϑ cos ϑ Y3,±2 = 105
32π
e±2iϕ sin2 ϑ cos ϑ
3 ±3 0 = j − |k| a2 = 0 · a0 ⇒ a1 = 0 a0 (1 − u2)3 = a0 sin3 ϑ Y3,±3 = 35
64π
e±3iϕ sin3 ϑ
• Wigner matrices can be used to express Yj,k(ϑ, ϕ) in diﬀerent coordinate frames. The transformations are not limited to j = 0, 1, 2,
discussed in Section 4.9.14. Wigner derived a general form of the transformation matrix applicable to eigenfunctions associated
with all eigenvalues j (including the half-integer j). The elements of the general Wigner matrix are given by
e−ikϕ
dj
k,k
(ϑ)e−ik χ
= e−i(kϕ+k χ)
(j + k)!(j − k)!(j + k )!(j − k )!
l
(−1)k−k +l cos −ϑ
2
2j−k+k −2l
sin −ϑ
2
k−k +2l
(j − k − l)!l!(k − k + l)!(j + k − l)!
,
(4.235)
where l are integer values l ≥ 0, l ≤ j −k, l ≤ j +k , and l ≥ k −k so that the factorials are computed from non-negative numbers.
Lecture 5
Spin
Literature: Introduction to the special theory of relativity can be found in B10, but relativistic
quantum mechanics is not discussed in the literature recommended for this course or in general
physical chemistry textbooks (despite the important role of spin in chemistry). Therefore, more
background information is presented here than in the other chapters. NMR can be correctly described
if the spin is introduced ad hoc. The purpose of Section 5.7.3 is to show how the spin emerges
naturally. Origin of nuclear magnetism is touched in L1.3 and L1.4. Quantum mechanics of spin
angular momentum is reviewed in K6, L7, and L10.
5.1 Dirac equation
The angular momentum discussed in Section 4.7 is associated with the change of direction of a
moving particle. However, the theory discussed so-far does not explain the experimental observation
that even point-like particles moving along straight lines possess a well deﬁned angular momentum,
so-called spin.
The origin of the spin is a consequence of the symmetry of Nature that is taken into account in
the theory of relativity. The Schr¨odinger equation is not relativistic and does not describe the spin
naturally. In this lecture, we describe spin using relativistic quantum mechanics, a theory which is
in agreement with two fundamental postulates of the special theory of relativity (see Sections 5.7.1
and 5.7.2 for review of the special theory of relativity):
The laws of physics are invariant (i.e. identical) in all inertial systems (non-accelerating frames
of reference).
The speed of light in a vacuum is the same for all observers, regardless of the motion of the
light source.
The arguments presented in Sections 5.7.3 and 5.7.4 lead to the wave equation
i
∂
∂t
ˆγ0
+ ic
∂
∂x
ˆγ1
+ ic
∂
∂y
ˆγ2
+ ic
∂
∂z
ˆγ3
− m0c2ˆ1 Ψ = 0, (5.1)
where ˆγj
are the following 4 × 4 matrices
121
122
ˆγ0
=




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 ˆγ1
=




0 0 0 1
0 0 1 0
0 −1 0 0
−1 0 0 0



 ˆγ2
=




0 0 0 −i
0 0 i 0
0 i 0 0
−i 0 0 0



 ˆγ3
=




0 0 1 0
0 0 0 −1
−1 0 0 0
0 1 0 0



 .
(5.2)
The presented matrices ˆγj
represent only one possible choice, but it is a good choice for describing
NMR as as the following sections show.
The solution of Eq. 5.1 is a wave function consisting of four components
Ψ =




ψ1
ψ2
ψ3
ψ4



 . (5.3)
The explicit form of the solution for a free particle is presented in Section 5.7.5. Note that the
solution is written as a four-component vector, but the indices 1, 2, 3, 4 are not related to time and
space coordinate. Instead, they represent new degrees of freedom, distinguishing diﬀerent spin states
and particles from antiparticles.
When postulated by Dirac, Eq. 5.1 naturally explained the behavior of particles with spin number
1/2 and predicted existence of antiparticles, discovered a few years later. Relation of Eq. 5.1 to the
non-relativistic Schr¨odinger equation is described in Section 5.7.6.
After describing the free particle, we should move to the description of particles interacting with
their surroundings, in particular with the electromagnetic ﬁelds. Strictly speaking, both spin-1/2
particles and the ﬁelds should be treated in the same manner, i.e., as quantum particles or, more
precisely, as states of various quantum ﬁelds. Such approach is reviewed in Engelke, Concepts Magn.
Reson. 36(A) (2010) 266-339, DOI 10.1002/cmr.a.20166. However, the energy of the electromagnetic
quanta (photons) used in NMR spectroscopy is low and their number is very high. As a consequence,
the quantum and classical1
description of the ﬁelds give almost identical results. As we try to keep
the theoretical description as simple as possible in this text, we follow with the classical description
of the electromagnetic ﬁeld.2
5.2 Operator of the spin magnetic moment
The Dirac equation allows us to ﬁnd the operator of the spin magnetic moment. We start by
deriving the Hamiltonian describing the energy of the spin magnetic moment in a magnetic ﬁeld
(Section 5.7.7). In a limit of energies much lower than the rest-mass energy m0c2
, the Hamiltonian
is
1
Here, ”classical” means ”non-quantum, but relativistic” because the Maxwell equations are consistent with the
special theory of relativity.
2
A consequence of the classical treatment of the electromagnetic ﬁelds is that we derive a value of the magnetogyric
ratio slightly lower than observed and predicted by the fully quantum approach. This fact is mentioned in Section 5.6.
5.3. OPERATORS OF SPIN ANGULAR MOMENTUM 123
ˆH ≈
1
2m0
i
∂
∂x
+ QAx
2
+ i
∂
∂y
+ QAy
2
+ i
∂
∂z
+ QAz
2
+ QV
1 0
0 1
−
Q
2m0
Bx
0 1
1 0
+ By
0 −i
i 0
+ Bz
1 0
0 −1
. (5.4)
The Hamiltonian contains a part (shown in green on the ﬁrst line) which is identical with the nonrelativistic
Hamiltonian in the Schr¨odinger equation describing a particle in an electromagnetic ﬁeld
(Eq. 4.26), but it also contains a new part (shown in red on the second line), which appears only in
the relativistic treatment (and survives the simpliﬁcation to the low-energy limit. This ”relativistic”
component closely resembles the Hamiltonian of the interaction of the orbital magnetic moment with
the magnetic ﬁeld (Eq. 4.41) and, as we discuss below, has all properties expected for the Hamiltonian
of the spin magnetic moment, despite the fact that we analyze a point-like particle which cannot
spin. Comparison of Eqs. 5.4 and 4.41 helps us to identify the operator of the components of the
spin magnetic moment:
ˆµx =
Q
2m0
0 1
1 0
, (5.5)
ˆµy =
Q
2m0
0 −i
i 0
, (5.6)
ˆµz =
Q
2m0
1 0
0 −1
. (5.7)
5.3 Operators of spin angular momentum
Our ﬁnal task is to ﬁnd the operators of the components of the spin angular momentum, which also
gives us the value of the magnetogyric ratio. Eq. 5.4 itself is not suﬃcient because it does not say
which constants belong to the spin angular momentum and which constitute the magnetogyric ratio.
We cannot use the classical deﬁnition either because our case does not have a classical counterpart.
But we can use
• the general relation between magnetic moment and angular momentum µ = γL and
• the commutation relations Eqs. 4.35–4.38, which deﬁne operators of x, y, z components of any
angular momentum.
In order to distinguish it from the orbital angular momentum L, we label the spin angular
momentum I, whereas we use the symbol µ for the spin magnetic moment. The operators of µx, µy, µz
are given by
ˆµx = γ ˆIx, ˆµx = γ ˆIy, ˆµx = γ ˆIz, (5.8)
124
and the operators of Ix, Iy, Iz must fulﬁll the same commutation relations as the operators of
Lx, Ly, Lz:
ˆIx
ˆIy − ˆIy
ˆIx = i ˆIz, ˆIy
ˆIz − ˆIz
ˆIy = i ˆIx, ˆIz
ˆIx − ˆIx
ˆIz = i ˆIy. (5.9)
Following the classical deﬁnition µ = γL, we can express the operators ˆIx, ˆIy, ˆIz as ˆµx/γ, ˆµx/γ,
ˆµx/γ, respectively, where ˆµx, ˆµx, ˆµx are already deﬁned by Eqs. 5.5–5.7. As shown in Section 5.7.8,
the commutation relations summarized in Eq. 5.9 require that the magnetogyric ratio diﬀers by a
factor of 2 from the value for orbital magnetic moment:
γ = 2
Q
2m
. (5.10)
When we divide deﬁnitions of ˆµx, ˆµx, ˆµx by this value of γ, we obtain the deﬁnition of the spin
operators
ˆIx =
2
0 1
1 0
ˆIy =
2
0 −i
i 0
ˆIz =
2
1 0
0 −1
ˆI2
=
3 2
4
1 0
0 1
. (5.11)
5.4 Eigenfunctions and eigenvalues of ˆIz
The fact that ˆIz is diagonal tells us that we have written the matrix representations of the operators
of the spin angular momentum in the basis formed by the eigenfunctions of ˆIz. This basis is a
good choice if the matrix representing Hamiltonian is also diagonal in this basis and, therefore,
eigenfunctions of ˆIz are the same as eigenfunctions of the Hamiltonian.3
These eigenfunctions can
be
1
h3
ψ
0
=
1
h3
ψ
1
0
,
1
h3
0
ψ
=
1
h3
ψ
0
1
, (5.12)
i.e., the two-component variants of the free-particle wave functions from Eq. 5.141 in the lowenergy
approximation (the explicit form of the four-component wave function and the normalization
factor h−3/2
are described in Section 5.7.5). The normalization coeﬃcient h−3/2
and ψ can be canceled
out in the eigenvalue equations and the eigenfunctions can be replaced by the vectors
1
0
,
0
1
(5.13)
corresponding to the ﬁrst and second wave functions in Eq. 5.141.
The states represented by the eigenfunctions of ˆIz (eigenstates) are traditionally called states α
and β and are further discussed in Section 5.5. The eigenfunctions of ˆIz are usually labeled as |α
or | ↑ and |β or | ↓ :
3
This is a good choice, because such eigenfunctions represent states that are stationary, as was shown in Section 4.6
an is further discussed in Section 5.5.
5.5. EVOLUTION, EIGENSTATES AND ENERGY LEVELS 125
ˆIz|α = +
2
|α ˆIz| ↑ = +
2
| ↑
2
1 0
0 −1
1
0
= +
2
1
0
, (5.14)
ˆIz|β = −
2
|β ˆIz| ↓ = −
2
| ↓
2
1 0
0 −1
0
1
= −
2
0
1
. (5.15)
The physical signiﬁcance of the found eigenvalues ± /2 is discussed in Section 5.7.9.
Note that the vectors used to represent |α and |β in Eqs. 5.14 and 5.15 are not the only choice.
Vectors in Eqs. 5.14 and 5.15 have a phase set to zero (they are made of real numbers). Any other
phase φ would work as well, e.g.
1
0
→
eiφ
0
. (5.16)
The postulates of quantum mechanics, discussed in the preceding lecture, tell us that measurement
of spin angular momentum or spin magnetic moment of a single particle is limited by quantum
indeterminacy, described bellow and shown in Figure 5.1.
• If the particle is in state |α , the result of measuring Iz is always + /2. The expected value is
Iz = α|Iz|α = 1 0
2
1 0
0 −1
1
0
= +
2
. (5.17)
• If the particle is in state |β , the result of measuring Iz is always − /2. The expected value is
Iz = β|Iz|β = 0 1
2
1 0
0 −1
0
1
= −
2
. (5.18)
• Any state cα|α + cβ|β is possible, but the result of a single measurement of Iz is always + /2
or − /2. However, the expected value of Iz is
Iz = α|Iz|β = c∗
α c∗
β
2
1 0
0 −1
cα
cβ
= (|cα|2
− |cβ|2
)
2
. (5.19)
Wave functions |α and |β are not eigenfunctions of ˆIx or ˆIy. Eigenfunctions of ˆIx and ˆIy are
presented in Section 5.7.10
5.5 Evolution, eigenstates and energy levels
Knowledge of the Hamiltonian allows us to describe how the studied system evolves. We have learnt
in Section 4.6 that states corresponding to eigenfuctions, i.e., the eigenstates, are stationary. This
is shown for the eigenfunctions of ˆIz in Section 5.7.11 and in Figure 5.2. If the system is in the
stationary state, its eigenvalue does not change in time. Therefore, a system in a state described by
126
A |Ψ = |α
−γ 2
+γ 2
1 2 3 4 5 6 7 8 9 10
µz = +γ 2
µz
B |Ψ = |β
−γ 2
+γ 2
1 2 3 4 5 6 7 8 9 10
µz = −γ 2
µz
C |Ψ = 1√
2
|α + 1√
2
|β
−γ 2
+γ 2
1 2 3 4 5 6 7 8 9 10
µz = 0
µz
Measurement number
Figure 5.1: Plot of hypothetical results of individual measurements of the z components of the magnetic moment of
a spin-1/2 particle in a vertical magnetic ﬁeld B0. Individual measured values (equal to one of eigenvalues of ˆµz) and
average measured values (equal to the expectation value µz ) are shown as red circles and green arrows, respectively,
for a particle in the α eigenstate (A), in the β eigenstate (B) and in the superposition state described by 1√
2
|α + 1√
2
|β
(C).
5.6. REAL PARTICLES 127
an eigenfunction of the Hamiltonian can be associated with a certain eigenvalue of the Hamiltonian,
i.e., with a certain energy.
The states described by basis functions which are eigenfunctions of the Hamiltonian do not
evolve (are stationary). It makes sense to draw energy level diagram for such states, with
energy of each state given by the corresponding eigenvalue of the Hamiltonian. Energy of the
|α state is − ω0/2 and energy of the |β state is + ω0/2. The measurable quantity is the
energy diﬀerence ω0, corresponding to the angular frequency ω0.
In general, the studied system can be present in a state that is not described by a single eigenfunction,
but by a linear combination (superposition) of eigenfunctions. A shown in Section 5.7.12
and in Figure 5.2, such a superposition state evolves in time and cannot be associated with a single
energy.
The states described by basis functions diﬀerent from eigenfunctions of the Hamiltonian are
not stationary but oscillate between |α and |β with the angular frequency ω1, given by the
diﬀerence of the eigenvalues of the Hamiltonian (− ω1/2 and ω1/2).
It should be stressed that eigenstates of individual magnetic moments are not eigenstates of the
macroscopic ensembles of nuclear magnetic moments. Eigenstates of individual magnetic moments
do not determine the possible result of measurement of bulk magnetization. We present the correct
description of large ensembles in the next lecture.
Our ability to analyze evolution of the coeﬃcients cα and cβ also allows us to describe rigorously
the eﬀect of radio waves on the spin states. Detailed analysis presented in Sections 5.7.13–5.7.17
shows that equations describing weak ﬁelds oscillating in one direction (a physically realistic model of
a radio wave) have the same form as equations describing rotating ﬁelds (such ﬁelds are not applied
in reality, but equations describing their eﬀects on spin states have simple analytical solutions).
5.6 Real particles
Eq. 5.4, used to derive the value of γ, describes interaction of a particle with an external electromagnetic
ﬁeld. However, charged particles are themselves sources of electromagnetic ﬁelds. Therefore, γ
is not exactly twice Q/2m. In general, the value of γ is
γ = g
Q
2m
, (5.20)
where the constant g include corrections for interactions of the particle with its own ﬁeld (and
other eﬀects). For electron, the corrections are small and easy to calculate in the fully quantum
approach (quantum electrodynamics). The current theoretical prediction is g = 2.0023318361(10),
compared to a recent experimental measured value of g = 2.0023318416(13). On the other hand,
”corrections” for the constituents of atomic nuclei, quarks, are two orders of magnitude higher than
the basic value of 2! It is because quarks are not ”bare” as electrons, they are conﬁned in protons
and nucleons, ”dressed” by interactions, not only electromagnetic, but mostly strong nuclear with
gluon. Therefore, the magnetogyric ratio of the proton is diﬃcult to calculate and we rely on its
experimental value. Everything is even more complicated when we go to higher nuclei, consisting of
multiple protons and neutrons. In such cases, adding spin angular momenta represents another level
128
A |Ψ (t = 0) = |α ; ˆH = −γB0
ˆIz = ω0
ˆIz
0
1
Pα
B |Ψ (t = 0) = |β ; ˆH = −γB0
ˆIz = ω0
ˆIz
0
1
Pα
C |Ψ (t = 0) = |α ; ˆH = −γB1
ˆIx = ω1
ˆIx
0
1
1/ω1
Pα
t
Figure 5.2: Evolution of the probability Pα that a spin-1/2 particle is found in the α state: for a particle in a vertical
magnetic ﬁeld B0 and in the α eigenstate at t = 0 (A), for a particle in a vertical magnetic ﬁeld B0 and in the β
eigenstate at t = 0 (B), and for a particle in a horizontal magnetic ﬁeld B1 and in the α state at t = 0 (C). The states
α and β are represented by eigenfunctions of ˆIz (Panels A and B), but |α is not an eigenfunction of ˆIx.
5.6. REAL PARTICLES 129
Table 5.1: Values of the magnetogyric ratios of selected nuclei
Nucleus magnetogyric ratio
1
1H 267.513 × 106
rad.s−1
.T−1
13
6C 67.262 × 106
rad.s−1
.T−1
15
7N −27.116 × 106
rad.s−1
.T−1
19
9F 251.662 × 106
rad.s−1
.T−1
31
15P 108.291 × 106
rad.s−1
.T−1
electron 176 085.952 × 106
rad.s−1
.T−1
of complexity. Fortunately, all equations derived for the electron also apply to nuclei with the same
eigenvalues of spin magnetic moments (spin-1/2 nuclei), if the value of γ is replaced by the correct
value for the given nucleus.4
Magnetogyric ratios of the nuclei observed most frequently are listed in
Table 5.1
HOMEWORK
Check that you understand how commutators of the operator of the orbital angular momentum are
derived (Section 4.9.12) and derive the Hamiltonian of the spin magnetic moment (Section 5.7.7).
4
NMR in organic chemistry and biochemistry is usually limited to spin-1/2 nuclei because signal decays too fast if
the spin number is grater than 1/2.
130
5.7 SUPPORTING INFORMATION
5.7.1 Special theory of relativity
The ﬁrst postulate of the special theory of relativity says that physical laws have the same form in all inertial coordinate frames. Two
inertial frames can diﬀer in the orientation in space. Vectors expressed in one frame can be transformed to those in another frame using
relations presented in Section 1.5.3. But two inertial frames can also diﬀer in velocity. For example, we may describe one coordinate system
as stationary and another one as moving with a constant velocity v. Galileo proposed that velocity in the stationary system is a vector
sum of the velocity in the moving system and of v. If we describe velocity in the stationary system as dr/dt (change of the position r in
time), and velocity in the moving system as dr /dt, we can write the Galilean transformation as
dr
dt
=
dr
dt
+ v. (5.21)
Note that both dr and dr are diﬀerentiated with respect to the same time t. Galileo expected that time is a global parameter, same
in all systems.
dt = dt . (5.22)
Eqs. 5.21 and 5.22 obviously imply
dr = r + vdt. (5.23)
We can use the direction of v to deﬁne the x axis and to write for individual coordinates
dx = dx + vdt (5.24)
dy = dy (5.25)
dz = dz (5.26)
The Galilean transformation yields correct results for v much slower than the speed of light c.
The second postulate of the special theory of relativity says that the value of c is the same in all inertial coordinate frames. This
contradicts the Galilean transformation and another transformation relations must be searched for. The relations that satisfy both
postulates of the special theory are known as Lorentz transformation. We are not going to derive them rigorously, but we present simple
arguments that point to them (without proving assumptions that we make).
If anything moves with a constant speed u in the stationary system, or u in a moving system,
dr = udt ⇒ dr2
− u2
dt2
= dx2
+ dy2
+ dz2
− u2
dt2
= 0 (5.27)
and
dr = u dt ⇒ dr
2
− u
2
dt
2
= dx
2
+ dy
2
+ dz
2
− u
2
dt
2
= 0, (5.28)
where the expression with squares do not depend on the directions of u and u . Note that we are ready to accept that time may be
diﬀerent in the coordinate frames (we distinguish t from t).
We know that the Galilean transformation dr = dr + vdt is correct for low speeds. Therefore, we can assume that for u = 0 and
u = 0
dr2
− u2
dt2
= dr2
= dx2
+ dy2
+ dz2
= dr
2
− u
2
dt
2
= dr
2
= kx (dx − vdt)2
+ kydy2
+ kzdz2
, (5.29)
where the unknown coeﬃcients kx, ky, and kz must tend to one for v c.
The second postulate of the special theory of relativity requires that for u = |u| = c
dr2
− |u|2
dt2
= dr2
− c2
dt2
= dx2
+ dy2
+ dz2
− c2
dt2
= 0 (5.30)
but also
dr
2
− |u |2
dt
2
= dr
2
− c2
dt
2
= dx
2
+ dy
2
+ dz
2
− c2
dt
2
= 0. (5.31)
Combination of Eqs. 5.29–5.31 gives
dx2
+ dy2
+ dz2
− c2
dt2
= kx (dx − vdt)2
+ kydy2
+ kzdz2
− c2
dt
2
= kx dx2
− 2vdxdt + v2
dt2
+ kydy2
+ kzdz2
− c2
dt
2
. (5.32)
Obviously, dt cannot be equal to dt, but the transformation of dt must generate a term that would cancel the red term in Eq. 5.32.
We can guess that the following relation does what we need:
5.7. SUPPORTING INFORMATION 131
dt = kx dt −
1
c2
vdx ⇒ dt
2
= kx dt2
−
2
c2
vdxdt +
1
c4
v2
dx2
. (5.33)
Plugging this into Eq. 5.32,
dx2
+ dy2
+ dz2
− c2
dt2
= kx dx2
− 2vdxdt + v2
dt2
+ kydy2
+ kzdz2
− kx c2
dt2
− 2vdxdt +
1
c2
v2
dx2
= kx dx2
+ v2
dt2
− c2
dt2
−
v2
c2
dx2
+ kydy2
+ kzdz2
= kx 1 −
v2
c2
dx2
− c2
dt2
+ kydy2
+ kzdz2
. (5.34)
The left-hand side is equal to the right-hand side if ky = kz = 1 and kx = 1/(1 − v2/c2). In summary, the transformation consistent
with both postulates of special theory of relativity is
ct =
ct − vx/c
1 − v2/c2
= +
1
1 − v2/c2
ct −
v/c
1 − v2/c2
x = +
c
√
c2 − v2
ct −
v
√
c2 − v2
x, (5.35)
x =
x − vt
1 − v2/c2
= −
v/c
1 − v2/c2
ct +
1
1 − v2/c2
x = −
v
√
c2 − v2
ct +
c
√
c2 − v2
x, (5.36)
y = y, (5.37)
z = z, (5.38)
where the direction of v deﬁnes the x axis. Transformations between inertial coordinate systems with other directions of v are
described in Section 1.5.3. For example, transformation to a coordinate system where v has a diﬀerent orientation in the xy plane
(vx = cos ϕ, vy = sin ϕ) corresponds to a rotation of the coordinate frame about the z axis by the azimuth angle ϕ:
ct = ct, (5.39)
x = +(cos ϕ)x + (sin ϕ)y = +
vx
v
x −
vy
v
y = +
vx
vx
2
+ vy
2
x −
vy
vx
2
+ vy
2
y, (5.40)
y = −(sin ϕ)x + (cos ϕ)y = +
vy
v
x +
vx
v
y = +
vy
vx
2
+ vy
2
x +
vx
vx
2
+ vy
2
y, (5.41)
z = z, (5.42)
Note that time and space coordinates are not independent in the special theory of relativity (as they were in Newton mechanics). Eqs.
5.30 and 5.31 tell us that
dx2
+ dy2
+ dy2
− c2
dt2
= dx
2
+ dy
2
+ dz
2
− c2
dt
2
. (5.43)
If the ﬁrst system is stationary, i.e., the position r does not change, then dx = dy = dy = 0 and
c2
dt2
= c2
dt2
0 = c2
dt
2
− dx
2
− dy
2
− dz
2
. (5.44)
The time measured in the stationary system is called proper time, we label it t0, and it describes diﬀerence in time between two events
that occur at the same position. The quantity c2dt2 − dx2 − dy2 − dz2, called space-time interval, is invariant (the same in all inertial
frames), equal to c2dt2
0.
Eq. 5.44 is analogous to the Pythagorean theorem in a four-dimensional space, called space-time, consisting of time and three space
dimensions. Events in the space-time are described by four-vectors (ct, x, y, z). Note however, that the space-time does not have Euclidean
geometry as the familiar three-dimensional space of the x, y, z dimensions. This is why the minus signs appear in Eq. 5.44, in contrast to
the Pythagorean theorem r2 = x2 + y2 + z2. Square in the space-time is calculated as
c2
dt2
0 = cdt dx dy dz




1 0 0 0
0 −1 0 0
0 0 −1 0
0 0 0 −1








cdt
dx
dy
dz



 = c2
dt2
− dx2
− dy2
− dz2
. (5.45)
Transformation of four-vectors can be described by four-dimensional matrices, obtained by extending the three-dimensional matrices
of Section 1.5.3 to the time dimension. For example, the transformations discussed above (change of velocity in the x direction and rotation
in the xy plane) can be described as
132




ct
x
y
z



 =






+ 1√
1−v2/c2
−
v/c
√
1−v2/c2
0 0
−
v/c
√
1−v2/c2
+ 1√
1−v2/c2
0 0
0 0 1 0
0 0 0 1










ct
x
y
z



 =





+ c√
c2−v2
− v√
c2−v2
0 0
− v√
c2−v2
+ c√
c2−v2
0 0
0 0 1 0
0 0 0 1









ct
x
y
z



 (5.46)
and




ct
x
y
z



 =




1 0 0 0
0 + cos ϕ + sin ϕ 0
0 − sin ϕ + cos ϕ 0
0 0 0 1








ct
x
y
z



 =








1 0 0 0
0 +
vx
vx
2+vy
2
−
vy
vx
2+vy
2
0
0 +
vy
vx
2+vy
2
+
vx
vx
2+vy
2
0
0 0 0 1












ct
x
y
z



 , (5.47)
respectively.
5.7.2 Relativistic momentum and energy
According to the ﬁrst postulate of special theory of relativity, the second Newton’s law
F =
dp
dt
(5.48)
must have the same form in all inertial frames. In Newton’s mechanics, the momentum p = mdr/dt, where the velocity dr/dt is
measured diﬀerently in diﬀerent frames. In order to keep the second Newton’s law the same in all inertial frames, momentum is deﬁned in
the special theory of relativity as
p = m0
dr
dt0
= m0
dr
dt
dt
dt0
=
m0u
1 − u2/c2
, (5.49)
where u = dr/dt is the velocity in the reference frame, not a velocity of the reference frame relative to another coordinate frame.
Eq 5.35 was used to evaluate dt = dt0/ 1 − u2/c2 (note that dx = 0 for a stationary system). By writing m0, we stress that we use mass
measured in the stationary system (rest mass).
Let us assume that work done by a force along a ceratin path is converted to the kinetic energy:
Ekin = −W12 =
r2ˆ
r1
F · dr =
t2ˆ
t1
dp
dt
· udt =
t2ˆ
t1
d
dt
m0u
1 − u2/c2
· udt = m0
u2ˆ
0
ud
u
1 − u2/c2
, (5.50)
where the velocity increased from zero to u2. The ﬁnal integral is well suited for integration by parts (per partes):
Ekin = m0
u2ˆ
0
ud
u
1 − u2/c2
= m0
u2
2
1 − u2
2/c2
− m0
u2ˆ
0
u
1 − u2/c2
du. (5.51)
We notice that
d 1 − u2/c2
1
2
du
= −
u
c2
1 − u2
/c2 − 1
2 ⇒ −
u
1 − u2/c2
du = c2
d 1 − u2/c2 (5.52)
and consequently
Ekin = m0
u2
2
1 − u2
2/c2
+ m0c2
1−u2
2/c2
ˆ
1
d 1 − u2/c2 = m0
u2
2
1 − u2
2/c2
+ m0c2
1 − u2
2/c2 − m0c2
= m0
u2
2 + c2 − u2
2
1 − u2
2/c2
− m0c2
=
m0c2
1 − u2
2/c2
− m0c2
(5.53)
Relabeling the ﬁnal velocity u2 → u,
5.7. SUPPORTING INFORMATION 133
Ekin =
m0c2
1 − u2/c2
− m0c2
, (5.54)
or
m0c2
1 − u2/c2
= Et = Ekin + m0c2
, (5.55)
Where Et = m0c2/ 1 − u2/c2 has the form of total energy, consisting of Ekin and Einstein’s famous m0c2, playing the role of a
potential energy. We can now compare relativistic deﬁnitions of momentum and energy:
Et =
m0c2
1 − u2/c2
⇒ E2
t =
m2
0c4
1 − u2/c2
(5.56)
p =
m0u
1 − u2/c2
⇒ p2
=
m2
0u2
1 − u2/c2
. (5.57)
This comparison shows that
E2
t − p2
c2
=
m2
0c4 − m2
0u2c2
1 − u2/c2
=
m2
0c4 1 − u2/c2
1 − u2/c2
= m2
0c4
. (5.58)
Note that both m0 (the rest mass) and c are identical in all inertial frames (m0 by deﬁnition, c by the second postulate of the special
theory of relativity). We have found another invariant, the diﬀerence E2
t − p2c2, which is the same in all inertial coordinate frames. This
invariant is the space-time square of the momentum four-vector (Et, cpx, cpy, cpz).
5.7.3 Relativistic quantum mechanics
We found in Section 5.7.2 that the special theory of relativity requires that the quantity E2
t − p2c2 is equal to the invariant m2
0c4. This
can be written as
− E2
t + c2
p2
x + c2
p2
y + c2
p2
z + m2
0c4
= 0 (5.59)
Let us look for an operator which represents the quantity −E2
t + c2p2
x + c2p2
y + c2p2
z + m2
0c4. We know that for a monochromatic wave
function
ψ = e
i (pxx+pyy+pzz−Ett)
, (5.60)
partial derivatives of ψ serve as operators of energy and momentum:
i
∂ψ
∂x
= −pxψ i
∂ψ
∂y
= −pyψ i
∂ψ
∂z
= −pzψ i
∂ψ
∂t
= Etψ. (5.61)
Therefore, the operator of −E2
t + c2p2
x + c2p2
y + c2p2
z + m2
0c4 should have a form
2 ∂2
∂t2
− c2 2 ∂2
∂z2
− c2 2 ∂2
∂x2
− c2 2 ∂2
∂y2
+ (m0c2
)2
. (5.62)
Eq. 5.62 fulﬁlls the requirements of the special theory of relativity, but it contains the second time derivative. As discussed in
Section 4.9.11, an attempt to use Eq. 5.62 to describe evolution of the quantum system in time is not consistent with our ﬁrst postulate of
quantum mechanics and with our interpretation of Ψ∗Ψ as the probability density. Therefore, we look for an operator that contains only
the ﬁrst time derivative and allows us to formulate the equation(s) of motion that is in agreement with the special theory of relativity and
with the postulates of quantum mechanics. As this problem is not easy to solve, we will proceed step by step. Let us ﬁrst assume that
particles do not move, i.e., p = 0. Then, Eq. 5.59 simpliﬁes to
− E2
t + m2
0c4
= 0, (5.63)
which can be written as
(−Et + m0c2
)(Et + m0c2
) = 0, (5.64)
Using the operator of energy,
2 ∂2ψ
∂t2
+ (m0c2
)2
ψ = −E2
t + m2
0c4
ψ = 0 (5.65)
134
if ψ is an eigenfunction of the energy operator. The operator of −E2
t + m2
0c4 (let us call it ˆO2) can be obtained by a subsequent
application of operators ˆO+ and ˆO− that provide the following equations of motion:
i
∂
∂t
− m0c2
ψ = ˆO+
ψ = 0, (5.66)
−i
∂
∂t
− m0c2
ψ = ˆO−
ψ = 0. (5.67)
The operators ˆO− and ˆO+ can be viewed as ”square roots” of ˆO2:
ˆO2
ψ ≡ 2 ∂2ψ
∂t2
+ (m0c2
)2
ψ = i
∂
∂t
− m0c2
−i
∂
∂t
− m0c2
ψ = ˆO+ ˆO−
ψ 0. (5.68)
What are the eigenfuctions? One solution is a wave described by Eq. 5.60 with px = py = pz = 0. We can prove it by checking that
calculating the time derivatives give us the eigenvalues (see the green terms in the following equation):
i
∂
∂t
− m0c2
−i
∂
∂t
− m0c2
e
i (−Ett)
= i
∂
∂t
− m0c2
−Et − m0c2
e
i (−Ett)
= −Et − m0c2
i
∂
∂t
− m0c2
e
i (−Ett)
= −Et − m0c2
Et − m0c2
e
i (−Ett)
= m2
0c4
− E2
t e
i (−Ett)
= 0. (5.69)
But the complex conjugate of the wave described by Eq. 5.60 is another possible solution:
i
∂
∂t
− m0c2
−i
∂
∂t
− m0c2
e
i (Ett)
= i
∂
∂t
− m0c2
Et − m0c2
e
i (Ett)
= Et − m0c2
i
∂
∂t
− m0c2
e
i (Ett)
= Et − m0c2
−Et − m0c2
e
i (Ett)
= m2
0c4
− E2
t e
i (Ett)
= 0. (5.70)
The second eigenfunction can be interpreted as a particle with a positive energy moving backwards in time, or as an antiparticle
moving forward in time.
Let us now turn our attention to particles that can move (p = 0). For the most interesting particles as electron or quarks, the operator
ˆO2 should have the form described by Eq. 5.62
ˆO2
ψ = 2 ∂2
∂t2
− c2 2 ∂2
∂z2
− c2 2 ∂2
∂x2
− c2 2 ∂2
∂y2
+ (m0c2
)2
ψ. (5.71)
Let us try to ﬁnd ”square roots” of the operator ˆO2 for a particle with a momentum p. In Eq. 5.68, ˆO+ and ˆO− were complex
conjugates. A similar choice for a particle with a momentum p, i.e.,5
ˆO+
ψ = i
∂
∂t
+ ic
∂
∂x
+ ic
∂
∂y
+ ic
∂
∂z
− m0c2
ψ (5.72)
ˆO−
ψ = −i
∂
∂t
− ic
∂
∂x
− ic
∂
∂y
− ic
∂
∂z
− m0c2
ψ (5.73)
gives
ˆO− ˆO+ψ = ˆO2ψ = 2 ∂2
ψ
∂t2 +c 2 ∂ψ
∂t
∂ψ
∂x
+c 2 ∂ψ
∂t
∂ψ
∂y
+c 2 ∂ψ
∂t
∂ψ
∂z
−im0c2 ∂ψ
∂t
+c 2 ∂ψ
∂x
∂ψ
∂t
+ 2 ∂2
ψ
∂x2 +c 2 ∂ψ
∂x
∂ψ
∂y
+c 2 ∂ψ
∂x
∂ψ
∂z
−im0c3 ∂ψ
∂x
+c 2 ∂ψ
∂y
∂ψ
∂t
+c 2 ∂ψ
∂y
∂ψ
∂x
+ 2 ∂2
ψ
∂y2 +c 2 ∂ψ
∂y
∂ψ
∂z
−im0c3 ∂ψ
∂y
+c 2 ∂ψ
∂z
∂ψ
∂t
+c 2 ∂ψ
∂z
∂ψ
∂x
+c 2 ∂ψ
∂z
∂ψ
∂y
+ 2 ∂2
ψ
∂z2 −im0c3 ∂ψ
∂z
+im0c2 ∂ψ
∂t
+im0c3 ∂ψ
∂x
+im0c3 ∂ψ
∂y
+im0c3 ∂ψ
∂z
+(m0c2)2ψ
(5.74)
with the correct ﬁve square terms shown in blue, but also with additional twelve unwanted mixed terms shown in red (the green terms
for t, x, y, z cancel each other.
As the second trial, let us try (na¨ıvely) to get rid of the unwanted mixed terms by introducing coeﬃcients γj that hopefully cancel
them:
5
It make sense to look for an operator which depends on time and space coordinates in a similar manner because time and space play
similar roles in quantum mechanics. As the ﬁrst time derivative is our requirement, the equation should contain also the ﬁrst derivatives
∂/∂x, ∂/∂y, ∂/∂z.
5.7. SUPPORTING INFORMATION 135
ˆO+
ψ = i
∂
∂t
γ0 + ic
∂
∂x
γ1 + ic
∂
∂y
γ2 + ic
∂
∂z
γ3 − m0c2
ψ
(5.75)
ˆO−
ψ = −i
∂
∂t
γ0 − ic
∂
∂x
γ1 − ic
∂
∂y
γ2 − ic
∂
∂z
γ3 − m0c2
ψ.
(5.76)
Then,
ˆO− ˆO+ψ = ˆO2ψ = γ2
0
2 ∂2
ψ
∂t2 +γ0γ1c 2 ∂ψ
∂t
∂ψ
∂x
+γ0γ2c 2 ∂ψ
∂t
∂ψ
∂y
+γ0γ3c 2 ∂ψ
∂t
∂ψ
∂z
−iγ0m0c2 ∂ψ
∂t
+γ1γ0c 2 ∂ψ
∂x
∂ψ
∂t
+γ2
1
2 ∂2
ψ
∂x2 +γ1γ2c 2 ∂ψ
∂x
∂ψ
∂y
+γ1γ3c 2 ∂ψ
∂x
∂ψ
∂z
−iγ1m0c3 ∂ψ
∂x
+γ2γ0c 2 ∂ψ
∂y
∂ψ
∂t
+γ2γ1c 2 ∂ψ
∂y
∂ψ
∂x
+γ2
2
2 ∂2
ψ
∂y2 +γ2γ3c 2 ∂ψ
∂y
∂ψ
∂z
−iγ2m0c3 ∂ψ
∂y
+γ3γ0c 2 ∂ψ
∂z
∂ψ
∂t
+γ3γ1c 2 ∂ψ
∂z
∂ψ
∂x
+γ3γ2c 2 ∂ψ
∂z
∂ψ
∂y
+γ2
3
2 ∂2
ψ
∂z2 −iγ3m0c3 ∂ψ
∂z
+iγ0m0c2 ∂ψ
∂t
+iγ1m0c3 ∂ψ
∂x
+iγ2m0c3 ∂ψ
∂y
+iγ3m0c3 ∂ψ
∂z
+(m0c2)2ψ.
(5.77)
Obviously, the green terms with −iγjm0c2 cancel each other, which removes eight unwanted terms. Can we also remove the remaining
dozen of unwanted mixed derivative terms? In order to do it, we need the following conditions to be fulﬁlled:
γ2
0 = 1, (5.78)
γ2
1 = −1 γ2
2 = −1 γ2
3 = −1 (5.79)
and
γjγk + γkγj = 0 for j = k. (5.80)
These conditions are clearly in conﬂict. The ﬁrst four condition require γj to be ±1 or ±i, but the last condition requires them to be
zero. There are no complex numbers that allow us to get the correct operator ˆO2. However, there are mathematical objects that can fulﬁl
the listed conditions simultaneously. Such objects are matrices.
Let us replace the coeﬃcients γj in Eqs. 5.75–5.75 by matrices6 ˆγj:
ˆO+
Ψ = i
∂
∂t
ˆγ0
+ ic
∂
∂x
ˆγ1
+ ic
∂
∂y
ˆγ2
+ ic
∂
∂z
ˆγ3
− m0c2ˆ1 Ψ = 0 (5.81)
ˆO−
Ψ = −i
∂
∂t
ˆγ0
− ic
∂
∂x
ˆγ1
− ic
∂
∂y
ˆγ2
− ic
∂
∂z
ˆγ3
− m0c2ˆ1 Ψ = 0. (5.82)
As ˆγj are matrices, the wave function must be a vector composed of several waves ψk. This is emphasized by changing ψ (representing
a monochromatic wave) to Ψ (representing a vector of monochromatic waves) in the equations.
We need a set of four matrices ˆγj with the following properties:
ˆγ0
· ˆγ0
= ˆ1, (5.83)
ˆγ1
· ˆγ1
= −ˆ1 ˆγ2
· ˆγ2
= −ˆ1 ˆγ3
· ˆγ3
= −ˆ1 (5.84)
and
ˆγj
· ˆγk
+ ˆγk
· ˆγj
= ˆ0 for j = k. (5.85)
In addition, there is a physical restriction. We know that the operator of energy (Hamiltonian) is
ˆH = i
∂
∂t
. (5.86)
We can get the Dirac Hamiltonian by multiplying Eq. 5.81 by ˆγ0 from left:
i
∂
∂t
ˆ1Ψ = −ic
∂
∂x
ˆγ0
· ˆγ1
− ic
∂
∂y
ˆγ0
· ˆγ2
− ic
∂
∂z
ˆγ0
· ˆγ3
+ m0c2
ˆγ0
Ψ. (5.87)
6
In relativistic quantum mechanics, these matrices can be treated as four components of a four-vector. There are two types of
four-vectors (contravariant and covariant) which transform diﬀerently. There is a convention to distinguish these two types by writing
components of covariant vectors with lower indices and components of contravariant vectors with upper indices. To keep this convention,
we label the gamma matrices with upper indices, do not confuse them with power!
136
Operator of any measurable quantity must be Hermitian ( ψ| ˆOψ = ˆOψ|ψ ) in order to give real values of the measured quantity (see
Section 4.3). Since the terms in the Hamiltonian are proportional to ˆγ0 or to ˆγ0 · ˆγj, all these matrices must be Hermitian (the elements
in the j-th row and k-th column must be equal to the complex conjugates of the elements in the k-th row and j-th column for each j and
k, see footnote 4 in Lecture 4.).
5.7.4 Finding the matrices
Our task is to ﬁnd Hermitian matrices fulﬁlling the criteria imposed by Eqs. 5.83–5.85. We have a certain liberty in choosing the matrices.
A matrix equation is nothing else than a set of equations. One of the matrices can be always chosen to be diagonal. Let us assume that
ˆγ0 is diagonal.7 How should the diagonal elements of ˆγ0 look like? In order to fulﬁll Eq. 5.83, the elements must be +1 or −1.
Another requirement follows from a general property of matrix multiplication: Trace (sum of the diagonal elements) of the matrix
product ˆA · ˆB is the same as that of ˆB · ˆA. Let us assume that ˆA = ˆγj and ˆB = ˆγ0 · ˆγj. Then,
Tr{ˆγj
· ˆγ0
· ˆγj
} = Tr{ˆγj
· ˆγj
· ˆγ0
}. (5.88)
But Eq. 5.85 tells us that ˆγ0 · ˆγj = −ˆγj · ˆγ0. Therefore, the left-hand side of Eq. 5.88 can be written as Tr{ˆγj · (−ˆγj) · ˆγ0}, resulting in
− Tr{ˆγj
· ˆγj
· ˆγ0
} = Tr{ˆγj
· ˆγj
· ˆγ0
}, (5.89)
and using Eq. 5.84
Tr{ˆγ0
} = −Tr{ˆγ0
}. (5.90)
It can be true only if the trace is equal to zero. Consequently, the diagonal of ˆγ0 must contain the same number of +1 and −1
elements. It also tells us that the dimension of the ˆγj matrices must be even. Can they be two-dimensional?
No, for the following reason. The four ˆγj matrices must be linearly independent, and it is impossible to ﬁnd four linearly independent
2 × 2 matrices so that all fulﬁll Eq. 5.85.8
Is it possible to ﬁnd four-dimensional ˆγj matrices? Yes. We start by choosing
ˆγ0
=




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 (5.91)
(the diagonal must contain two +1 elements and two −1 elements, their order is arbitrary, but predetermines forms of the other
matrices).
Being diagonal, ˆγ0 is of course Hermitian. The ˆγ0 · ˆγj products




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 ·





γj
1,1 γj
1,2 γj
1,3 γj
1,4
γj
2,1 γj
2,2 γj
2,3 γj
2,4
γj
3,1 γj
3,2 γj
3,3 γj
3,4
γj
4,1 γj
4,2 γj
4,3 γj
4,4





=





γj
1,1 γj
1,2 γj
1,3 γj
1,4
γj
2,1 γj
2,2 γj
2,3 γj
2,4
−γj
3,1 −γj
3,2 −γj
3,3 −γj
3,4
−γj
4,1 −γj
4,2 −γj
4,3 −γj
4,4





(5.92)
must be also Hermitian, i.e.,





γj
1,1 γj
1,2 γj
1,3 γj
1,4
γj
2,1 γj
2,2 γj
2,3 γj
2,4
−γj
3,1 −γj
3,2 −γj
3,3 −γj
3,4
−γj
4,1 −γj
4,2 −γj
4,3 −γj
4,4





=





(γj
1,1)∗ (γj
2,1)∗ −(γj
3,1)∗ −(γj
4,1)∗
(γj
1,2)∗ (γj
2,2)∗ −(γj
3,2)∗ −(γj
4,2)∗
(γj
1,3)∗ (γj
2,3)∗ −(γj
3,3)∗ −(γj
4,3)∗
(γj
1,4)∗ (γj
2,4)∗ −(γj
3,4)∗ −(γj
4,4)∗





. (5.93)
7
This is a good choice because it results in a diagonal matrix representing the Hamiltonian, which is convenient.
8
If the ˆγj matrices are linearly independent, they can be used as a basis. If they constitute a basis, there must exist a linear combination
of ˆγj giving any 2 × 2 matrix, e.g., the unit matrix ˆ1: ˆ1 = c0ˆγ0 + c1ˆγ1 + c2ˆγ2 + c3ˆγ3. Let us now multiply this equation by ˆγ0 from left
(and use Eq. 5.83)
ˆγ0
= c0ˆ1 + c1ˆγ0
· ˆγ1
+ c2ˆγ0
· ˆγ2
+ c3ˆγ0
· ˆγ3
,
then from right
ˆγ0
= c0ˆ1 + c1ˆγ1
· ˆγ0
+ c2ˆγ2
· ˆγ0
+ c3ˆγ3
· ˆγ0
,
and sum both equations. If the matrices fulﬁll Eq. 5.85, the result must be 2ˆγ0 = 2c0ˆ1, but this cannot be true because we need ˆγ0 with
a zero trace and the trace of the unit matrix ˆ1 is obviously not zero.
5.7. SUPPORTING INFORMATION 137
At the same time, Eq. 5.85 requires ˆγ0 · ˆγj = −ˆγj · ˆγ0





γj
1,1 γj
1,2 γj
1,3 γj
1,4
γj
2,1 γj
2,2 γj
2,3 γj
2,4
−γj
3,1 −γj
3,2 −γj
3,3 −γj
3,4
−γj
4,1 −γj
4,2 −γj
4,3 −γj
4,4





= −





γj
1,1 γj
1,2 γj
1,3 γj
1,4
γj
2,1 γj
2,2 γj
2,3 γj
2,4
γj
3,1 γj
3,2 γj
3,3 γj
3,4
γj
4,1 γj
4,2 γj
4,3 γj
4,4





·




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 =





−γj
1,1 −γj
1,2 γj
1,3 γj
1,4
−γj
2,1 −γj
2,2 γj
2,3 γj
2,4
−γj
3,1 −γj
3,2 γj
3,3 γj
3,4
−γj
4,1 −γj
4,2 γj
4,3 γj
4,4





, (5.94)
which is possible only if the red elements are equal to zero. Eq. 5.93 shows that the blue elements form two adjoint 2 × 2 matrices for
each j > 0:
ˆγj
=





0 0 γj
1,3 γj
1,4
0 0 γj
2,3 γj
2,4
γj
3,1 γj
3,2 0 0
γj
4,1 γj
4,2 0 0





=





0 0 γj
1,3 γj
1,4
0 0 γj
2,3 γj
2,4
−(γj
1,3)∗ −(γj
2,3)∗ 0 0
−(γj
1,4)∗ −(γj
2,4)∗ 0 0





=
ˆ0 ˆσj
−(ˆσj)† ˆ0
. (5.95)
Now we use Eqs. 5.84 and 5.85 to ﬁnd the actual forms of three ˆσj (and consequently ˆγj) matrices for j > 0.
Eq. 5.84 requires
ˆ0 ˆσj
−(ˆσj)† ˆ0
·
ˆ0 ˆσj
−(ˆσj)† ˆ0
=
−ˆσj · (ˆσj)† ˆ0
ˆ0 −(ˆσj)† · ˆσj = −
ˆ1 ˆ0
ˆ0 ˆ1
=
−ˆ1 ˆ0
ˆ0 −ˆ1
, (5.96)
i.e.,
ˆσj
· (ˆσj
)†
= (ˆσj
)†
· ˆσj
= ˆ1 (5.97)
Eq. 5.97 is obviously true if the ˆσj matrices are Hermitian (ˆσj = (ˆσj)†), i.e. σj
m,n = (σj
n,m)∗. It implies that the ˆσj matrices have
the following form:
ˆσj
=
aj cj
c∗
j bj
, (5.98)
where aj and bj are real, and cj is complex. Eq. 5.97 can be then written as
ˆσj
· (ˆσj
)†
= ˆσj
· ˆσj
=
aj cj
c∗
j bj
·
aj cj
c∗
j bj
=
a2
j + |cj|2 (aj + bj)cj
(aj + bj)c∗
j b2
j + |cj|2 =
1 0
0 1
. (5.99)
The oﬀ-diagonal terms of the product matrix must be equal to zero, which is true if aj = −bj or |cj| = 0. In the former case, matrices
ˆσj can be written as
ˆσj
=
1 − |cj|2 cj
c∗
j − 1 − |cj|2 , (5.100)
in the latter case, there are only two possibilities how to construct the ˆσj matrix:
ˆσj
=
1 0
0 1
or ˆσj
=
1 0
0 −1
(5.101)
(note that |cj|2 = 0 ⇒ a2
j = b2
j = 1.) Eq. 5.85 shows that the second option is correct. Eq. 5.85 requires
ˆ0 ˆσj
−(ˆσj)† ˆ0
·
ˆ0 ˆσk
−(ˆσk)† ˆ0
+
ˆ0 ˆσk
−(ˆσk)† ˆ0
·
ˆ0 ˆσj
−(ˆσj)† ˆ0
= −
ˆσj · (ˆσk)† + ˆσk · (ˆσj)† ˆ0
ˆ0 (ˆσj)† · ˆσk + (ˆσk)† · ˆσj =
ˆ0 ˆ0
ˆ0 ˆ0
, (5.102)
therefore no ˆσj can be a unit matrix.
As Eq. 5.101 unambiguously deﬁnes one sigma matrix (let us call it ˆσ3), the other two (ˆσ1 and ˆσ2) are given by Eq. 5.100. According
to Eq. 5.102, ˆσj · (ˆσk)† + ˆσk · (ˆσj)† = ˆσj ˆσk + ˆσk ˆσj = ˆ0 and consequently,
1 0
0 −1
·
1 − |cj|2 cj
c∗
j − 1 − |cj|2 +
1 − |cj|2 cj
c∗
j − 1 − |cj|2 ·
1 0
0 −1
=
2 1 − |cj|2 0
0 −2 1 − |cj|2
=
0 0
0 0
, (5.103)
showing that |cj|2 = 1 and the diagonal elements of ˆσ1 and ˆσ2 are equal to zero. Therefore, these equations can be written as
ˆσ1
=
0 eiφ1
e−iφ1 0
ˆσ2
=
0 eiφ2
e−iφ2 0
(5.104)
138
According to Eq. 5.102, ˆσj ˆσk + ˆσk ˆσj = ˆ0 and therefore
0 eiφ1
e−iφ1 0
·
0 eiφ2
e−iφ2 0
+
0 eiφ2
e−iφ2 0
·
0 eiφ1
e−iφ1 0
=
0 ei(φ1−φ2) + e−i(φ1−φ2)
e−i(φ1−φ2) + ei(φ1−φ2) 0
=
0 2 cos (φ1 − φ2)
2 cos (φ1 − φ2) 0
=
0 0
0 0
. (5.105)
The oﬀ-diagonal elements of the sum of the matrix products are equal to zero if the phases diﬀer by π/2. Choosing φ1 = 0, the set of
three sigma matrices is
ˆσ1
=
0 1
1 0
ˆσ2
=
0 −i
i 0
ˆσ3
=
1 0
0 −1
(5.106)
and the set of the four gamma matrices is
ˆγ0
=




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 ˆγ1
=




0 0 0 1
0 0 1 0
0 −1 0 0
−1 0 0 0



 ˆγ2
=




0 0 0 −i
0 0 i 0
0 i 0 0
−i 0 0 0



 ˆγ3
=




0 0 1 0
0 0 0 −1
−1 0 0 0
0 1 0 0



 . (5.107)
With the help of the ˆγj matrices, we can modify our deﬁnition of ˆO+ and ˆO− to get the correct operator ˆO2:



i
∂
∂t




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 + ic
∂
∂z




0 0 1 0
0 0 0 −1
−1 0 0 0
0 1 0 0



 + ic
∂
∂x




0 0 0 1
0 0 1 0
0 −1 0 0
−1 0 0 0



 + ic
∂
∂y




0 0 0 −i
0 0 i 0
0 i 0 0
−i 0 0 0




−m0c2




1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1












ψ1
ψ2
ψ3
ψ4



 = ˆO+
Ψ = 0,
(5.108)



−i
∂
∂t




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 − ic
∂
∂z




0 0 1 0
0 0 0 −1
−1 0 0 0
0 1 0 0



 − ic
∂
∂x




0 0 0 1
0 0 1 0
0 −1 0 0
−1 0 0 0



 − ic
∂
∂y




0 0 0 −i
0 0 i 0
0 i 0 0
−i 0 0 0




−m0c2




1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1












ψ1
ψ2
ψ3
ψ4



 = ˆO−
Ψ = 0.
(5.109)
5.7.5 Solution of the Dirac equation
Introducing matrices means that we do not have a single equation of motion, but a set of four equations for four coupled wave functions.
The complete wave function Ψ is therefore a vector consisting of four components. The operators ˆO+ and ˆO− consist of partial derivative
operators summarized in Eq. 5.61, and Eq. 5.61 also shows that a monochromatic plane wave ψ = e
i (pxx+pyy+pzz−Ett)
is an eigenfunction
of the partial derivative operators, with the eigenvalues equal to Et, px, py, pz. The wave ψ is also an eigenfunction of the operator ˆO2
described by Eq. 5.62
ˆO2
ψ = 2 ∂2
∂t2
− c2 2 ∂2
∂z2
− c2 2 ∂2
∂x2
− c2 2 ∂2
∂y2
+ (m0c2
)2
e
i (pxx+pyy+pzz−Est)
= (−E2
s + c2
p2
+ (m0c2
)2
)ψ, (5.110)
providing the expected eigenvalue −E2
s + c2p2 + (m0c2)2. The requirement of the special theory of relativity
− E2
s + c2
p2
+ (m0c2
)2
= 0 ⇒ E2
s = c2
p2
+ (m0c2
)2
(5.111)
allows for two possible values of energy Es, one with a positive sign and another one with a negative sign
Es = ± c2p2 + (m0c2)2) = ±Et. (5.112)
5.7. SUPPORTING INFORMATION 139
We have labeled the energy Es in order to reserve the symbol Et for always positive c2p2 + (m0c2)2).
We check if a vector consisting of functions ψ multiplied by diﬀerent coeﬃcients wj (i.e., ψj = wjψ) is a solution of the Dirac equation



i
∂
∂t




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 + ic
∂
∂z




0 0 1 0
0 0 0 −1
−1 0 0 0
0 1 0 0



 + ic
∂
∂x




0 0 0 1
0 0 1 0
0 −1 0 0
−1 0 0 0



 + ic
∂
∂y




0 0 0 −i
0 0 i 0
0 i 0 0
−i 0 0 0




−m0c2




1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1












w1ψ
w2ψ
w3ψ
w4ψ



 = ˆO+
Ψ = 0,
(5.113)



−i
∂
∂t




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 − ic
∂
∂z




0 0 1 0
0 0 0 −1
−1 0 0 0
0 1 0 0



 − ic
∂
∂x




0 0 0 1
0 0 1 0
0 −1 0 0
−1 0 0 0



 − ic
∂
∂y




0 0 0 −i
0 0 i 0
0 i 0 0
−i 0 0 0




−m0c2




1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1












w1ψ
w2ψ
w3ψ
w4ψ



 = ˆO−
Ψ = 0,
(5.114)
or shortly
i
∂
∂t
ˆγ0
+ ic
∂
∂x
ˆγ1
+ ic
∂
∂y
ˆγ2
+ ic
∂
∂z
ˆγ3
− m0c2ˆ1




w1ψ
w2ψ
w3ψ
w4ψ



 = ˆO+
Ψ = 0
(5.115)
−i
∂
∂t
ˆγ0
− ic
∂
∂x
ˆγ1
− ic
∂
∂y
ˆγ2
− ic
∂
∂z
ˆγ3
− m0c2ˆ1




w1ψ
w2ψ
w3ψ
w4ψ



 = ˆO−
Ψ = 0.
(5.116)
For our wavefunction,
ˆO+
Ψ =




(Es − m0c2)w1 +0 −cpzw3 −c(px − ipy)w4
0 +(Es − m0c2)w2 −c(px + ipy)w3 +cpzw4
cpzw1 +c(px − ipy)w2 −(Es + m0c2)w3 +0
c(px + ipy)w1 −cpzw2 +0 −(Es + m0c2)w4



 ψ = 0. (5.117)
The values of wj are partially restrained by Eq. 5.117, but partially depend on the choice of the coordinate frame. One set of solution
is such that one of the coeﬃcients wj is set to zero and another one to a normalization constant N in each solution. The normalization
constant can be determined as discussed in Section 4.9.5 by requiring
∞ˆ
−∞
∞ˆ
−∞
∞ˆ
−∞
w∗
1ψ∗ w∗
2ψ∗ w∗
3ψ∗ w∗
4ψ∗




w1ψ
w2ψ
w3ψ
w4ψ



 dxdydz = (w1w∗
1 + w2w∗
2 + w3w∗
3 + w4w∗
4)
∞ˆ
−∞
∞ˆ
−∞
∞ˆ
−∞
ψψ∗
dxdydz
h3
= 1, (5.118)
where we deﬁne the probability as a ”square” of the vector Ψ.
Let us ﬁnd the ﬁrst solution Ψ1 by setting w1,1 = N and w2,1 = 0. Eq. 5.117 then corresponds to a set of four equations
(Es − m0c2)N +0 −cpzw3,1 −c(px − ipy)w4,1 = 0,
0 +0 −c(px + ipy)w3,1 +cpzw4,1 = 0,
cpzN +0 −(Es + m0c2)w3,1 +0 = 0,
c(px + ipy)N −0 +0 −(Es + m0c2)w4,1 = 0.
(5.119)
140
The third and fourth equation immediately provides the value of w3,1 and w4,1, respectively:
w3,1 = N
cpz
Es + m0c2
w4,1 = N
c(px + ipy)
Es + m0c2
. (5.120)
The solution is
Ψ1 = N





ψ
0
cpz
Es+m0c2 ψ
c(px+ipy)
Es+m0c2 ψ





(5.121)
and the normalization condition is
N2
1 +
c2p2
z
(Es + m0c2)2
+
c2(p2
x + p2
y)
(Es + m0c2)2
∞ˆ
−∞
∞ˆ
−∞
∞ˆ
−∞
ψ∗
ψdxdydz = N2
1 +
c2p2
z
(Es + m0c2)2
+
c2(p2
x + p2
y)
(Es + m0c2)2
h3
= N2 Es + m0c2 2
+ c2p2
(Es + m0c2)2
h3
= N2 E2
s + 2Esm0c2 + m2
0c4 + c2p2
(Es + m0c2)2
h3
= N2 2E2
s + 2Esm0c2
(Es + m0c2)2
h3
= N2 2Es
Es + m0c2
h3
= 1.
(5.122)
Therefore,
N =
Et + m0c2
2Eth3
(5.123)
and
Ψ1 =
Et + m0c2
2Eth3





1
0
cpz
Et+m0c2
c(px+ipy)
Et+m0c2





e
i (pxx+pyy+pzz−Ett)
. (5.124)
Here, the energy Es is a positive number Es = Et = + p2c2 + m2
0c4 (otherwise N → ∞ for slow motions, p → 0).
The second solution Ψ2 is obtained by setting w2,2 = N and w1,2 = 0
0 +0 −cpzw3,2 −c(px − ipy)w4,2 = 0,
0 +(Es − m0c2)N −c(px + ipy)w3,2 +cpzw4,2 = 0,
0 +c(px − ipy)N −(Es + m0c2)w3,2 +0 = 0,
0 −cpzN +0 −(Es + m0c2)w4,2 = 0,
(5.125)
w3,2 = N
c(px + ipy)
Es + m0c2
w4,2 = N
−cpz
Es + m0c2
, (5.126)
giving the same normalization condition. Therefore,
Ψ2 =
Et + m0c2
2Eth3





0
1
c(px+ipy)
Et+m0c2
−cpz
Et+m0c2





e
i (pxx+pyy+pzz−Ett)
(5.127)
with Es = Et = + p2c2 + m2
0c4.
The third solution Ψ3 is obtained by setting w3,3 = N and w4,3 = 0
(Es − m0c2)w1,3 +0 −cpzN +0 = 0,
0 +(Es − m0c2)w2,3 −c(px + ipy)N +0 = 0,
cpzw1,3 +c(px − ipy)w2,3 −(Es + m0c2)N +0 = 0,
c(px + ipy)w1,3 −cpzw2,3 +0 +0 = 0.
(5.128)
The values of w1,3 and w2,3 are calculated from the ﬁrst two equations. Keeping the same sign of the mass term,
w1,3 = N
−c(px + ipy)
−Es + m0c2
w2,3 = N
−cpz
−Es + m0c2
, (5.129)
Now the normalization condition is
5.7. SUPPORTING INFORMATION 141
N2
1 +
c2p2
z
(−Es + m0c2)2
+
c2(p2
x + p2
y)
(−Es + m0c2)2
∞ˆ
−∞
∞ˆ
−∞
∞ˆ
−∞
ψ∗
ψdxdydz = N2
1 +
c2p2
z
(−Es + m0c2)2
+
c2(p2
x + p2
y)
(−Es + m0c2)2
h3
= N2 −Es + m0c2 2
+ c2p2
(−Es + m0c2)2
h3
= N2 E2
s − 2Esm0c2 + m2
0c4 + c2p2
(−Es + m0c2)2
h3
= N2 2E2
s − 2Esm0c2
(−Es + m0c2)2
h3
= N2 −2Es
−Es + m0c2
h3
= 1.
(5.130)
Therefore,
N =
−Es + m0c2
−2Esh3
(5.131)
and
Ψ3 =
−Es + m0c2
−2Esh3





−cpz
−Es+m0c2
−c(px+ipy)
−Es+m0c2
1
0





e
i (pxx+pyy+pzz−Est)
=
Et + m0c2
2Eth3





−cpz
Et+m0c2
−c(px+ipy)
Et+m0c2
1
0





e
i (pxx+pyy+pzz+Ett)
. (5.132)
Here, the energy Es is a negative number Es = −Et = − p2c2 + m2
0c4 (otherwise N → ∞ for slow motions, p → 0). The fourth
solution Ψ4 is obtained by setting w4,4 = N and w3,4 = 0
(Es − m0c2)w1,4 +0 +0 −c(px − ipy)N = 0,
0 +(Es − m0c2)w2,4 +0 +cpzN = 0,
cpzw1,4 +c(px − ipy)w2,4 +0 +0 = 0,
c(px + ipy)w1,4 −cpzw2,4 +0 −(Es + m0c2)N = 0
(5.133)
w1,4 = N
−c(px + ipy)
−Es + m0c2
w2,4 = N
cpz
−Es + m0c2
, (5.134)
with the same normalization condition as the third solution. Therefore,
Ψ4 =
−Es + m0c2
−2Esh3





−c(px+ipy)
−Es+m0c2
cpz
−Es+m0c2
0
1





e
i (pxx+pyy+pzz−Est)
(5.135)
with Es = −Et = − p2c2 + m2
0c4.
The negative energy has some strange implications. For example, the plane wave ψ should propagate with the rate c. The value of c
is given by c = λ/T = Et/|p| and the direction should be given by the momentum vector p
c =
Es
|p|
p
|p|
=
Esp
p2
. (5.136)
Changing the sign of Es also changes direction of motion. However, note that opposite directions of the vectors of momentum give
the same p2:
p2
= p · p p2
= −p · (−p). (5.137)
We can therefore use −p in the wavefunction with the negative energy Es = −Et. Such wavefunction is complex conjugate of ψ
describing the ﬁrst two states
e
i (−pxx−pyy−pzz−Est)
= e
i (−pxx−pyy−pzz+Ett)
= e− i (pxx+pyy+pzz−Ett)
= ψ∗
. (5.138)
and it is also an eigenfunction of ˆO2
ˆO2
ψ∗
= 2 ∂2
∂t2
− c2 2 ∂2
∂z2
− c2 2 ∂2
∂x2
− c2 2 ∂2
∂y2
+ (m0c2
)2
e− i (pxx+pyy+pzz−Ett)
= (−E2
t + c2
p2
+ (m0c2
)2
)ψ∗
. (5.139)
142
When we change the signs of px, py, and pz in the coeﬃcients wj,3 and wj,4, we can express the solutions of the Dirac equation in a
general form
Ψ =




u1ψ
u2ψ
v1ψ∗
v2ψ∗



 , (5.140)
where
Ψ1 = N





ψ
0
cpz
Et+m0c2 ψ∗
c(px+ipy)
Et+m0c2 ψ∗





, Ψ2 = N





0
ψ
c(px−ipy)
Et+m0c2 ψ∗
−cpz
Et+m0c2 ψ∗





, Ψ3 = N





cpz
Et+m0c2 ψ
c(px+ipy)
Et+m0c2 ψ
ψ∗
0





, Ψ4 = N





c(px−ipy)
Et+m0c2 ψ
−cpz
Et+m0c2 ψ
0
ψ∗





, (5.141)
N =
Et + m0c2
2Eth3
, (5.142)
and
ψ = e
i (pxx+pyy+pzz−Ett)
. (5.143)
5.7.6 Relation between Dirac and Schr¨odinger equations
How is the Dirac equation related to the Schr¨odinger equation? We came to the Schr¨odinger equation using the relation Ekin = p2/2m
(energy of a free particle, i.e., kinetic energy). Let us now try to relate the kinetic energy Ekin to the total energy Et of the relativistic
treatment. We know that mass contributes to Et as m0c2. We can assume that in the absence of an electromagnetic ﬁeld, the total energy
of a free particle consists of the mass contribution m0c2 (rest energy) and of Ekin. Therefore, the square of the total energy should be
E2
t = (m0c2
+ Ekin)2
= (m0c2
)2
+ 2Ekin(m0c2
) + E2
kin. (5.144)
As a next step, we divide the square of the total energy by the square of the mass contribution
E2
t
(m0c2)2
=
(m0c2)2 + 2Ekin(m0c2) + E2
kin
(m0c2)2
= 1 + 2
Ekin
m0c2
+
E2
kin
(m0c2)2
. (5.145)
If the speed of the particle is low, E2
kin (m0c2)2, and the last term can be neglected
E2
t
(m0c2)2
≈ 1 + 2
Ekin
m0c2
. (5.146)
But Eq. 5.59 also tells us that
E2
t = (m0c2
)2
+ c2
p2
⇒
E2
t
(m0c2)2
= 1 +
c2p2
(m0c2)2
. (5.147)
Comparison of the right-hand sides of Eqs. 5.146 and 5.147 shows that
2
Ekin
m0c2
=
c2p2
(m0c2)2
⇒ Ekin =
p2
2m0
. (5.148)
We see that the approximation for low speeds led us to the relation deﬁning the kinetic energy as in the Schr¨odinger equation.
Therefore, Schr¨odinger equation can be viewed as a low-speed limit of the Dirac equation. We use similar arguments more rigorously in
Section 5.7.7 when we relate Dirac and Schr¨odinger Hamiltonian in the presence of the electromagnetic ﬁeld.
5.7. SUPPORTING INFORMATION 143
5.7.7 Hamiltonian of spin magnetic moment
Our next goal is to ﬁnd Hamiltonian for a relativistic charged particle in a magnetic ﬁeld. When we compare the classical Hamiltonian of
a particle in an electromagnetic ﬁeld (Eq. 4.74) with the classical Hamiltonian of a free particle H = (p)2/(2m) outside the ﬁeld, we see
that the presence of an electromagnetic ﬁeld requires the following modiﬁcations:
H → H − QV p → p − QA, (5.149)
Accordingly, the operators of energy and momentum in the quantum description change to
i
∂
∂t
→ i
∂
∂t
− QV − i
∂
∂x
→ −i
∂
∂x
− QAx − i
∂
∂y
→ −i
∂
∂y
− QAy − i
∂
∂z
→ −i
∂
∂z
− QAz. (5.150)
This modiﬁes Eq. 5.87 to
i
∂
∂t
− QV ˆ1Ψ = −c i
∂
∂x
+ QAx ˆγ0
ˆγ1
− c i
∂
∂y
+ QAy ˆγ0
ˆγ2
− c i
∂
∂z
+ QAz ˆγ0
ˆγ3
+ m0c2
ˆγ0
Ψ, (5.151)
where ˆ1 is a 4 × 4 unit matrix. In order to obtain an expression comparable to Eq. 4.26 (nonrelativistic Schr¨odinger equation), we
apply the operator (i ∂/∂t − QV ) twice
i
∂
∂t
− QV i
∂
∂t
− QV ˆ1Ψ = i
∂
∂t
− QV
2
Ψ
= c2
i
∂
∂x
+ QAx
2
ˆγ0
ˆγ1
ˆγ0
ˆγ1
+ c2
i
∂
∂y
+ QAy
2
ˆγ0
ˆγ2
ˆγ0
ˆγ2
+ c2
i
∂
∂z
+ QAz
2
ˆγ0
ˆγ3
ˆγ0
ˆγ3
+ m2
0c4
ˆγ0
ˆγ0
Ψ
−m0c3
i
∂
∂x
+ QAx ˆγ0
ˆγ1
ˆγ0
+ i
∂
∂y
+ QAy ˆγ0
ˆγ2
ˆγ0
+ i
∂
∂z
+ QAz ˆγ0
ˆγ3
ˆγ0
Ψ
−m0c3
i
∂
∂x
+ QAx ˆγ0
ˆγ0
ˆγ1
+ i
∂
∂y
+ QAy ˆγ0
ˆγ0
ˆγ2
+ i
∂
∂z
+ QAz ˆγ0
ˆγ0
ˆγ3
Ψ
+c2
i
∂
∂x
+ QAx i
∂
∂y
+ QAy ˆγ0
ˆγ1
ˆγ0
ˆγ2
+ i
∂
∂y
+ QAy i
∂
∂x
+ QAx ˆγ0
ˆγ2
ˆγ0
ˆγ1
Ψ
+c2
i
∂
∂y
+ QAy i
∂
∂z
+ QAz ˆγ0
ˆγ2
ˆγ0
ˆγ3
+ i
∂
∂z
+ QAz i
∂
∂y
+ QAy ˆγ0
ˆγ3
ˆγ0
ˆγ2
Ψ
+c2
i
∂
∂z
+ QAz i
∂
∂x
+ QAx ˆγ0
ˆγ3
ˆγ0
ˆγ1
+ i
∂
∂x
+ QAx i
∂
∂z
+ QAz ˆγ0
ˆγ1
ˆγ0
ˆγ3
Ψ. (5.152)
We use the properties of the gamma matrices (Eqs. 5.83–5.85) to simplify the equation. In particular, we invert of the order of matrices
in the products
ˆγ0
ˆγj
ˆγ0
= −(ˆγ0
ˆγ0
)ˆγj
= −ˆγj
, (5.153)
ˆγ0
ˆγj
ˆγ0
ˆγj
= −(ˆγ0
ˆγ0
)(ˆγj
ˆγj
) = −(ˆ1)(−ˆ1) = ˆ1, (5.154)
ˆγ0
ˆγj
ˆγ0
ˆγk
= −(ˆγ0
ˆγ0
)(ˆγj
ˆγk
) = −(ˆ1)(ˆγj
ˆγk
) = −ˆγj
ˆγk
= ˆγk
ˆγj
(5.155)
and obtain
i
∂
∂t
− QV
2
ˆ1Ψ = c2
i
∂
∂x
+ QAx
2
ˆ1 + c2
i
∂
∂y
+ QAy
2
ˆ1 + c2
i
∂
∂z
+ QAz
2
ˆ1 + m2
0c4ˆ1 Ψ
+m0c3
i
∂
∂x
+ QAx ˆγ1
+ i
∂
∂y
+ QAy ˆγ2
+ i
∂
∂z
+ QAz ˆγ3
Ψ
−m0c3
i
∂
∂x
+ QAx ˆγ1
+ i
∂
∂y
+ QAy ˆγ2
+ i
∂
∂z
+ QAz ˆγ3
Ψ
−c2
i
∂
∂x
+ QAx i
∂
∂y
+ QAy − i
∂
∂y
+ QAy i
∂
∂x
+ QAx ˆγ1
ˆγ2
Ψ
−c2
i
∂
∂y
+ QAy i
∂
∂z
+ QAz − i
∂
∂z
+ QAz i
∂
∂y
+ QAy ˆγ2
ˆγ3
Ψ
−c2
i
∂
∂z
+ QAz i
∂
∂x
+ QAx − i
∂
∂x
+ QAx i
∂
∂z
+ QAz ˆγ3
ˆγ1
Ψ,
(5.156)
144
where the second line and the third line cancel each other. To proceed, we need to evaluate the products of operators on the last three
lines. Let us look at one of the lines more closely
− c2
i
∂
∂x
+ QAx i
∂
∂y
+ QAy − i
∂
∂y
+ QAy i
∂
∂x
+ QAx ˆγ1
ˆγ2
Ψ (5.157)
and analyze the operator part (green) and the wave function part (blue) separately. We start by the green operator (to emphasize
that we work with the operator, we apply it to some arbitrary function, labeled ψ). The green operator is composed of linear operators,
we have to apply them twice (we must be very careful with diﬀerentiation)
i
∂
∂x
+ QAx i
∂
∂y
+ QAy − i
∂
∂y
+ QAy i
∂
∂x
+ QAx ψ =
− 2 ∂
∂x
∂ψ
∂y
−
∂
∂y
∂ψ
∂x
+ Q2
(AxAy − AyAx)ψ + i Q
∂(Ayψ)
∂x
+ Ax
∂ψ
∂y
−
∂(Axψ)
∂y
− Ay
∂ψ
∂x
. (5.158)
The ﬁrst two terms on the second line cancel each other because ∂2ψ/∂x∂y = ∂2ψ/∂y∂x and AxAy = AyAx (Ax, Ay are numbers,
not operators). Then we apply the chain rule to calculate the partial derivatives of Axψ and Ayψ:
− 2 ∂
∂x
∂ψ
∂y
−
∂
∂y
∂ψ
∂x
+ Q2
(AxAy − AyAx)ψ + i Q
∂(Ayψ)
∂x
+ Ax
∂ψ
∂y
−
∂(Axψ)
∂y
− Ay
∂ψ
∂x
=
i Q
∂(Ayψ)
∂x
+ Ax
∂ψ
∂y
−
∂(Axψ)
∂y
− Ay
∂ψ
∂x
= i Q
∂Ay
∂x
ψ + Ay
∂ψ
∂x
+ Ax
∂ψ
∂y
−
∂Ax
∂y
ψ − Ax
∂ψ
∂y
− Ay
∂ψ
∂x
= i Q
∂Ay
∂x
−
∂Ax
∂y
ψ.
(5.159)
Note that the resulting diﬀerence of partial derivatives in the parentheses is nothing else but the z component of the rotation (formally
a vector product) of the deﬁnition of B in Eq. 4.58. Therefore, we can write
− 2 ∂
∂x
∂ψ
∂y
−
∂
∂y
∂ψ
∂x
+ Q2
(AxAy − AyAx)ψ + i Q
∂(Ayψ)
∂x
+ Ax
∂ψ
∂y
−
∂(Axψ)
∂y
− Ay
∂ψ
∂x
= i Q
∂Ay
∂x
−
∂Ax
∂y
ψ = i QBzψ.
(5.160)
The combinations on the last two lines of Eq. 5.156 are obtained in the same manner.
In addition to the combinations of the operators evaluated above, the last three lines of Eq. 5.156 also contain the products ˆγ1ˆγ2,
ˆγ2ˆγ3, and ˆγ3ˆγ1. They can be calculated from Eq. 5.95
ˆγ1
ˆγ2
=
ˆ0 ˆσ1
−ˆσ1 ˆ0
ˆ0 ˆσ2
−ˆσ2 ˆ0
= −
ˆσ1 ˆσ2 ˆ0
ˆ0 ˆσ1 ˆσ2 = −i
ˆσ3 ˆ0
ˆ0 ˆσ3 , (5.161)
ˆγ2
ˆγ3
=
ˆ0 ˆσ2
−ˆσ2 ˆ0
ˆ0 ˆσ3
−ˆσ3 ˆ0
= −
ˆσ2 ˆσ3 ˆ0
ˆ0 ˆσ2 ˆσ3 = −i
ˆσ1 ˆ0
ˆ0 ˆσ1 , (5.162)
ˆγ3
ˆγ1
=
ˆ0 ˆσ3
−ˆσ3 ˆ0
ˆ0 ˆσ1
−ˆσ1 ˆ0
= −
ˆσ3 ˆσ1 ˆ0
ˆ0 ˆσ3 ˆσ1 = −i
ˆσ2 ˆ0
ˆ0 ˆσ2 , (5.163)
where the following important properties of the ˆσj matrices were used in the lasts steps:
ˆσ1
ˆσ2
=
0 1
1 0
0 −i
i 0
=
i 0
0 −i
= iˆσ3
(5.164)
ˆσ2
ˆσ3
=
0 −i
i 0
1 0
0 −1
=
0 i
i 0
= iˆσ1
(5.165)
ˆσ3
ˆσ1
=
1 0
0 −1
0 1
1 0
=
0 1
−1 0
= iˆσ2
. (5.166)
Note that we have written the 4 × 4 matrices ˆγj ˆγk in a block-diagonal form, using 2 × 2 matrices ˆσl and ˆ0.
After inserting everything into Eq. 5.156, we get
i
∂
∂t
− QV
2 ˆ1 ˆ0
ˆ0 ˆ1
Ψ = c2
i
∂
∂x
+ QAx
2
+ c2
i
∂
∂y
+ QAy
2
+ c2
i
∂
∂z
+ QAz
2
+ m2
0c4
ˆ1 ˆ0
ˆ0 ˆ1
Ψ
− c2
Q Bx
ˆσ1 ˆ0
ˆ0 ˆσ1 + By
ˆσ2 ˆ0
ˆ0 ˆσ2 + Bz
ˆσ3 ˆ0
ˆ0 ˆσ3 Ψ. (5.167)
5.7. SUPPORTING INFORMATION 145
To emphasize the block-diagonal form of the equation, we use 2 × 2 matrices ˆ1 (unit matrix) and ˆ0 (zero matrix) to write the 4 × 4
unit matrices on the ﬁrst line (note that the same symbol ˆ1 represents a 4 × 4 matrix above and a 2 × 2 matrix here and below).
Now we have a relativistic equation describing our particle in an electromagnetic ﬁeld. Let us now separate the mass contribution to
the energy from the operator i ∂/∂t and let us call the diﬀerence ˆH (it becomes clear soon why we choose the same symbol as the symbol
used for the Hamiltonian in the Schr¨odinger equation):
ˆH = i
∂
∂t
− m0c2
, (5.168)
Eq. 5.167 can be rewritten as
ˆH + m0c2
− QV
2 ˆ1 ˆ0
ˆ0 ˆ1
Ψ = ( ˆH − QV )2
+ 2m0c2
( ˆH − QV ) + m2
0c4
ˆ1 ˆ0
ˆ0 ˆ1
Ψ =
c2
i
∂
∂x
+ QAx
2
+ c2
i
∂
∂y
+ QAy
2
+ c2
i
∂
∂z
+ QAz
2
+ m2
0c4
ˆ1 ˆ0
ˆ0 ˆ1
Ψ
− c2
Q Bx
ˆσ1 ˆ0
ˆ0 ˆσ1 + By
ˆσ2 ˆ0
ˆ0 ˆσ2 + Bz
ˆσ3 ˆ0
ˆ0 ˆσ3 Ψ, (5.169)
where the two red terms m2
0c4 cancel each other. Dividing both sides of the equation by 2m0c2 gives
( ˆH − QV )2
2m0c2
+ ˆH − QV
ˆ1 ˆ0
ˆ0 ˆ1
Ψ =
1
2m0
i
∂
∂x
+ QAx
2
+ i
∂
∂y
+ QAy
2
+ i
∂
∂z
+ QAz
2 ˆ1 ˆ0
ˆ0 ˆ1
Ψ
−
Q
2m0
Bx
ˆσ1 ˆ0
ˆ0 ˆσ1 + By
ˆσ2 ˆ0
ˆ0 ˆσ2 + Bz
ˆσ3 ˆ0
ˆ0 ˆσ3 Ψ. (5.170)
Note that the rest energy of particles m0c2 is huge. Unless the eigenvalue of ˆH is very large (which is not expected in a standard NMR
experiment), the ﬁrst term with m0c2 in the denominator can be safely neglected. For the same reason, the factors ±cpz/(Et + m0c2) and
c(px ± ipy)/(Et + m0c2) in Eq. 5.141 are close to zero for v c.
The derived matrix equation represent a set of four equations for four unknowns. The block-diagonal form of all matrices reveals that
the ﬁrst two equations and the last two equations can be solved separately. Therefore, we obtain identical sets of two equations describing
particles and antiparticles:
ˆH
u1ψ
u2ψ
≈
1
2m0
i
∂
∂x
+ QAx
2
+ i
∂
∂y
+ QAy
2
+ i
∂
∂z
+ QAz
2
+ QV ˆ1 −
Q
2m0
Bx ˆσ1
+ By ˆσ2
+ Bz ˆσ3 u1ψ
u2ψ
,
(5.171)
ˆH
v1ψ∗
v2ψ∗ ≈
1
2m0
i
∂
∂x
+ QAx
2
+ i
∂
∂y
+ QAy
2
+ i
∂
∂z
+ QAz
2
+ QV ˆ1 −
Q
2m0
Bx ˆσ1
+ By ˆσ2
+ Bz ˆσ3 v1ψ∗
v2ψ∗ ,
(5.172)
where we described the wave functions using the notation introduced in Eq. 5.140. In both matrix equations, the terms multiplied
by ˆ1 constitute the Hamiltonian of the non-relativistic Schr¨odinger equation (Eq. 4.26), and the terms with the ˆσj matrices appear only in
our relativistic equations.
5.7.8 Spin magnetogyric ratio
The value of the magnetogyric ratio for the spin magnetic moment can be derived by inserting the expressions deﬁning operators of spin
magnetic moment components (Eqs. 5.5–5.7) into the commutation relation (Eq. 5.9), e.g.
ˆIx
ˆIy − ˆIy
ˆIx =
ˆµx
γ
ˆµy
γ
−
ˆµy
γ
ˆµx
γ
=
1
γ2
Q
2m0
2
0 1
1 0
0 −i
i 0
−
0 −i
i 0
0 1
1 0
=
1
γ2
Q
2m0
2
i 0
0 −i
−
−i 0
0 i
= i
2
γ2
Q
2m0
2
1 0
0 −1
(5.173)
The commutation relation Eq. 5.9 requires that
ˆIx
ˆIy − ˆIy
ˆIx = i
2
γ2
Q
2m0
2
1 0
0 −1
= i ˆIz = i
γ
Q
2m0
1 0
0 −1
⇒
2
γ
Q
2m0
a = 1 ⇒ γ = 2
Q
2m0
. (5.174)
146
5.7.9 The factor of one half in the eigenvalues of ˆIz
The eigenvalues ± /2 are closely related to the fact that spin is a relativistic eﬀect. Special relativity requires that the Dirac equation
must not change if we rotate the coordinate frame or if it moves with a constant speed (Lorentz transformation). This is true in general,
but for the sake of simplicity, we just check rotation about the z axis.
We start by writing explicitly the Dirac equation as a set of four equations9
i
∂(u1ψ)
∂t
= −ic
∂(v1ψ∗)
∂z
− ic
∂(v2ψ∗)
∂x
+ ic
∂(iv2ψ∗)
∂y
+ m0c2
u1ψ, (5.175)
i
∂(u2ψ)
∂t
= +ic
∂(v2ψ∗)
∂z
− ic
∂(v1ψ∗)
∂x
− ic
∂(iv1ψ∗)
∂y
+ m0c2
u2ψ, (5.176)
i
∂(v1ψ∗)
∂t
= −ic
∂(u1ψ)
∂z
− ic
∂(u2ψ)
∂x
+ ic
∂(iu2ψ)
∂y
− m0c2
v1ψ∗
, (5.177)
i
∂(v2ψ∗)
∂t
= +ic
∂(u2ψ)
∂z
− ic
∂(u1ψ)
∂x
− ic
∂(iu1ψ)
∂y
− m0c2
v2ψ∗
. (5.178)
Let us assume that we have an original coordinate frame t, x, y, z and a rotated frame t , x , y , z . If we rotate about z by an angle ϕ,
t = t (5.179)
z = z (5.180)
x = (cos ϕ)x − (sin ϕ)y (5.181)
y = (sin ϕ)x + (cos ϕ)y (5.182)
and
∂f
∂t
=
∂f
∂t
(5.183)
∂f
∂z
=
∂f
∂z
(5.184)
∂f
∂x
=
∂x
∂x
∂f
∂x
+
∂y
∂x
∂f
∂y
= cos ϕ
∂f
∂x
+ sin ϕ
∂f
∂y
(5.185)
∂f
∂y
=
∂x
∂y
∂f
∂x
+
∂y
∂y
∂f
∂y
= − sin ϕ
∂f
∂x
+ cos ϕ
∂f
∂y
(5.186)
and consequently
∂f
∂x
+ i
∂f
∂y
= e−iϕ ∂f
∂x
+ i
∂f
∂y
, (5.187)
∂f
∂x
− i
∂f
∂y
= eiϕ ∂f
∂x
− i
∂f
∂y
. (5.188)
We also need to transform the wavefunction Ψ to the rotated frame. We already know (Eqs. 4.160 and 4.161) that rotation of
a complex function f by an angle φ can be written as f = feiφ. Let us assume that each of component of Ψ rotates by some angle
(ϕ1, ϕ2, ϕ3, ϕ4,) – the key step of our analysis will be to relate values of these angles the actual angle of rotating the coordinate frames ϕ.
Now we have everything that we need to write the set of Eqs. 5.175–5.178 in the rotated coordinate frame:
i
∂(eiϕ1 u1ψ )
∂t
= −ic
∂(eiϕ3 v1ψ ∗
)
∂z
− ic
∂(ei(ϕ4+ϕ)v2ψ ∗
)
∂x
+ ic
∂(iei(ϕ4+ϕ)v2ψ ∗
)
∂y
+ m0c2
eiϕ1 u1ψ , (5.189)
i
∂(eiϕ2 u2ψ )
∂t
= +ic
∂(eiϕ4 v2ψ ∗
)
∂z
− ic
∂(ei(ϕ3−ϕ)v1ψ ∗
)
∂x
− ic
∂(iei(ϕ3−ϕ)v1ψ ∗
)
∂y
+ m0c2
eiϕ2 u2ψ , (5.190)
i
∂(eiϕ3 v1ψ ∗
)
∂t
= −ic
∂(eiϕ1 u1ψ )
∂z
− ic
∂(ei(ϕ2+ϕ)u2ψ )
∂x
+ ic
∂(iei(ϕ2+ϕ)u2ψ )
∂y
− m0c2
eiϕ3 v1ψ
∗
, (5.191)
i
∂(eiϕ4 v2ψ ∗
)
∂t
= +ic
∂(eiϕ2 u2ψ )
∂z
− ic
∂(ei(ϕ1−ϕ)u1ψ )
∂x
− ic
∂(iei(ϕ1−ϕ)u1ψ )
∂y
− m0c2
eiϕ4 v2ψ
∗
. (5.192)
9
Note that we use the form of the Dirac equation which directly deﬁnes the relativistic Hamiltonian (Eq. 5.87).
5.7. SUPPORTING INFORMATION 147
According to the ﬁrst postulate of the special theory of relativity, Eqs. 5.189–5.192 must have the same form as Eqs. 5.175–5.178.
In other words, we must eliminate the complex exponential expressions from Eqs. 5.189–5.192. Let us ﬁrst multiply both sides of the
ﬁrst equation by e−iϕ1 , both sides of the second equation by e−iϕ2 , both sides of the third equation by e−iϕ3 , and both sides of the last
equation by e−iϕ4 :
i
∂(u1ψ )
∂t
= −ic
∂(ei(ϕ3−ϕ1)v1ψ ∗
)
∂z
− ic
∂(ei(ϕ4−ϕ1+ϕ)v2ψ ∗
)
∂x
+ ic
∂(iei(ϕ4−ϕ1+ϕ)v2ψ ∗
)
∂y
+ m0c2
u1ψ , (5.193)
i
∂(u2ψ )
∂t
= +ic
∂(ei(ϕ4−ϕ2)v2ψ ∗
)
∂z
− ic
∂(ei(ϕ3−ϕ2−ϕ)v1ψ ∗
)
∂x
− ic
∂(iei(ϕ3−ϕ2−ϕ)v1ψ ∗
)
∂y
+ m0c2
u2ψ , (5.194)
i
∂(v1ψ ∗
)
∂t
= −ic
∂(ei(ϕ1−ϕ3)u1ψ )
∂z
− ic
∂(ei(ϕ2−ϕ3+ϕ)u2ψ )
∂x
+ ic
∂(iei(ϕ2−ϕ3+ϕ)u2ψ )
∂y
− m0c2
v1ψ
∗
, (5.195)
i
∂(v2ψ ∗
)
∂t
= +ic
∂(ei(ϕ2−ϕ4)u2ψ )
∂z
− ic
∂(ei(ϕ1−ϕ4−ϕ)u1ψ )
∂x
− ic
∂(iei(ϕ1−ϕ4−ϕ)u1ψ )
∂y
− m0c2
v2ψ
∗
. (5.196)
This cleared the t and m0 terms. The exponential expressions disappear from the z term if ϕ1 = ϕ3 and ϕ2 = ϕ4 (i.e., if the rotation
of u1ψ and v1ψ∗ is identical and the same applies to u2ψ and v2ψ∗). In order to ﬁx the x and y terms, we assume that ϕ1 = −ϕ2 and
ϕ3 = −ϕ4, i.e., that the rotation of u1ψ and u2ψ is opposite and the same applies to v1ψ∗ and v2ψ∗. This implies that u1ψ and u2ψ
describe states with opposite spins (and v1ψ∗ and v2ψ∗ too). Then, u1ψ and v1ψ ∗
in the x and y terms are multiplied by ei(2ϕ1−ϕ),
and u2ψ and v2ψ ∗
in the x and y terms are multiplied by e−i(2ϕ1−ϕ). In both cases, the exponential expressions disappear (are equal
to one) if ϕ1 = ϕ/2. What does it mean? If we rotate the coordinate system by a certain angle, the components of the wavefunction rotate
only by half of this angle! The function describing rotation of the wavefunction about z has the form
Rj = ei
Iz,j ϕ
2 . (5.197)
This looks very similar to Eq. 4.145, but with one important diﬀerence: rotation by 2π (360 ◦) does not give the same eigenfunction
Rj as no rotation (ϕ = 0), but changes its sign. Only rotation by 4π (720 ◦) reverts the system to the initial state!
Eq. 4.145 tells us that the eigenvalues of the operator of the spin angular momentum are half-integer multiples of :
Iz,1 =
2
Iz,2 = −
2
. (5.198)
5.7.10 Eigenfunctions of ˆIx and ˆIy
Eigenfunctions of ˆIx are the following linear combinations of |α and |β :
1
√
2
|α +
1
√
2
|β =
1
√
2
1
1
≡ | → , (5.199)
−
i
√
2
|α +
i
√
2
|β =
1
√
2
−i
i
≡ | ← , (5.200)
or these linear combinations multiplied by a phase factor eiφ. E.g., state vectors multiplied by eiπ/2 = i are
| → = eiπ/2 1
√
2
1
1
= i
1
√
2
1
1
=
1
√
2
i
i
, | ← = eiπ/2 1
√
2
−i
i
= i
1
√
2
−i
i
=
1
√
2
1
−1
. (5.201)
Eigenvalues are again /2 and − /2:
ˆIx| → = +
2
| →
2
0 1
1 0
1
√
2
1
1
= +
2
·
1
√
2
1
1
, (5.202)
ˆIx| ← = +
2
| ←
2
0 1
1 0
1
√
2
−i
i
= −
2
·
1
√
2
−i
i
. (5.203)
Eigenfunctions of ˆIy are the following linear combinations of |α and |β :
1 − i
2
|α +
1 + i
2
|β =
1
2
1 − i
1 + i
≡ |⊗ , (5.204)
−
1 + i
2
|α +
1 − i
2
|β =
1
2
1 + i
1 − i
≡ | , (5.205)
148
or these linear combinations multiplied by a phase factor eiφ. E.g., state vectors multiplied by eiπ/4 = (1 + i)/
√
2 are
|⊗ = eiπ/4 1
2
1 − i
1 + i
=
1 + i
√
2
1
2
1 − i
1 + i
=
1
√
2
1
i
, | = eiπ/4 1
2
1 + i
1 − i
=
1 + i
√
2
1
2
1 + i
1 − i
=
1
√
2
i
1
. (5.206)
Eigenvalues are again /2 and − /2:
ˆIy|⊗ = +
2
|⊗
2
0 −i
i 0
1
2
1 − i
1 + i
= +
2
·
1
2
1 − i
1 + i
, (5.207)
ˆIy| = −
2
|
2
0 −i
i 0
1
2
1 + i
1 − i
= −
2
·
1
2
1 + i
1 − i
. (5.208)
An operator representing angular momentum pointing in a general direction, described by angles ϑ (inclination) and ϕ (azimuth) can
be written as
ˆIz cos ϑ + ˆIx sin ϑ cos ϕ + ˆIy sin ϑ sin ϕ. (5.209)
Its eigenvalue are again /2 and − /2 and its eigenfunctions are
|ϑ, ϕ =
cos ϑ
2
e−i ϕ
2
sin ϑ
2
e+i ϕ
2
, |ϑ + π, ϕ =
− sin ϑ
2
e−i ϕ
2
cos ϑ
2
e+i ϕ
2
(5.210)
or the vectors described by Eq. 5.210 multiplied by a phase factor eiφ, e.g.
|ϑ, ϕ =
cos ϑ
2
sin ϑ
2
eiϕ , |ϑ + π, ϕ =
− sin ϑ
2
cos ϑ
2
eiϕ . (5.211)
5.7.11 Stationary states and energy level diagram
In the presence of a homogeneous magnetic ﬁeld B0 = (0, 0, B0), the evolution of the system is given by the Hamiltonian ˆH = −γB0
ˆIz.
The Schr¨odinger equation is then
i
∂
∂t
cα
cβ
= −γB0
2
1 0
0 −1
cα
cβ
, (5.212)
which is a set of two equations with separated variables
dcα
dt
= +i
γB0
2
cα, (5.213)
dcβ
dt
= −i
γB0
2
cβ, (5.214)
with the solution
cα = cα(t = 0)e+i
γB0
2
t
= cα(t = 0)e−i
ω0
2
t
, (5.215)
cβ = cβ(t = 0)e−i
γB0
2
t
= cβ(t = 0)e+i
ω0
2
t
. (5.216)
If the initial state is |α , cα(t = 0) = 1, cβ(t = 0) = 0, and
cα = e−i
ω0
2
t
, (5.217)
cβ = 0. (5.218)
Note that the evolution changes only the phase factor, but the system stays in state |α (all vectors described by Eq. 5.16 correspond
to state |α ). It can be shown by calculating the probability that the system is in the |α or |β state.
Pα = c∗
αcα = e+i
ω0
2
t
e−i
ω0
2
t
= 1, (5.219)
Pβ = c∗
βcβ = 0. (5.220)
5.7. SUPPORTING INFORMATION 149
If the initial state is |β , cα(t = 0) = 0, cβ(t = 0) = 1, and
cα = 0, (5.221)
cβ = e+i
ω0
2
t
. (5.222)
Again, the evolution changes only the phase factor, but the system stays in state |β . The probability that the system is in the |α or
|β state is
Pα = c∗
αcα = 0, (5.223)
Pβ = c∗
βcβ = e−i
ω0
2
t
e+i
ω0
2
t
= 1. (5.224)
Let us summarize results of our analysis. If evolving wave functions are eigenfunctions of the Hamiltonian describing the evolution:
• The probability of ﬁnding the system in a given state do not change the state is stationary.
• Only the phase factors of the coeﬃcients constituting the state vector change with a frequencies equal to the eigenvalues of the
Hamiltonian divided by (Eqs. 5.215 and 5.216).
• The eigenvalues of the Hamiltonian represent energies of the individual eigenstates. Such energies can be plotted as the energy
level diagram.
5.7.12 Oscillatory states
We now analyze evolution of states described by other wave functions that eigenfunctions of the Hamiltonian. We can continue the
discussion of the previous section (evolution of evolution of |α and |β due to ˆH = −γB0
ˆIz) and change either the wave function or the
Hamiltonian. We start by the latter option, which is easier.
In the presence of a homogeneous magnetic ﬁeld B1 = (B1, 0, 0), the evolution of the system is given by the Hamiltonian ˆH = −γB0
ˆIx.
The Schr¨odinger equation is then
i
∂
∂t
cα
cβ
= −γB1
2
0 1
1 0
cα
cβ
, (5.225)
which is a set of two equations
dcα
dt
= i
γB1
2
cβ, (5.226)
dcβ
dt
= i
γB1
2
cα. (5.227)
These equations have similar structure as Eqs. 4.151 and 4.152. Adding and subtracting them leads to the solution
cα + cβ = C+e+i
γB1
2
t
= C+e−i
ω1
2
t
, (5.228)
cα − cβ = C−e−i
γB1
2
t
= C−e+i
ω1
2
t
. (5.229)
If the initial state is |α , cα(t = 0) = 1, cβ(t = 0) = 0, C+ = C− = 1, and
cα = cos
ω1
2
t , (5.230)
cβ = −i sin
ω1
2
t . (5.231)
Probability that the system is in the |α or |β state is calculated as
Pα = c∗
αcα = cos2 ω1
2
t =
1
2
+
1
2
cos(ω1t), (5.232)
Pβ = c∗
βcβ = sin2 ω1
2
t =
1
2
−
1
2
cos(ω1t). (5.233)
150
If the initial state is |β , cα(t = 0) = 0, cβ(t = 0) = 1, C+ = 1, C− = −1, and
cα = −i sin
ω1
2
t , (5.234)
cβ = cos
ω1
2
t . (5.235)
Probability that the system is in the |α or |β state is calculated as
Pα = c∗
αcα = sin2 ω1
2
t =
1
2
−
1
2
cos(ω1t), (5.236)
Pβ = c∗
βcβ = cos2 ω1
2
t =
1
2
+
1
2
cos(ω1t). (5.237)
In both cases, the system oscillates between the |α and |β states.
Now we return to the Hamiltonian of the vertical ﬁeld ˆH = −γB0
ˆIz, but analyze the evolution of superposition states called | → and
| ← in Section 5.7.10. The Schr¨odinger equation has in this case the same form as in Section 5.7.11 with the solution
cα = cα(t = 0)e+i
γB0
2
t
= cα(t = 0)e−i
ω0
2
t
, (5.238)
cβ = cβ(t = 0)e−i
γB0
2
t
= cβ(t = 0)e+i
ω0
2
t
. (5.239)
We are interested in evolution of a wave function that can be described as
|Ψ = c→| → + c←| ← . (5.240)
According to Eqs. 5.213 and 5.214,
c→ =
cα
√
2
+
cβ
√
2
(5.241)
c← = −i
cα
√
2
+ i
cβ
√
2
. (5.242)
If the initial state is | → , cα(t = 0) = 1/
√
2, cβ(t = 0) = 1/
√
2, and
c→ =
1
2
e−i
ω0
2
t
+
1
2
e+i
ω0
2
t
= cos
ω0
2
t (5.243)
c← = −
i
2
e−i
ω0
2
t
+
i
2
e+i
ω0
2
t
= − sin
ω0
2
t . (5.244)
Probability that the system is in the | → or | ← state is calculated as
P→ = c∗
→c→ = cos2 ω0
2
t =
1
2
+
1
2
cos(ω0t), (5.245)
P← = c∗
←c← = sin2 ω0
2
t =
1
2
−
1
2
cos(ω0t). (5.246)
5.7.13 Evolution in general alternating magnetic ﬁelds
Bloch and Siegert analyzed in Phys. Rev. 57 (1940) 522–527 a general case of evolution of spin states in a magnetic ﬁeld B whose x and y
component alternate with the frequency ωradio. The analysis is not simple and the main purpose of discussing it here is to explain why the
eﬀect of radio waves is usually described approximatively, assuming presence of rotating magnetic ﬁelds, instead of a much more realistic
description using ﬁelds oscillating in one direction.
The Schr¨odinger equation describing evolution in the general ﬁeld B = (Bx, By, Bz) is
i
∂ψ
∂t
= ˆHψ = −γBz
ˆIzψ − γBx
ˆIxψ − γBy
ˆIyψ (5.247)
5.7. SUPPORTING INFORMATION 151
i
∂
∂t
cα
cβ
= −γ
2
Bz
1 0
0 −1
cα
cβ
+ Bx
0 1
1 0
cα
cβ
+ iBx
0 −1
1 0
cα
cβ
. (5.248)
Written as a set of two equations,
dcα
dt
= +i
γ
2
Bzcα + (Bx − iBy)cβ , (5.249)
dcβ
dt
= −i
γ
2
−Bzcβ + (Bx + iBy)cα . (5.250)
The coeﬃcients cα and cβ are not independent because cαc∗
α + cβc∗
β = 1. Therefore, we can combine them into a single variable and
then solve one equation instead of two. Bloch and Siegert divided both equations by cβ and introduced a variable u = cα/cβ:
1
cβ
dcα
dt
= +i
γ
2
(Bzu + (Bx − iBy)) , (5.251)
1
cβ
dcβ
dt
= −i
γ
2
(−Bz + (Bx + iBy)u) . (5.252)
We multiply the second equation by u and subtract it from the ﬁrst one
1
cβ
dcα
dt
− u
1
cβ
dcβ
dt
= i
γ
2
2Bzu + (Bx − iBy) − (Bx + iBy)u2
. (5.253)
The time derivative of u is
du
dt
=
d
dt
cα
cβ
=
dcα
dt
cβ − cα
dcβ
dt
c2
β
=
1
cβ
dcα
dt
− u
1
cβ
dcβ
dt
(5.254)
Inserting the result into Eq. 5.253,
du
dt
= i
γ
2
2Bzu + (Bx − iBy) − (Bx + iBy)u2
. (5.255)
We assume that the longitudinal component of B is the static magnetic ﬁeld B0 and that the transverse component of B is composed
of two counterrotating ﬁelds with general amplitudes and phases:
Bz = B0, (5.256)
Bx + iBy = B1ei(ωradiot+φ1)
+ B2e−i(ωradiot+φ2)
, (5.257)
Bx − iBy = B1e−i(ωradiot+φ1)
+ B2ei(ωradiot+φ2)
. (5.258)
Then,
du
dt
= i
γ
2
2B0 + B1ei(ωradiot+φ1)
+ B2e−i(ωradiot+φ2)
− B1e−i(ωradiot+φ1)
+ B2ei(ωradiot+φ2)
u2
, (5.259)
du
dt
= −i ω0u +
ω1
2
ei(ωradiot+φ1)
+
ω2
2
e−i(ωradiot+φ2)
−
ω1
2
e−i(ωradiot+φ1)
+
ω2
2
ei(ωradiot+φ2)
u2
. (5.260)
We multiply both sides by ω1
2
e−i(ωradiot+φ1) and introduce a new variable
w =
ω1
2
e−i(ωradiot+φ1)
u (5.261)
with the time derivative
dw
dt
= −iωradio
ω1
2
e−i(ωradiot+φ1)
+
ω1
2
e−i(ωradiot+φ1) du
dt
(5.262)
We obtain
dw
dt
= −i (ω0 + ωradio)w +
ω2
1
4
1 +
ω2
ω1
e−i(2ωradiot+φ1+φ2)
− 1 +
ω2
ω1
ei(2ωradiot+φ1+φ2)
w2
. (5.263)
152
To simplify the right-hand side, we use the deﬁnition of the frequency oﬀset Ω = ω0 + ωradio (see Section 1.5.7) and write 2ωradiot +
φ1 + φ2 = 2ωradiot + φ as Φ:
dw
dt
= −i Ωw +
ω2
1
4
1 +
ω2
ω1
e−iΦ
− 1 +
ω2
ω1
eiΦ
w2
= i 1 +
ω2
ω1
eiΦ
w2
−
Ω
1 + ω2
ω1
eiΦ
w −
ω2
1
4
1 + ω2
ω1
e−iΦ
1 + ω2
ω1
eiΦ
. (5.264)
To proceed, we note that
d cot η
dη
=
d
dη
cos η
sin η
=
− sin2 η − cos2 η
sin2 η
= − 1 + cot2
η . (5.265)
We try to modify Eq. 5.264 to resemble Eq. 5.265. We rewrite the left-hand side as
dw
dt
=
dw
dη
dη
dt
(5.266)
and complete the square in the right-hand side:
dw
dη
dη
dt
= i 1 +
ω2
ω1
eiΦ


 w −
1
2
Ω
1 + ω2
ω1
eiΦ
2
−
1
4
Ω2 + ω2
1 1 + ω2
ω1
e−iΦ 1 + ω2
ω1
eiΦ
1 + ω2
ω1
eiΦ
2


 , (5.267)
dw
dη
dη
dt
= −
i
4
Ω2 + ω2
1 1 + ω2
ω1
e−iΦ 1 + ω2
ω1
eiΦ
1 + ω2
ω1
eiΦ


1 −
4 1 + ω2
ω1
eiΦ
2
Ω2 + ω2
1 1 + ω2
ω1
e−iΦ 1 + ω2
ω1
eiΦ
w −
1
2
Ω
1 + ω2
ω1
eiΦ
2


 . (5.268)
We can identify
cot2
η = −
4 1 + ω2
ω1
eiΦ
2
Ω2 + ω2
1 1 + ω2
ω1
e−iΦ 1 + ω2
ω1
eiΦ
w −
1
2
Ω
1 + ω2
ω1
eiΦ
2
(5.269)
and consequently
cot η = i
2 1 + ω2
ω1
eiΦ
Ω2 + ω2
1 1 + ω2
ω1
e−iΦ 1 + ω2
ω1
eiΦ
w −
1
2
Ω
1 + ω2
ω1
eiΦ
= i
2 1 + ω2
ω1
eiΦ
λ
w −
1
2
Ω
1 + ω2
ω1
eiΦ
(5.270)
w =
1
2
Ω − iλ cot η
1 + ω2
ω1
eiΦ
=
1
2
Ω + λ
cos η
i sin η
1 + ω2
ω1
eiΦ
=
1
2
Ω + λ eiη
+e−iη
eiη−e−iη
1 + ω2
ω1
eiΦ
, (5.271)
where
λ = Ω2 + ω2
1 1 +
ω2
ω1
e−iΦ 1 +
ω2
ω1
eiΦ = Ω2 + ω2
1 + ω2
2 + 2ω1ω2 cos Φ = Ω2 + ω2
1 + ω2
2 + 2ω1ω2 cos(2ωradiot + φ). (5.272)
The solution should be obtained by inserting Eq. 5.271 into Eq. 5.268. Before we try it for the general ﬁeld, we check a simpler solution
for ω2 = 0.
5.7.14 Evolution in rotating magnetic ﬁelds
A magnetic ﬁeld composed of B0 and of a component rotating about the z can be completely described by B0 = Bz and a single rotating
ﬁeld B1eiωradiot+φ1 = Bx + iBy. The absence of the ω2 = −γB2 term simpliﬁes Eq. 5.268 to
dw
dη
dη
dt
= −
i
4
Ω2
+ ω2
1 1 + cot2
η = −
i
4
λ2
0 1 + cot2
η (5.273)
and Eq. 5.271 to
w =
1
2
Ω −
i
2
λ0 cot η =
1
2
Ω +
1
2
λ0
cos η
i sin η
=
1
2
Ω +
1
2
λ0
eiη + e−iη
eiη − e−iη
, (5.274)
5.7. SUPPORTING INFORMATION 153
where λ0 = Ω2 + ω2
1 is constant (time-independent). Diﬀerentiation of Eq. 5.274 is then simply
dw
dη
=
d
dη
1
2
Ω −
i
2
λ0 cot η = −
i
2
λ0
d cot η
dη
=
i
2
λ2
0 1 + cot2
η (5.275)
because both Ω and λ0 are constants. Inserting the result into Eq. 5.273 gives
dw
dη
dη
dt
=
i
2
λ0 1 + cot2
η
dη
dt
= −
i
4
λ2
0 1 + cot2
η . (5.276)
Evaluation of η is then a matter of straightforward integration
ˆ η
η0
dη = −
ˆ t
0
1
2
λ0dt = −
1
2
λ0t η = η0 −
1
2
λ0t. (5.277)
Inserting to Eq. 5.274 and returning to the variable u,
u =
2w
ω1
ei(ωradiot+φ1)
=
Ω
ω1
− i
λ0
ω1
cot η ei(ωradiot+φ1)
=
Ω
ω1
+
λ0
ω1
eiη + e−iη
eiη − e−iη
ei(ωradiot+φ1)
, (5.278)
u∗
=
2w∗
ω1
e−i(ωradiot+φ1)
=
Ω
ω1
+ i
λ0
ω1
cot η e−i(ωradiot+φ1)
=
Ω
ω1
−
λ0
ω1
eiη + e−iη
eiη − e−iη
e−i(ωradiot+φ1)
, (5.279)
and
uu∗
=
4ww∗
ω2
1
=
1
ω2
1
(Ω + iλ0 cot η) (Ω − iλ0 cot η) =
Ω2
ω2
1
+
Ω2 + ω2
1
ω2
1
cos2 η
sin2 η
=
Ω2 + ω2
1 cos2 η
ω2
1 sin2 η
=
Ω2 + ω2
1 − ω2
1 sin2 η
ω2
1 sin2 η
=
Ω2 + ω2
1
ω2
1 sin2 η
− 1,
(5.280)
This allows us to calculate Pα and Pβ:
Pα = c∗
αcα = uu∗
c∗
βcβ = uu∗
(1 − Pα) ⇒ Pα =
uu∗
1 + uu∗
= 1 −
ω2
1 sin2 η
Ω2 + ω2
1
, Pβ = 1 − Pα = 1 −
uu∗
1 + uu∗
=
1
1 + uu∗
=
ω2
1 sin2 η
Ω2 + ω2
1
.
(5.281)
The derived equations include results of Section 5.7.11 for ω1 = 0 and of Section 5.7.12 for Ω = 0.
The last issue discussed in this section is determination of the integration factor η0. The value of η0 depends on the initial conditions.
For example, if we start from Pα = 1 at t = 0, sin η(t = 0) and consequently η(t = 0) must be zero. As η = η0 − λ0t/2, η0 must be zero to
give η(t = 0) = 0. With the boundary condition Pα = 1 at t = 0, the variable u is determined by
u =
2w
ω1
ei(ωradiot+φ1)
=
Ω
ω1
− i
λ0
ω1
cot
λ0
2
t ei(ωradiot+φ1)
=

 Ω
ω1
+
λ0
ω1
e−i
λ0
2
t
+ ei
λ0
2
t
e−i
λ0
2
t
− ei
λ0
2
t

 ei(ωradiot+φ1)
=
Ω
ω1
+
λ0
ω1
e−iλ0t + 1
e−iλ0t − 1
ei(ωradiot+φ1)
=
λ0+Ω0
ω1
e−iλ0t + λ0−Ω0
ω1
e−iλ0t − 1
ei(ωradiot+φ1)
=
λ0+Ω0
ω1
e−i
λ0
2
t
+ λ0−Ω0
ω1
e+i
λ0
2
t
e−i
λ0
2
t
− e+i
λ0
2
t
ei(ωradiot+φ1)
. (5.282)
Consequently (taking into account that λ0 = Ω2 + ω2
1 and sin2(−x) = sin2 x),
Pβ =
1
1 + uu∗
=
ω2
1
Ω2 + ω2
1
sin2
Ω2 + ω2
1
2
t, Pα = 1 − Pβ = 1 −
ω2
1
Ω2 + ω2
1
sin2
Ω2 + ω2
1
2
t. (5.283)
which serves us as a reference when discussing the general case in the next section.
5.7.15 Evolution in non-rotating magnetic ﬁelds
In general, w is (Eq. 5.271)
w =
1
2
Ω − iλ cot η
1 + ω2
ω1
eiΦ
=
1
2 1 + ω2
ω1
eiΦ
Ω + λ
eiη + e−iη
eiη − e−iη
=
(λ + Ω) eiη + (λ − Ω) e−iη
1 + ω2
ω1
eiΦ eiη − e−iη
=
(λ + Ω) e2iη + (λ − Ω)
2 1 + ω2
ω1
eiΦ (e2iη − 1)
(5.284)
154
In the presence of ω2 = −γB2, diﬀerentiation of w deﬁned by Eq. 5.271 is far from simple because Φ = 2ωradiot + φ1 + φ2 and
λ = Ω2 + ω2
1 + ω2
2 + 2ω1ω2 cos Φ are not constants, but time-dependent functions. To follow the analysis by Bloch and Siegert, we
”hide” the time-dependence in a single quantity denoted ξ in the following text. We express u using w deﬁned by Eq. 5.271
u =
1
ω1
Ω − iλ cot η
1 + ω2
ω1
eiΦ
ei(ωradiot+φ1)
=
λ+Ω
ω1
e2iη + λ−Ω
ω1
1 + ω2
ω1
eiΦ (e2iη − 1)
ei(ωradiot+φ1)
(5.285)
and require, in analogy with Eq. 5.282, that it is equal to
u =
λ0+Ω
ω1
Ξ e−iλ0t + λ0−Ω
ω1
Ξ e−iλ0t − 1
ei(ωradiot+φ1)
=
λ0+Ω
ω1
eiξ e−iλ0t + λ0−Ω
ω1
eiξ e−iλ0t − 1
ei(ωradiot+φ1)
=
λ0+Ω
ω1
ei
ξ−λ0t
2 + λ0−Ω
ω1
e−i
ξ−λ0t
2
ei
ξ−λ0t
2 − e−i
ξ−λ0t
2
ei(ωradiot+φ1)
,
(5.286)
where Ξ = eiξ is a time-dependent factor to be determined (note that any Ξ can be written as eiξ if we allow ξ to be a complex
quantity). The following rearrangements conﬁrm that Ξ can be really expressed analytically as a function of well-deﬁned Φ and the
unknown quantity η.
1
1+
ω2
ω1
eiΦ
λ+Ω
ω1
e2iη + 1
1+
ω2
ω1
eiΦ
λ−Ω
ω1
e2iη − 1
=
λ0+Ω
ω1
Ξ e−iλ0t + λ0−Ω
ω1
Ξ e−iλ0t − 1
(5.287)
Ξ e−iλ0t
− 1
1
1 + ω2
ω1
eiΦ
λ + Ω
ω1
e2iη
+
1
1 + ω2
ω1
eiΦ
λ − Ω
ω1
=
λ0 + Ω
ω1
Ξ e−iλ0t
+
λ0 − Ω
ω1
e2iη
− 1 (5.288)
Ξ e−iλ0t 1
1 + ω2
ω1
eiΦ
λ + Ω
ω1
e2iη
+
1
1 + ω2
ω1
eiΦ
λ − Ω
ω1
−
λ0 + Ω
ω1
e2iη
− 1 =
1
1 + ω2
ω1
eiΦ
λ + Ω
ω1
e2iη
+
1
1 + ω2
ω1
eiΦ
λ − Ω
ω1
+
λ0 − Ω
ω1
e2iη
− 1
(5.289)
Ξ = eiλ0t
λ+Ω
ω1+ω2eiΦ + λ0−Ω
ω1
e2iη + λ−Ω
ω1+ω2eiΦ − λ0−Ω
ω1
λ+Ω
ω1+ω2eiΦ − λ0+Ω
ω1
e2iη + λ−Ω
ω1+ω2eiΦ + λ0+Ω
ω1
(5.290)
Determination of Ξ is demanding. Therefore, we postpone it to Sections 5.7.16 and 5.7.17 and present here only the solution. For the
sake of simplicity, the solution for the most interesting case of a ﬁeld oscillating in one direction (i.e., for ω2 = ω1) is summarized below,
a more general is derived in Sections 5.7.16 and 5.7.17.
• It is possible to ﬁnd suﬃciently accurate approximation of ξ for oscillating ﬁelds much weaker than the static ﬁeld B1 B0. If
the alternating ﬁelds oscillate close to resonance, this also implies ω1 ωradio.
• The solution can be found as a series expansion of ξ in powers of ω1/ωradio. The second-order approximation is suﬃcient for
standard NMR experiments.
• Using this approximation, ξ, u, Pα, and Pβ can be evaluated.
• The solution is greatly simpliﬁed if we are interested only in average results of repeated experiments and assume that phase factors
dependent on the actual beginning of the measurement average to zero.
• As ω1/ωradio 1, the terms proportional to (ω1/ωradio)2 can be neglected in the expressions deﬁning the average values of Pβ
and of Pα = 1 − Pβ. Then,
Pβ ≈
ω2
1
ω2
1 + Ω −
ω2
1
4ωradio
2
sin2
ω2
1 + Ω −
ω2
1
4ωradio
2
2
t. (5.291)
We see that in a ﬁeld of radio waves oscillating in one direction with frequency close to ω0 and amplitude much lower than B0, the
equation describing evolution of Pβ has the same form as the equation describing evolution of Pβ in a rotating ﬁeld (Eq. 5.283).
The only diﬀerence is a small10 Bloch-Siegert shift of the frequency oﬀset (by ω2
1/(4ωradio)). This justiﬁes the common practice
to approximate the eﬀects of radio waves by eﬀects of rotating ﬁelds.
10
The shift by ω2
1/(4ωradio) changes Ω from Ω = ω0 + ωradio to Ω = ω0 + ωradio − ω2
1/(4ωradio) = ω0 + ωradio(1 − (ω1/(2ωradio))2). The
relative change is thus proportional to (ω1/(2ωradio))2
5.7. SUPPORTING INFORMATION 155
5.7.16 Modifying factor ξ in description of non-rotating ﬁelds
In order to determine Ξ in terms of known parameters, we convert u expressed in Eq. 5.286 to w, and evaluated its square and time
derivative
2w
ω1
= ue−i(ωradiot+φ1)
=
λ0+Ω
ω1
Ξ e−iλ0t + λ0−Ω
ω1
Ξ e−iλ0t − 1
=
λ0+Ω
ω1
Ξ e−iλ0t − 1 + λ0+Ω
ω1
+ λ0−Ω
ω1
Ξ e−iλ0t − 1
=
λ0 + Ω
ω1
+
2λ0
ω1
1
Ξ e−iλ0t − 1
, (5.292)
w =
λ0+Ω
2
Ξ e−iλ0t + λ0−Ω
2
Ξ e−iλ0t − 1
=
λ0+Ω
2
Ξ e−iλ0t − 1 + λ0+Ω
2
+ λ0−Ω
ω1
Ξ e−iλ0t − 1
=
λ0 + Ω
2
+
λ0
Ξ e−iλ0t − 1
, (5.293)
w2
=
λ0 + Ω
2
+
λ0
Ξ e−iλ0t − 1
2
=
λ0 + Ω
2
2
+
(λ0 + Ω)λ0
Ξ e−iλ0t − 1
+
λ2
0
Ξ e−iλ0t − 1
2
, (5.294)
dw
dt
= λ0
d
dt
1
Ξ e−iλ0t − 1
= −λ0
dΞ
dt
e−iλ0t − iλ0Ξ e−iλ0t
Ξ e−iλ0t − 1
2
= i
λ2
0Ξ e−iλ0t
Ξ e−iλ0t − 1
2
−
λ0e−iλ0t
Ξ e−iλ0t − 1
2
dΞ
dt
, (5.295)
and insert them into Eq. 5.264:
dw
dt
= −i Ωw +
ω2
1
4
1 +
ω2
ω1
e−iΦ
− 1 +
ω2
ω1
eiΦ
w2
(5.296)
i
λ2
0Ξ e−iλ0t
Ξ e−iλ0t − 1
2
−
λ0e−iλ0t
Ξ e−iλ0t − 1
2
dΞ
dt
=
− i
Ω(λ0 + Ω)
2
+
Ωλ0
Ξ e−iλ0t − 1
+
ω2
1
4
1 +
ω2
ω1
e−iΦ
− 1 +
ω2
ω1
eiΦ λ0 + Ω
2
2
+
(λ0 + Ω)λ0
Ξ e−iλ0t − 1
+
λ2
0
Ξ e−iλ0t − 1
2
. (5.297)
Then we separate dΞ/dt
dΞ
dt
= i λ0Ξ +
Ω(λ0 + Ω)
2
Ξ e−iλ0t − 1
2
λ0e−iλ0t
+ Ω
Ξ e−iλ0t − 1
e−iλ0t
+
ω2
1
4
1 +
ω2
ω1
e−iΦ Ξ e−iλ0t − 1
2
λ0e−iλ0t
− 1 +
ω2
ω1
eiΦ λ0 + Ω
2
2
Ξ e−iλ0t − 1
2
λ0e−iλ0t
+ (λ0 + Ω)
Ξ e−iλ0t − 1
e−iλ0t
+
λ0
e−iλ0t
, (5.298)
and simplify the right-hand side as much as possible in a series of routine steps:
dΞ
dt
= i
2Ωλ0 + 2Ω2 + ω2
1 − λ2
0 + 2Ωλ0 + Ω2
4
+
ω2
1
4
ω2
ω1
e−iΦ
−
ω2
ω1
eiΦ λ2
0 + 2Ωλ0 + Ω2
4
Ξ e−iλ0t − 1
2
λ0e−iλ0t
Ω − (λ0 + Ω) −
ω2
ω1
eiΦ
(λ0 + Ω)
Ξ e−iλ0t − 1
e−iλ0t
+ λ0Ξ −
λ0
e−iλ0t
−
ω2
ω1
eiΦ λ0
e−iλ0t
, (5.299)
dΞ
dt
= i
2Ωλ0 + 2Ω2 + ω2
1 − Ω2 + ω2
1 + 2Ωλ0 + Ω2
4
+
ω2
1
4
ω2
ω1
e−iΦ
−
ω2
ω1
eiΦ λ2
0 + 2Ωλ0 + Ω2
4
Ξ e−iλ0t − 1
2
λ0e−iλ0t
−λ0
Ξ e−iλ0t − 1
e−iλ0t
−
ω2
ω1
eiΦ
(λ0 + Ω)
Ξ e−iλ0t − 1
e−iλ0t
+ λ0
Ξ e−iλ0t − 1
e−iλ0t
−
ω2
ω1
eiΦ λ0
e−iλ0t
, (5.300)
dΞ
dt
= i
ω1ω2
4λ0
e−iΦ
− eiΦ λ0 + Ω
ω1
2
Ξ e−iλ0t − 1
2
e−iλ0t
−
ω2
ω1
eiΦ (λ0 + Ω)Ξ e−iλ0t − Ω
e−iλ0t
, (5.301)
dΞ
dt
= i
ω1ω2
4λ0
e−iΦ
− eiΦ λ0 + Ω
ω1
2
Ξ2
e−iλ0t
− 2Ξ + eiλ0t
−
ω2
ω1
eiΦ
(λ0 + Ω)Ξ +
ω2
ω1
eiΦ
Ωeiλ0t
, (5.302)
dΞ
dt
= i
ω1ω2
4λ0
e−iΦ
Ξ2
e−iλ0t
− 2Ξ + eiλ0t
− eiΦ λ0 + Ω
ω1
2
Ξ2
e−iλ0t
− 2Ξ + eiλ0t
+
4λ2
0 + 4Ωλ0
ω2
1
Ξ −
4Ωλ0
ω2
1
eiλ0t
, (5.303)
156
dΞ
dt
= i
ω1ω2
4λ0
e−iΦ
−
λ0 + Ω
ω1
2
eiΦ
Ξ2
e−iλ0t
− 2 e−iΦ
−
Ω2 + ω2
1 + 2Ωλ0 + Ω2 − 2(Ω2 + ω2
1) − 2Ωλ0
ω2
1
eiΦ
Ξ
+ e−iΦ
−
λ2
0 + 2Ωλ0 + Ω2 − 4Ωλ0
ω2
1
eiΦ
eiλ0t
, (5.304)
dΞ
dt
= i
ω1ω2
4λ0
e−iΦ
−
λ0 + Ω
ω1
2
eiΦ
Ξ2
e−iλ0t
− 2 e−iΦ
+ eiΦ
Ξ + e−iΦ
−
λ0 − Ω
ω1
2
eiΦ
eiλ0t
. (5.305)
We express the complex quantity Ξ as eiξ, where ξ is also a complex time-dependent function. Then
dΞ
dt
=
d(iξ)
dt
eiξ
= iΞ
dξ
dt
(5.306)
Dividing Eq. 5.305 by iΞ gives
dξ
dt
=
ω1ω2
4λ0
e−iΦ
−
λ0 + Ω
ω1
2
eiΦ
ei(ξ−λ0t)
− 2 e−iΦ
+ eiΦ
+ e−iΦ
−
λ0 − Ω
ω1
2
eiΦ
e−i(ξ−λ0t)
. (5.307)
This diﬀerential equation cannot be solved by direct integration because the variables t and ξ are not separated. But we proceed
anyway
ξˆ
0
dξ =
ω1ω2
4λ0
tˆ
0
e−iΦ
−
λ0 + Ω
ω1
2
eiΦ
ei(ξ−λ0t )
dt −
ω1ω2
2λ0
tˆ
0
e−iΦ
+ eiΦ
dt +
ω1ω2
4λ0
tˆ
0
e−iΦ
−
λ0 − Ω
ω1
2
eiΦ
e−i(ξ−λ0t )
dt ,
(5.308)
applying the boundary condition Pα = 1 ⇒ u → ∞ ⇒ Ξ = 1 and ξ = 0 at t = 0 (see Eq. 5.286). Only the middle integral of the
right-hand side can be evaluated
−
ω1ω2
2λ0
tˆ
0
e−iΦ
+ eiΦ
dt = −
ω1ω2
λ0
tˆ
0
cos Φdt = −
ω1ω2
λ0
Φˆ
φ
dt
dΦ
cos ΦdΦ = −
ω1ω2
2λ0ωradio
Φˆ
φ
cos ΦdΦ = −ε
Φˆ
φ
cos ΦdΦ
= ε(sin φ − sin Φ) = ε(sin φ − sin(2ωradiot + φ)), (5.309)
where ε is a dimensionless constant
ε =
ω1ω2
2ωradio Ω2 + ω1
. (5.310)
The other two integrals can be modiﬁed, but not solved, in the same manner
ω1ω2
4λ0
tˆ
0
e−iΦ
−
λ0 ± Ω
ω1
2
eiΦ
e±i(ξ−λ0t )
dt =
ω1ω2
4λ0
Φˆ
φ
e−iΦ
2ωradio
−
λ0 ± Ω
ω1
2
eiΦ
2ωradio
e
±i ξ−λ0
Φ −φ
2ωradio dΦ
=
ω1ω2
8λ0ωradio
Φˆ
φ
e−iΦ
−
λ0 ± Ω
ω1
2
eiΦ
e
±i ξ−λ0
Φ −φ
2ωradio dΦ = ε
Φˆ
φ
1
4
e−iΦ
−
λ0 ± Ω
ω1
2
eiΦ
e
±i ξ−λ0
Φ −φ
2ωradio dΦ . (5.311)
In summary, we have found that the time-dependent factor ξ is deﬁned by
ξ = ε


sin φ − sin Φ +
1
4
Φˆ
φ
e−iΦ
−
λ0 + Ω
ω1
2
eiΦ
e
i ξ−λ0
Φ −φ
2ωradio + e−iΦ
−
λ0 − Ω
ω1
2
eiΦ
e
−i ξ−λ0
Φ −φ
2ωradio dΦ


 .
(5.312)
This deﬁnition is only implicit, because ξ is present also in the integral in the right-hand side.
5.7. SUPPORTING INFORMATION 157
5.7.17 Factor ξ approximated by power series expansion
Bloch and Siegert noticed that the constant ε plays a critical role in Eq. 5.312. The function ξ depends on ε and, if ε is suﬃciently small,
this dependence can be approximated as a power series
ξ = ξ0 + εξ1 + ε2
ξ2 + . . . (5.313)
In other words, we assume that ξ ≈ ξ0 and add corrections εξ1, ε2ξ2, etc. to improve accuracy of the approximation. If ε 1 is, the
higher powers of ε are even smaller and already low powers provide good approximation. When is ε suﬃciently small? Eq. 5.310 shows
that on resonance (i.e., for Ω = 0),
ε ≤
ω2
2ωradio
. (5.314)
We see that ε 1 if ω2 ωradio, it is not necessary to have ω2 ω1!. This shows that our approximation is well applicable for
counter-rotating ﬁelds with B1 = B2 (and therefore ω2 = ω1) and with φ1=φ2 that add up to a ﬁeld oscillating in one direction:
B1ei(ωradiot+φ1)
+ B1e−i(ωradiot+φ1)
= 2B1 cos(ωradiot + φ1). (5.315)
For ω2 ωradio, we can replace ξ in Eq. 5.312 by ξ = ξ0 + εξ1 + ε2ξ2 + . . . and compare the terms with the same powers of ε to
determine the functions ξ0, ξ1, ξ2, . . . . As ξ is a time-dependent function, we should also check how it evolves in time. The oscillatory
terms like e−iωt (where ω is a constant) stay within the range ±1 even if t → ∞. However, as we show below, the integral in Eq. 5.312
also yields linear terms. Therefore, we have to express ξ as a sum of oscillatory components (labeled by the symbol ∼ and including also
constant contributions) and components linearly increasing in time (labeled by the symbol ∠):
ξ = ξ∼
+ ξ∠
t = ξ∼
0 + εξ∼
1 + ε2
ξ∼
2 + · · · + ξ∠
0 t + εξ∠
1 t + ε2
ξ∠
2 t + . . . (5.316)
Eq. 5.312 with the power expansion of the left-hand side is
ξ = ξ∼
0 + εξ∼
1 + ε2
ξ∼
2 + · · · + ξ∠
0 t + εξ∠
1 t + ε2
ξ∠
2 t + · · · = ε (sin φ − sin Φ) +
ε
4
Φˆ
φ
e−iΦ
−
λ0 + Ω
ω1
2
eiΦ
eiξ∼
e
i(ξ∠
−λ0) Φ −φ
2ωradio + e−iΦ
−
λ0 − Ω
ω1
2
eiΦ
e−iξ∼
e
−i(ξ∠
−λ0) Φ −φ
2ωradio dΦ
= ε (sin φ − sin Φ) +
1
4
Φˆ
φ
e−iΦ
−
λ0 + Ω
ω1
2
eiΦ
εeiξ∼
eiΛΦ
e−iΛφ
+ e−iΦ
−
λ0 − Ω
ω1
2
eiΦ
εe−iξ∼
e−iΛΦ
eiΛφ
dΦ , (5.317)
where
Λ =
ξ∠ − λ0
2ωradio
. (5.318)
We have already evaluated (from the initial condition) ξ0 = ξ(t = 0) = 0. Therefore, we can skip the constant ξ0 part in the expansion.
To determine the higher terms, we replace the time-dependent exponential functions e±iξ∼
by a power series11
e±i(εξ∼
1 +ε2
ξ∼
2 +... ) = 1 ± i εξ∼
1 + ε2
ξ∼
2 + . . . −
1
2
ε2
(ξ∼
1 )2
+ . . . (5.319)
and consequently
εe±iξ∼
+...
= ε ± iε2
ξ∼
1 + . . . . (5.320)
Equating terms linear in ε, we determine ξ1
ξ1 = ξ∼
1 + ξ∠
1 t = ξ∼
1 + ξ∠
1
Φ − φ
2ωradio
=
sin φ − sin Φ +
1
4



Φˆ
φ
e−iΦ
−
λ0 + Ω
ω1
2
eiΦ
eiΛΦ
e−iΛφ
+ e−iΦ
−
λ0 − Ω
ω1
2
eiΦ
e−iΛΦ
eiΛφ


 dΦ =
sin φ − sin Φ +
1
4



Φˆ
φ
e−i(1−Λ)Φ
e−iΛφ
+ e−i(1+Λ)Φ
eiΛφ
−
λ0 + Ω
ω1
2
ei(1+Λ)Φ
e−iΛφ
−
λ0 − Ω
ω1
2
ei(1−Λ)Φ
eiΛφ


 dΦ = sin φ − sin Φ+
11
Note that ex = 1 + x
1!
+ x2
2!
+ x3
3!
+ . . .
158
i
4







F∗
−
e−i(1−Λ)Φ
e−iΛφ
−e−iφ
1 − Λ
+
F∗
+
e−i(1+Λ)Φ
eiΛφ
−e−iφ
1 + Λ
+
λ0 + Ω
ω1
2
F+
ei(1+Λ)Φ
e−iΛφ
−eiφ
1 + Λ
+
λ0 − Ω
ω1
2
F−
ei(1−Λ)Φ
eiΛφ
−eiφ
1 − Λ







=
sin φ − sin(2ωradiot + φ) +
ie−iφ
4
e−2i(1−Λ)ωradiot − 1
1 − Λ
+
e−2i(1+Λ)ωradiot − 1
1 + Λ
+
ieiφ
4
λ0 + Ω
ω1
2
e2i(1+Λ)ωradiot − 1
1 + Λ
+
λ0 − Ω
ω1
2
e2i(1−Λ)ωradiot − 1
1 − Λ
. (5.321)
The integration produces only oscillatory terms or constant terms (those depending on φ only), implying that ξ∠
1 = 0. Note that all
terms depend on φ values of these terms are arbitrary because the value of φ depends on the choice of t = 0. If a series of independent
experiments is run, the φ-dependent terms average to zero. Therefore, such terms must be evaluated in order to proceed to the next order
of approximation, but they themselves do not contribute to the results of repeated experiments. Another feature of the solution is that the
result of the integration can be written in terms of two time-dependent functions, denoted F+, F− and written in blue, and their complex
conjugates, denoted F∗
+, F∗
− and written in red. We use this fact when we evaluate ξ∠
2 in the next step.
Equating terms quadratic in ε, we determine ξ2
ξ2 = ξ∼
2 + ξ∠
2 t = ξ∼
2 + ξ∠
2
Φ − φ
2ωradio
=
i
4
Φˆ
φ
ξ∼
1 e−iΦ
−
λ0 + Ω
ω1
2
eiΦ
eiΛΦ
e−iΛφ
− e−iΦ
−
λ0 − Ω
ω1
2
eiΦ
e−iΛΦ
eiΛφ
dΦ =
i
4
Φˆ
φ
ξ∼
1 e−i(1−Λ)Φ
e−iΛφ
− e−i(1+Λ)Φ
eiΛφ
−
λ0 + Ω
ω1
2
ei(1+Λ)Φ
e−iΛφ
+
λ0 − Ω
ω1
2
ei(1−Λ)Φ
eiΛφ
dΦ (5.322)
The function to be integrated is a product of the following expressions:
ξ∼
1 = sin φ − sin Φ +
i
4
F∗
− − e−iφ
1 − Λ
+
F∗
+ − e−iφ
1 + Λ
+
λ0 + Ω
ω1
2
F+ − eiφ
1 + Λ
+
λ0 − Ω
ω1
2
F− − eiφ
1 − Λ
=
i
4
2e−iφ
− 2eiφ
− 2e−iΦ
+ 2eiΦ
+
F∗
− − e−iφ
1 − Λ
+
F∗
+ − e−iφ
1 + Λ
+
λ0 + Ω
ω1
2
F+ − eiφ
1 + Λ
+
λ0 − Ω
ω1
2
F− − eiφ
1 − Λ
and
i
4
F∗
− − F∗
+ −
λ0 + Ω
ω1
2
F+ +
λ0 − Ω
ω1
2
F− .
This product consists of three types of terms:
1. Constant terms. Their integration produces a linear function, i.e., the ξ∠
2 t component of ξ2.
2. Terms that oscillate in time but do not depend on the phase φ. Their integration yields oscillatory and constant terms, i.e.,
contributions to ξ∼
2 .
3. Oscillatory φ-dependent terms. Their integration also yields oscillatory and constant terms (contributions to ξ∼
2 ). However, values
of these terms are arbitrary and do not contribute to a description of repeated experiments because they depend on the choice of
t = 0 as mentioned when we discussed the obtained ξ1 factor. As we stop our approximation at the second order of the power
series expansion, we do not need the φ-dependent terms to calculate corrections of higher orders and neglect them in our analysis.
We use the following relations to identify the constant and oscillatory φ-independent terms.
F±F∗
± = 1 (5.323)
F±e−iφ
= e+iΦ
e±iΛΦ
e iΛφ
· e−iφ
= e+2i(1±Λ)ωradiot
F∗
±e+iφ
= e−iΦ
e iΛΦ
e±iΛφ
· e+iφ
= e−2i(1±Λ)ωradiot
(5.324)
F±e−iΦ
= e+iΦ
e±iΛΦ
e iΛφ
· e−iΦ
= e±2iΛωradiot
F∗
±e+iΦ
= e−iΦ
e iΛΦ
e±iΛφ
· e+iΦ
= e 2iΛωradiot
. (5.325)
5.7. SUPPORTING INFORMATION 159
Eqs. 5.324 and 5.325 tell us that we have to inspect all terms of the product obtained by multiplying a red function by a blue function,
or vice versa.
Φˆ
φ
F∗
− − F∗
+
16
2eiφ
− 2eiΦ
−
λ0 + Ω
ω1
2
F+ − eiφ
1 + Λ
−
λ0 − Ω
ω1
2
F− − eiφ
1 − Λ
dΦ
+
1
16
Φˆ
φ
λ0 + Ω
ω1
2
F+ −
λ0 − Ω
ω1
2
F− 2e−iφ
− 2e−iΦ
+
F∗
− − e−iφ
1 − Λ
+
F∗
+ − e−iφ
1 + Λ
dΦ (5.326)
=
ωradio
4
tˆ
0
λ0 + Ω
ω1
2
1
1 + Λ
−
λ0 − Ω
ω1
2
1
1 − Λ
dt
+
ωradio
8
tˆ
0
e+2iΛωradiot
2 e−2iωradiot
− 1 +
λ0 + Ω
ω1
2
1
1 + Λ
+
λ0 − Ω
ω1
2
1
1 − Λ
e−2iωradiot
−
λ0 + Ω
ω1
2
1
1 + Λ
dt
−
ωradio
8
tˆ
0
e−2iΛωradiot
2 e−2iωradiot
− 1 +
λ0 + Ω
ω1
2
1
1 + Λ
+
λ0 − Ω
ω1
2
1
1 − Λ
e−2iωradiot
−
λ0 − Ω
ω1
2
1
1 − Λ
dt
+
ωradio
8
tˆ
0
λ0 + Ω
ω1
2
1
1 + Λ
−
λ0 − Ω
ω1
2
1
1 − Λ
dt
+
ωradio
8
tˆ
0
e+2iΛωradiot
2
λ0 + Ω
ω1
2
e+2iωradiot
− 1 −
λ0 + Ω
ω1
2
1
1 + Λ
+
1
1 − Λ
e+2iωradiot
+
λ0 + Ω
ω1
2
1
1 − Λ
dt
−
ωradio
8
tˆ
0
e−2iΛωradiot
2
λ0 − Ω
ω1
2
e−2iωradiot
− 1 −
λ0 − Ω
ω1
2
1
1 + Λ
+
1
1 − Λ
e+2iωradiot
+
λ0 − Ω
ω1
2
1
1 + Λ
dt .
(5.327)
We combine terms with the same s in the exponents e2siΛωradiot
ξ2 =
ωradio
4
tˆ
0
λ0 + Ω
ω1
2
1
1 + Λ
−
λ0 − Ω
ω1
2
1
1 − Λ
dt
−
ωradio
8
tˆ
0
e+2iΛωradiot λ0 + Ω
ω1
2
1
1 − Λ
−
1
1 + Λ
+ 2
λ0 + Ω
ω1
2
+ 1
+
λ0 + Ω
ω1
2
1
1 + Λ
+
1
1 − Λ
− 2 e+2iωradiot
−
λ0 + Ω
ω1
2
1
1 + Λ
+
λ0 − Ω
ω1
2
1
1 − Λ
+ 2 e−2iωradiot
dt
+
ωradio
8
tˆ
0
e+2iΛωradiot λ0 − Ω
ω1
2
1
1 + Λ
−
1
1 − Λ
+ 2
λ0 + Ω
ω1
2
+ 1
+
λ0 − Ω
ω1
2
1
1 + Λ
+
1
1 − Λ
− 2 e+2iωradiot
−
λ0 + Ω
ω1
2
1
1 + Λ
+
λ0 − Ω
ω1
2
1
1 − Λ
+ 2 e−2iωradiot
dt . (5.328)
In order to simplify Eq. 5.328, we examine the value of
Λ =
ξ∠ − λ0
2ωradio
=
εξ∠
1 + ε2ξ∠
2 + · · · − λ0
2ωradio
. (5.329)
We have determined that ξ∠
1 = 0. Therefore, Λ at the examined level of approximation (including corrections up to quadratic in ε),
Λ =
ε2ξ∠
2 − λ0
2ωradio
=
ω2
1ω2
2ξ∠
2
2λ2
0ω3
radio
−
λ0
2ωradio
=
ω2
1
Ω2 + ω2
1
ω2
2ξ∠
2
2ω3
radio
−
ω1
2ωradio
1 +
Ω2
ω2
1
. (5.330)
160
Our approximation is applicable only for ωradio much higher than ω2 or ω1 (recall that ω2 = ω1 for the most interesting example of a
ﬁeld oscillating in one direction). The ratios ω1/ωradio and ω2/ωradio in Eq. 5.330 indicate that Λ 1. For Λ 1, 1 + Λ ≈ 1 − Λ ≈ 1.
Consequently, the red terms Eq. 5.328 are negligible, and the green expression simpliﬁes to 4(Ω2 + ω2
1)/ω2
1, which allows us to further
simplify the integral by in the next step
ξ2 =
ωradio
4
tˆ
0
λ0 + Ω
ω1
2
−
λ0 − Ω
ω1
2
dt
+
ωradio
4
tˆ
0
e−2iΛωradiot λ0 − Ω
ω1
2
+ 1 − e+2iΛωradiot λ0 + Ω
ω1
2
+ 1 dt
+
ωradio
2
tˆ
0
Ω2 + ω2
1
ω2
1
e−2i(1−Λ)ωradiot
− e−2i(1+Λ)ωradiot
dt (5.331)
because the green exponential factors cancel each other for Λ 1. Neglecting the last line, the integration yields
ξ2 = ξ∠
2 t + ξ∼
2 =
ωradio
4
λ0 + Ω
ω1
2
−
λ0 − Ω
ω1
2
[t ]t
0
+
ωradio
4
λ0 − Ω
ω1
2
+ 1
e−2iΛωradiot
t
0
−2iΛωradio
−
ωradio
4
λ0 + Ω
ω1
2
+ 1
e+2iΛωradiot
t
0
2iΛωradio
=
ωradio
4
λ0 + Ω
ω1
2
−
λ0 − Ω
ω1
2
t +
i
8Λ
λ0 + Ω
ω1
2
+ 1 e+2iΛωradiot
− 1 +
λ0 − Ω
ω1
2
+ 1 e−2iΛωradiot
− 1
(5.332)
= ωradio
λ0Ω
ω2
1
t+
i
8Λ
λ0 + Ω
ω1
2
+ 1 (cos(2Λωradiot) + i sin(2Λωradiot) − 1) +
i
8Λ
λ0 − Ω
ω1
2
+ 1 (cos(2Λωradiot) − i sin(2Λωradiot) − 1) (5.333)
= ωradio
λ0Ω
ω2
1
t +
i
8Λ
λ0 + Ω
ω1
2
+
λ0 − Ω
ω1
2
+ 2 (cos(2Λωradiot) − 1) −
1
8Λ
λ0 + Ω
ω1
2
−
λ0 − Ω
ω1
2
sin(2Λωradiot) (5.334)
= ωradio
λ0Ω
ω2
1
t +
i
8Λ
λ0 + Ω
ω1
+
λ0 − Ω
ω1
2
(cos(2Λωradiot) − 1) −
1
8Λ
4λ0Ω
ω2
1
sin(2Λωradiot) (5.335)
= ωradio
λ0Ω
ω2
1
t +
i
2Λ
λ2
0
ω2
1
(cos(2Λωradiot) − 1) −
1
2Λ
λ0Ω
ω2
1
sin(2Λωradiot). (5.336)
We easily identify ξ∠
2 and ξ∼
2 as the red and blue expressions. We are now ready to express the factor Ξ = eiξ as
eiξ
= eiε2
ξ2 = e
i
ω2
2
2ωradio
Ω
λ0
t−
2f
1
2Λ
ω2
2ωradio
2
(cos(2Λωradiot) − 1) −i
2g
1
2Λ
ω2
2ωradio
2
Ω
λ0
sin(2Λωradiot)
= e
i
ω2
2
2ωradio
Ω
λ0
t−2f−2ig
(5.337)
and
ei
ξ−λ0t
2 = e
i
ω2
2
4ωradio
Ω
λ0
−
λ0
2
t−
f
1
4Λ
ω2
2ωradio
2
(cos(2Λωradiot) − 1) −i
g
1
4Λ
ω2
2ωradio
2
Ω
λ0
sin(2Λωradiot)
= eiΛωradiot−f−ig
, (5.338)
where we simpliﬁed notation by introducing real functions f and g. Using the notation, we can express Pβ = 1 − Pα from the last
term in Eq. 5.286 (the horizontal bars indicate that Pβ and Pα are averages for a large number of measurements with random phases φ)
uu∗
=
λ0+Ω
ω1
eiΛωradiote−f−ig + λ0−Ω
ω1
e−iΛωradiotef+ig
eiΛωradiote−f−ig − e−iΛωradiotef+ig
·
λ0+Ω
ω1
e−iΛωradiote−f+ig + λ0−Ω
ω1
eiΛωradiotef−ig
e−iΛωradiote−f+ig − eiΛωradiotef−ig
5.7. SUPPORTING INFORMATION 161
=
λ0+Ω
ω1
2
e−2f + λ0−Ω
ω1
2
e2f +
λ2
0−Ω2
ω2
1
e2iΛωradiote−2ig + e−2iΛωradiote2ig
e−2f + e2f − e2iΛωradiote−2ig − e−2iΛωradiote2ig
=
λ2
0+Ω2
ω2
1
e−2f + e2f + 2λ0Ω
ω2
1
e−2f − e2f + e2iΛωradiote−2ig + e−2iΛωradiote2ig
e−2f + e2f − (e2iΛωradiote−2ig + e−2iΛωradiote2ig)
(5.339)
1 − uu∗
=
λ2
0+Ω2
ω2
1
+ 1 e−2f + e2f + 2λ0Ω
ω2
1
e−2f − e2f
e−2f + e2f − (e2iΛωradiote−2ig + e−2iΛωradiote2ig)
=
2
λ2
0
ω2
1
e−2f + e2f + 2λ0Ω
ω2
1
e−2f − e2f
e−2f + e2f − (e2iΛωradiote−2ig + e−2iΛωradiote2ig)
. (5.340)
To proceed, we look at f and g more closely. Knowing the explicit expression of ξ∠
2 (Eq. 5.336), we can calculate Λ from Eq. 5.330
Λ =
ε2ξ∠
2
2ωradio
−
λ0
2ωradio
=
1
2
ω2
2ωradio
2
Ω
λ0
−
λ0
2ωradio
= −
λ0
2ωradio
1 −
ω2
2Ω
4ωradioλ2
0
. (5.341)
Inserting the expressed Λ to Eq. 5.338,
1
4Λ
ω2
2ωradio
2
=
ω2
2ωradio
2
2 ω2
2ωradio
2
Ω
λ0
− 2 λ0
ωradio
=
1
2
1
Ω
λ0
− 2ωradio
ω2
2
λ0
ωradio
=
1
2
1
Ω
Ω2+ω2
1
− 4ωradio
ω2
ω1
ω2
Ω2+ω2
1
ω1
. (5.342)
Since we assumed that ω2 ωradio, this expression and consequently f and g are much smaller than unity12
1
4Λ
ω2
2ωradio
2
=
1
2
1
Ω
Ω2+ω2
1
− 4ωradio
ω2
ω1
ω2
Ω2+ω2
1
ω1
≈ −
1
8
·
ω2
ωradio
·
ω2
ω1
·
ω1
Ω2 + ω2
1
1. (5.343)
We can therefore express ef and eig as power series and safely ignore higher than linear terms
ef
≈ 1 + f, eig
≈ 1 + ig (5.344)
This simpliﬁes Eq. 5.340 to
1 − uu∗
=
4
λ2
0
ω2
1
− 8λ0Ω
ω2
1
f
2 − (e2iΛωradiot + e−2iΛωradiot) + 2ig (e2iΛωradiot − e−2iΛωradiot)
=
λ2
0
ω2
1
·
2 − 4 Ω
λ0
f
1 − cos(2Λωradiot) − 2g sin(2Λωradiot)
. (5.345)
and
Pβ =
1
1 − uu∗
=
ω2
1
λ2
0
·
1 − cos(2Λωradiot) − 2g sin(2Λωradiot)
2 − 4 Ω
λ0
f
. (5.346)
Writing f and g explicitly,
Pβ =
1
1 − uu∗
=
ω2
1
λ2
0
·
1 − cos(2Λωradiot) − 1
2Λ
ω2
2ωradio
2
Ω
λ0
sin2(2Λωradiot)
2 + 1
Λ
ω2
2ωradio
2
Ω
λ0
(1 − cos(2Λωradiot))
. (5.347)
Using the identities cos(2x) = cos2 x − sinx = 1 − 2 sin2 x and sin(2x) = 2 sin x cos x,
Pβ =
ω2
1
λ2
0
·
2 sin2(Λωradiot) − 2
Λ
ω2
2ωradio
2
Ω
λ0
sin2(Λωradiot) cos2(Λωradiot)
2 + 2
Λ
ω2
2ωradio
2
Ω
λ0
sin2(Λωradiot)
=
ω2
1
λ2
0
sin2
(Λωradiot) ·
1 − 1
Λ
ω2
2ωradio
2
Ω
λ0
cos2(Λωradiot)
1 + 1
Λ
ω2
2ωradio
2
Ω
λ0
sin2(Λωradiot)
12
Taking into account that Ω ≤ λ Ω2 + ω2
1, ω1 ≤ λ Ω2 + ω2
1, and that ω2 is not substantially greater than ω1 (in the most relevant
case of linear oscillations ω2 = ω1).
162
=
ω2
1
λ2
0
sin2
(Λωradiot) ·
1 − 1
Λ
ω2
2ωradio
2
Ω
λ0
cos2(Λωradiot)
1 + 1
Λ
ω2
2ωradio
2
Ω
λ0
sin2(Λωradiot)
·
1 − 1
Λ
ω2
2ωradio
2
Ω
λ0
sin2(Λωradiot)
1 − 1
Λ
ω2
2ωradio
2
Ω
λ0
sin2(Λωradiot)
=
ω2
1
λ2
0
sin2
(Λωradiot) ·
1 − 1
Λ
ω2
2ωradio
2
Ω
λ0
+ 1
Λ2
ω2
2ωradio
4
Ω
λ0
2
sin2(Λωradiot) cos2(Λωradiot)
1 − 1
Λ2
ω2
2ωradio
4
Ω
λ0
2
sin4(Λωradiot)
, (5.348)
where the blue term was obtained using the Pythagorean identity cos2 x + sin2 x = 1. The terms containing the very small factors
printed in red can be safely neglected, yielding an equation closely resembling Eq 5.283
Pβ ≈
ω2
1
λ2
0
1 −
1
Λ
ω2
2ωradio
2
Ω
λ0
sin2
(Λωradiot) =
ω2
1
λ2
0
1 −
ω2
2
2ωradio
Ω
2Λωradioλ0
sin2
(Λωradiot). (5.349)
Using Eq. 5.341,
2Λωradioλ0 = − ω2
1 + Ω2
−
ω2
2
4ωradio
Ω (5.350)
and
(2Λωradio)2
=
ω2
1 + Ω2 −
ω2
2
4ωradio
Ω
2
ω2
1 + Ω2
=
ω2
1 + Ω2 2
−
ω2
2
2ωradio
Ω ω2
1 + Ω2 +
ω4
2
16ω2
radio
Ω2
ω2
1 + Ω2
= ω2
1 + Ω2
−
ω2
2
2ωradio
Ω +
ω4
2
16ω2
radio
Ω2
ω2
1 + Ω2
= ω2
1 + Ω −
ω2
2
4ωradio
2
−
ω4
2
16ω2
radio
+
ω4
2
16ω2
radio
Ω2
ω2
1 + Ω2
= ω2
1 + Ω −
ω2
2
4ωradio
2
−
ω4
2
16ω2
radio
ω2
1
ω2
1 + Ω2
. (5.351)
Neglecting the very small red term,
Λωradio ≈ −
1
2
ω2
1 + Ω −
ω2
2
4ωradio
2
(5.352)
Taking into account that sin2(−x) = sin2 x, we can express Pβ as
Pβ ≈
ω2
1
ω2
1 + Ω2

1 −
ω2
2
2ωradio
Ω
ω2
1 + Ω2 −
ω2
2
4ωradio
Ω

 sin2
ω2
1 + Ω −
ω2
2
4ωradio
2
2
t
=
ω2
1
ω2
1 + Ω2
ω2
1 + Ω2 +
ω2
2
4ωradio
Ω
ω2
1 + Ω2 −
ω2
2
4ωradio
Ω
sin2
ω2
1 + Ω −
ω2
2
4ωradio
2
2
t (5.353)
Multiplying numerator and denominator of the second fraction on the second line by ω2
1 + Ω2 −
ω2
2
4ωradio
Ω, we obtain
5.7. SUPPORTING INFORMATION 163
Pβ ≈
ω2
1
ω2
1 + Ω2
ω2
1 + Ω2 2
−
ω4
2
16ω2
radio
Ω2
ω2
1 + Ω2 −
ω2
2
4ωradio
Ω
2
sin2
ω2
1 + Ω −
ω2
2
4ωradio
2
2
t
=
ω2
1
ω2
1 + Ω2
ω2
1 + Ω2 2
−
ω4
2
16ω2
radio
Ω2
ω2
1 + Ω2 2
−
ω2
2
2ωradio
Ω ω2
1 + Ω2 +
ω4
2
16ω2
radio
Ω2
sin2
ω2
1 + Ω −
ω2
2
4ωradio
2
2
t
=
ω2
1
ω2
1 + Ω2 2
ω2
1 + Ω2 2
−
ω4
2
16ω2
radio
Ω2
ω2
1 + Ω2 −
ω2
2
2ωradio
Ω +
ω4
2
16ω2
radio
Ω2
ω2
1+Ω2
sin2
ω2
1 + Ω −
ω2
2
4ωradio
2
2
t
=
ω2
1
ω2
1 + Ω2 2
ω2
1 + Ω2 2
−
ω4
2
16ω2
radio
Ω2
ω2
1 + Ω −
ω2
2
4ωradio
2
−
ω4
2
16ω2
radio
ω2
1
ω2
1+Ω2
sin2
ω2
1 + Ω −
ω2
2
4ωradio
2
2
t. (5.354)
Again, we neglect the very small red terms, cancel ω2
1 + Ω2 2
in the denominator and numerator of the ﬁrst and second fraction,
respectively, and obtain
Pβ ≈
ω2
1
ω2
1 + Ω −
ω2
2
4ωradio
2
sin2
ω2
1 + Ω −
ω2
2
4ωradio
2
2
t. (5.355)
Comparison with Eq 5.283
Pβ =
ω2
1
ω2
1 + Ω2
sin2
ω2
1 + Ω2
2
t (5.356)
reveals that the expression describing the average population Pβ for non-rotating ﬁelds (in the approximation keeping terms proportional
to ω1/ωradio but neglecting terms proportional to higher powers of ω1/ωradio) diﬀers form the expression describing the population
Pβ for rotating ﬁelds only by the frequency shift
ω2
2
4ωradio
.
164
Lecture 6
Ensemble of non-interacting spins
Literature: A nice short introduction is given in K3.1. The topic is clearly described in K6, L11,
C2.2. The mixed state is introduced nicely in B17.2, K6.8, L11.1, and C2.2.2. The general strategy of
analyzing NMR experiments is outlined in C2.4. More speciﬁc references are given in the individual
sections below.
6.1 Mixed state
So far, we worked with systems in so-called pure states, when we described the whole studied system
by its complete wave function. It is ﬁne if the system consists of one particle or a small number of
particles. In the case of a single particle, the wave function Ψ(x, y, z, cα) depends on the x, y, z coordinates
of the particle plus the additional degree of freedom describing the spin state (in terms of the
four-components of the solution of the Dirac equation). Extending quantum-mechanical description
to more than one particle presents both fundamental and practical problems. A fundamental problem
is that particles of the same type cannot be distinguished as in classical mechanics. This issue is
brieﬂy discussed in Section 6.7.1. The major practical problem is a high complexity of multiparticle
systems. The complete wave function of whole molecule is already very complicated, represented by
multidimensional state vectors and their properties are described by operators represented by multidimensional
matrices. In the case of macroscopic ensembles of many molecules, the dimensionality
of the state vectors and operator matrices is described by astronomic numbers. A typical NMR
sample contains approximately 1024
particles (electrons, protons, and neutrons). Clearly, we cannot
use the brute-force approach requiring determination of the complete wave function. In this lecture,
we describe two levels of simpliﬁcation routinely applied to describe NMR samples.
The ﬁrst level of simpliﬁcation is separation of the description of spin magnetic moments from
the other terms of the wave function. In NMR spectroscopy, we are interested only in properties
of molecules associated with spins of the observed nuclei. If we assume that motions of the whole
molecule, of its atoms, and of electrons and nuclei in the atoms, do not depend on the spin of
the observed nucleus, we can divide the complete wave function into spin wave functions and wave
function describing all the other degrees of freedom. Validity of such assumption is discussed in
Section 6.7.2. Based on the arguments presented there, we can conclude that (in most cases except
for some relaxation eﬀects) wave functions (and consequently of Hamiltonians) can be divided into
two parts, one dependent on the spin degrees of freedom, and the other one dependent on the
165
166
other degrees of freedom that are not important in the NMR spectroscopy. To describe the NMR
experiment, is is suﬃcient to analyze only the spin wave function (spin state vector). However, the
number of dimensions of the spin state vector is extremely high, typically ∼ 1023
, and properties
of the large sets of magnetic moments in bulk samples are described by operators represented by
matrices of the same dimensionality. Another level of simpliﬁcation is therefore needed.
The second level of simpliﬁcation is related to the question whether individual magnetic moments
can be treated independently. This is possible if the spin Hamiltonian can be decomposed into a sum
of operators acting separately on individual nuclear magnetic moments, as shown in Section 6.7.3. If
this condition is fulﬁlled, the spin wave function of the whole ensemble can be decomposed to independent
spin wave functions of individual nuclei, and the Hamiltonian has the same eigenfunctions
(|α , |β in the case of a vertical ﬁeld B0) when applied to any of the individual spin wave function.
These eigenfunctions can be used as the same basis set for all spin wave functions (state vectors) of
individual magnetic moments. Using the same basis for vectors representing spins of diﬀerent nuclei
allows us to use two-dimensional operator matrices (for spin-1/2 nuclei) instead of multidimensional
operator matrices. Similar arguments can be applied to the Hamiltonian of magnetic moments in
magnetic ﬁelds in other directions.
Expected value A of a quantity A for a single nucleus can be calculated using Eq. 4.12 as a
trace of the following product of matrices:
A = Tr
cαc∗
α cαc∗
β
cβc∗
α cβc∗
β
A11 A12
A21 A22
. (6.1)
Expected value A of a quantity A for multiple nuclei with the same basis is
A = Tr
cα,1c∗
α,1 cα,1c∗
β,1
cβ,1c∗
α,1 cβ,1c∗
β,1
A11 A12
A21 A22
+
cα,2c∗
α,2 cα,2c∗
β,2
cβ,2c∗
α,2 cβ,2c∗
β,2
A11 A12
A21 A22
+ · · ·
= Tr
cα,1c∗
α,1 cα,1c∗
β,1
cβ,1c∗
α,1 cβ,1c∗
β,1
+
cα,2c∗
α,2 cα,2c∗
β,2
cβ,2c∗
α,2 cβ,2c∗
β,2
+ · · ·
A11 A12
A21 A22
= N Tr



cαc∗
α cαc∗
β
cβc∗
α cβc∗
β
ˆρ
A11 A12
A21 A22
ˆA



= N Tr ˆρ ˆA . (6.2)
The matrix ˆρ is the (probability) density matrix, the horizontal bar indicates average over the
whole ensemble of nuclei in the sample, and N is the number of non-interacting nuclei described in
the same operator basis.
Why probability density? Because the probability P = Ψ|Ψ , the operator of probability can be
written as the unit matrix ˆ1: Ψ|Ψ ≡ Ψ|ˆ1|Ψ . Therefore, the expectation value of probability can
be also calculated using Eq. 4.12 as Tr{ˆρˆ1} = Tr{ˆρ}.
The most important features of the mixed-state approach are listed below:
• Two-dimensional basis is suﬃcient for the whole set of N nuclei (if they do not interact with
each other).
6.2. POPULATIONS 167
Table 6.1: Examples of operators and a density matrix expressed in the same basis. The density matrix is shown
in red, the operators are shown in green. The elements of the density matrix are expressed in terms of the |ϑj, ϕj
states, as described in Section 6.7.4.
Description of units symbol explicit expression (linear combination of basis matrices)
mixed state 1 ˆρ 1×1
2
1 0
0 1
+ cos ϑ×1
2
1 0
0 −1
+ sin ϑ cos ϕ×1
2
0 1
1 0
+ sin ϑ sin ϕ×1
2
0 −i
i 0
angular momentum J s ˆIz 0×1
2
1 0
0 1
+ ×1
2
1 0
0 −1
+ 0×1
2
0 1
1 0
+ 0×1
2
0 −i
i 0
magnetic moment J T−1
ˆµz 0×1
2
1 0
0 1
+ γ ×1
2
1 0
0 −1
+ 0×1
2
0 1
1 0
+ 0×1
2
0 −i
i 0
energy J ˆH 0×1
2
1 0
0 1
+ γBz ×1
2
1 0
0 −1
+ γBx ×1
2
0 1
1 0
+ γBy ×1
2
0 −i
i 0
• Statistical approach: the possibility to use a 2D basis is paid by loosing the information about
the microscopic state. The same density matrix can describe an astronomic number of possible
combinations of individual angular momenta which give the same macroscopic result. What is
described by the density matrix is called the mixed state.
• Choice of the basis of the wave function is encoded in the deﬁnition of ˆρ (eigenfunctions of ˆIz).
• The state is described not by a vector, but by a matrix, ˆρ is a matrix like matrices representing
the operators.
• Any 2 × 2 matrix can be written as a linear combination of four 2 × 2 matrices. Such four
matrices can be used as a basis of all 2 × 2 matrices, including matrices representing operators
(in the same manner as two selected 2-component vectors serve as a basis for all 2-component
vectors). Examples of such linear combinations are presented in Table 6.1. Note that the
density matrix and the operators describe diﬀerent features, they are clearly distinguished by
the coeﬃcients of the linear combinations.
• A good choice of a basis is a set of orthonormal matrices.1
• Diagonal elements of ˆρ (or matrices with diagonal elements only) are known as populations.
They are discussed in Section 6.2.
• Oﬀ-diagonal elements (or matrices with diagonal elements only) are known as coherences. They
are discussed in Section 6.3.
6.2 Populations
Population is a somewhat confusing name of a diagonal element of the probability density matrix,
the correct physical interpretation is clearly described in L11.2.
1
Orthonormality for a set of four matrices ˆA1, ˆA2, ˆA3, ˆA4 can be deﬁned as Tr{ ˆA†
j
ˆAk} = δjk, where j and k ∈
{1, 2, 3, 4}, δjk = 1 for j = k and δjk = 0 for j = k, and ˆA†
j is an adjoint matrix of ˆAj, i.e., matrix obtained from ˆAj
by exchanging rows and columns and replacing all numbers with their complex conjugates.
168
• In a pure state, cαc∗
α is given by the amplitude of cα: cαc∗
α = |cα|2
.
• In a mixed state, the coeﬃcients cα,j are diﬀerent for the observed nucleus in each molecule j.
• The populations cαc∗
α and cβc∗
β are real numbers |cα|2 and |cβ|2, respectively, and their sum is
always one.2
• If cα,j and cβ,j describe stationary states, the populations cαc∗
α and cβc∗
β do not change in time.
• A population cαc∗
α > 1/2 describes longitudinal polarization, i.e. polarization of magnetic
moments in the z direction (the direction of B0), an excess of magnetic moments with positive
µz components. The sum of µz of all magnetic moments in the sample divided by the volume
of the sample is the z component of the bulk magnetization (Mz).
• The value cαc∗
α = 1/2 indicates no net polarization in the direction B0 (equal populations of
the α and β states). It does not indicate that all spins in the ensemble must be either in the
α state or in the β state! The value cαc∗
α = 1/2 describes equally well all combinations of
superposition states describing sets of magnetic moments pointing in all possible directions as
long as their vector sum has a zero z component. Probability that the system contains 50 %
spins in the α state and 50 % spins in the β state is actually negligible.
• When cαc∗
α is speciﬁed, cβc∗
β does not carry any additional information because its value is
already fully described by the cαc∗
α value: cβc∗
β = 1 − cαc∗
α. It also implies that the real number
cαc∗
α carries the same information as the matrix
cαc∗
α 0
0 cβc∗
β
=
cαc∗
α 0
0 1 − cαc∗
α
=
1
2
1 0
0 1
+ cαc∗
α −
1
2
1 0
0 −1
.
Consequently, longitudinal polarization is described equally well by the number cαc∗
α and by
the second term contributing to the displayed matrix.
• Graphical representations of quantum mechanical objects are helpful but not perfect. An attempt
to visualize the population cαc∗
α is presented in Figure 6.1. The polarization is depicted
as one possible distribution of magnetic moments and as a vector describing the bulk magnetization
as a result of the longitudinal polarization of magnetic moments.
6.3 Coherence
Coherence is a very important issue in NMR spectroscopy. It is discussed in K6.9, L11.2, C2.6.
• In a pure state, cβc∗
α is given by amplitudes and by the diﬀerence of phases of cα and cβ:
cβc∗
α = |cα||cβ|e−i(φα−φβ)
.
2
Note that
N
j=1(cα,jc∗
α,j + cβ,jc∗
β,j) = N . Therefore, cαc∗
α + cβc∗
β = 1.
6.3. COHERENCE 169
• In a mixed state, cα,j = |cα,j|eiφα,j
and cβ,j = |cβ,j|eiφβ,j are diﬀerent for the observed nucleus in
each molecule j.
• The coherence cβc∗
α is a complex number |A|e−iΦ
= |cα||cβ| · e−i(φα−φβ). Its amplitude |A| is
|cα||cβ| and its phase Φ is given by e−i(φα−φβ) = cos(φα − φβ) − i sin(φα − φβ).
• In general, the spin magnetic moment in individual molecules are present in various superposition
states corresponding to various linear combinations of the |α and |β eigenstates
(cα,j|α + cβ,j|β ). If there is no macroscopic relationship between the phases φα,j and φβ,j in
individual molecules, the diﬀerence φα,j − φβ,j can take any value in the interval (0, 2π) with
the same probability. Therefore, e−i(φα−φβ) = cos(φα − φβ)−i sin(φα − φβ) = 0+0 = 0 because
the average values of both sine and cosine values are zero in the interval (0, 2π). Obviously,
cβc∗
α = 0 in such a case, regardless of the amplitudes. Such an ensemble of states is called
incoherent superposition of the |α and |β eigenstates.
• If e−i(φα,j−φβ,j)
does not average to zero, a macroscopic relationship exists between the phases
φα,j and φβ,j. Such an ensemble of states is called coherent superposition of the |α and |β
eigenstates. This is why the term coherence is used for the oﬀ-diagonal elements of the density
matrix, whose non-zero values indicate coherent superposition of the |α and |β eigenstates,
or simply coherence of the system.
• The non-zero coherence cβc∗
α describes transverse polarization, i.e. polarization of magnetic
moments in the xy plane (a plane perpendicular to B0). The magnitude of the transverse
polarization is |cα||cβ| and its direction is given by the phase of cβc∗
α. Since the result of
polarization of magnetic moments is a bulk magnetization, the direction of the transverse
polarization can be described by the x and y components of the magnetization vector M:
Mx = |M⊥| cos Φ, My = |M⊥| sin Φ, where Φ is the phase of cβc∗
α and M⊥ = M2 − M2
z .
• If the evolution of the phases φα,j and φβ,j is coherent, the diﬀerences φα,j −φβ,j change in time,
but identically for all magnetic moments. In such a case, the coherence of the system persists
and cβc∗
α describes transverse polarization with a constant magnitude and in the direction
speciﬁed by the actual value of the phase Φ. Section 6.7.4 describes explicitly how the coherence
cβc∗
α depends on φα,j and φβ,j.
• cαc∗
β does not carry any additional information, it is just a complex conjugate of cβc∗
α. It also
implies that the complex number cβc∗
α carries the same information as the matrix
0 cαc∗
β
cβc∗
α 0
.
Consequently, the term coherence is used for the complex number cβc∗
α as well as for the
displayed matrix.
• As cβc∗
α is a complex number, it carries information of two real numbers, of its amplitude
and phase, or of its real and imaginary components |cα||cβ| cos Φ and i|cα||cβ| sin Φ. The same
170
M
B0
Figure 6.1: Pictorial representation of the populations cαc∗
α = 1/2 (left) and cαc∗
α > 1/2 (right). The populations
are depicted as distributions of magnetic moments (black) and as a magnetization vector (cyan) deﬁning the direction
of the longitudinal polarization.
information is encoded in purely real and purely imaginary matrices
|cα||cβ| cos Φ
0 1
1 0
i|cα||cβ| sin Φ
0 −1
1 0
.
• Graphical representation of the coherence cβc∗
α is shown in Figure 6.2.
6.4 Basis sets
Usual choices of basis matrices are (C2.7.2):
• Cartesian operators, equal to the operators of spin angular momentum divided by . In this
text, these matrices are written as Ix, Iy, Iz, It. In a similar fashion, we write H = ˆH/
for Hamiltonians with eigenvalues expressed in units of (angular) frequency, not energy. The
normalization factor
√
2 is often omitted (then the basis is still orthogonal, but not orthonor-
mal):
√
2It =
1
√
2
1 0
0 1
√
2Iz =
1
√
2
1 0
0 −1
√
2Ix =
1
√
2
0 1
1 0
√
2Iy =
1
√
2
0 −i
i 0
. (6.3)
6.5. LIOUVILLE-VON NEUMANN EQUATION 171
M
B0
Figure 6.2: Pictorial representation of the coherence cβc∗
α as a distribution of magnetic moments (black) and as a
magnetization vector (cyan) deﬁning the direction of the transverse polarization.
• Single-element population
Iα = It + Iz =
1 0
0 0
Iβ = It − Iz =
0 0
0 1
(6.4)
and transition operators
I+ = Ix + iIy =
0 1
0 0
I− = Ix − iIy =
0 0
1 0
. (6.5)
• A mixed basis
√
2It =
1
√
2
1 0
0 1
√
2Iz =
1
√
2
1 0
0 −1
I+ =
0 1
0 0
I− =
0 0
1 0
. (6.6)
6.5 Liouville-von Neumann equation
In order to describe the evolution of mixed states in time, we must ﬁnd an equation describing how
elements of the density matrix change in time. Derivation of such equation is nicely described in
C2.2.3 and reviewed in Section 6.7.5 of our text. The result is
dˆρ
dt
=
i
(ˆρ ˆH − ˆH ˆρ) =
i
[ˆρ, ˆH] = −
i
[ ˆH, ˆρ] (6.7)
or in the units of (angular) frequency
dˆρ
dt
= i(ˆρH − H ˆρ) = i[ˆρ, H ] = −i[H , ˆρ]. (6.8)
Eqs. 6.7 and 6.8 are known as the Liouville-von Neumann equation.
172
The Liouville-von Neumann equation can be solved using techniques of linear algebra. However,
a very simple geometric solution is possible (K7.3, C2.7.3, L11.8) if the Hamiltonian does not change
in time and consists solely of matrices which commute (e.g., It and Iz, but not Ix and Iz).
The evolution of ˆρ can be described as a rotation in an abstract three-dimensional operator space
with the dimensions given by Ix, Iy, and Iz, as shown in Section 6.7.6. An example is given in
Fig. 6.3.
If the operator Ij, deﬁning the density matrix ˆρ(t = 0) = cIj, and the operator Il, deﬁning
the Hamiltonian H = ωIl, satisfy the following commutation relation
[Ij, Ik] = iIl, (6.9)
where j, k, l ∈ {x, y, z}, then the density matrix evolves as
ˆρ = cIj −→ cIj cos(ωt) + cIk sin(ωt), (6.10)
which corresponds to a rotation about Il in an abstract three-dimensional space deﬁned by the
basis Ij, Ik, Il.
ˆρ
ωt
Iz
Iy
−Iz
−Ix
−Iy
Ix
H = ωIz
Figure 6.3: Evolution of the density matrix ˆρ = cIx cos(ωt) + cIy cos(ωt) under the inﬂuence of the Hamiltonian
H = ωIz visualized as a rotations in the space of operators Ix, Iy, Iz.
6.6 General strategy of analyzing NMR experiments
The Liouville-von Neumann equation is the most important tool in the analysis of evolution of the
spin system during the NMR experiment. The general strategy consists of three steps:
1. Deﬁne ˆρ at t = 0
2. Describe evolution of ˆρ using the relevant Hamiltonians – this is usually done in several steps
3. Calculate the expectation value of the measured quantity (magnetization components in the
x, y plane) according to Eq. 6.2
Obviously, the procedure requires knowledge of
6.6. GENERAL STRATEGY OF ANALYZING NMR EXPERIMENTS 173
1. relation(s) describing the initial state of the system (ˆρ(0))
2. all Hamiltonians
3. the operator representing the measurable quantity
In the next section, we start from the end and deﬁne ﬁrst the operator of the measurable quantity.
Then we spend a lot of time deﬁning all necessary Hamiltonians. Finally, we use the knowledge of
the Hamiltonians and basic thermodynamics to describe the initial state.
HOMEWORK
Following Section 6.7.6, and in particular Eq. 6.61, calculate the density matrix after 25 µs, starting
from the state Iy and evolving under the inﬂuence of the Hamiltonian H = ω0Iz, where ω0 =
π × 105
rad/s.
174
6.7 SUPPORTING INFORMATION
6.7.1 Indistinguishable particles
In classical mechanics, where particles are described by coordinates and momenta, two particles can be always distinguished by tracking
their coordinates. This is not possible in quantum mechanics, where particles are described by wave functions. For example, two electrons
in a hydrogen molecule are indistinguishable, it is not possible to tell which electron ”originally” belonged to which hydrogen atom. This
seemingly innocent quantum mechanical feature has dramatic consequences.
Let us investigate a set of three identical spin-1/2 particles, e.g. electrons. Their state is completely described by a wave function Ψ,
which depends on their coordinates and spin degrees of freedom:
Ψ(x1, y1, z1, cα1 , x2, y2, z2, cα2 , x3, y3, z3, cα3 ). (6.11)
The probability density that one particle is in a place and in a spin state described by the coordinates x1, y1, z1, cα1 , another one in
a place and in a spin state described by the coordinates x2, y2, z2, cα2 , and a third one in a place and in a spin state described by the
coordinates x3, y3, z3, cα3 is given by Ψ∗Ψ = |Ψ|2:
ρ = |Ψ(x1, y1, z1, cα1 , x2, y2, z2, cα2 , x3, y3, z3, cα3 )|2
. (6.12)
If the particles are indistinguishable, Ψ∗Ψ = |Ψ|2 should not be changed by exchanging the particles because we cannot say which
one is which.
ρ = |Ψ(x1, y1, z1, cα1 , x2, y2, z2, cα2 , x3, y3, z3, cα3 )|2
= |Ψ(x2, y2, z2, cα2 , x1, y1, z1, cα1 , x3, y3, z3, cα3 )|2
This is true only if the amplitude of Ψ is not aﬀected by the exchange. The phase of Ψ can diﬀer, but only in a limited way. If the
exchange x1, y1, z1, cα1 ↔ x2, y2, z2, cα2 changes Ψ to Ψei∆φ, then the second exchange x1, y1, z1, cα1 ↔ x2, y2, z2, cα2 must return Ψ to
its original form because we have returned to the initial state:
Ψei∆φ
→ Ψei∆φ
ei∆φ
= Ψei2∆φ
= Ψ ⇒ ei∆φ
= ±1. (6.13)
Therefore
Ψ(x1, y1, z1, cα1 , x2, y2, z2, cα2 , x3, y3, z3, cα3 ) = ±Ψ(x2, y2, z2, cα2 , x1, y1, z1, cα1 , x3, y3, z3, cα3 ). (6.14)
The wave functions for spin-1/2 particles always change the sign, they are called antisymmetric, whereas wave functions keeping
the sign upon particle exchange are called symmetric. Note that a possible solution of the Schr¨odinger’s equation may by a linear
combination of the ”correct” symmetric and antisymmetric wave functions, which is not symmetric or antisymmetric. Then, the symmetric
and antisymmetric wave functions, correctly describing the system, must be recovered by ﬁnding appropriate linear combinations of the
”wrong” solutions. For example, if our function Ψ is not symmetric or antisymmetric, we ﬁrst write all functions obtained by all possible
permutations (exchanges) of the coordinates:
no exchange : Ψ(x1, y1, z1, cα1 , x2, y2, z2, cα2 , x3, y3, z3, cα3 )
1 exchange : Ψ(x2, y2, z2, cα2 , x1, y1, z1, cα1 , x3, y3, z3, cα3 )
1 exchange : Ψ(x3, y3, z3, cα3 , x2, y2, z2, cα2 , x1, y1, z1, cα1 )
1 exchange : Ψ(x1, y1, z1, cα1 , x3, y3, z3, cα3 , x2, y2, z2, cα2 )
2 exchanges : Ψ(x2, y2, z2, cα2 , x3, y3, z3, cα3 , x1, y1, z1, cα1 )
2 exchanges : Ψ(x3, y3, z3, cα3 , x1, y1, z1, cα1 , x2, y2, z2, cα2 ) (6.15)
Then, the sum of all permuted wave functions is symmetric
Ψs
= +
1
√
6
Ψ(x1, y1, z1, cα1 , x2, y2, z2, cα2 , x3, y3, z3, cα3 )
+
1
√
6
Ψ(x2, y2, z2, cα2 , x1, y1, z1, cα1 , x3, y3, z3, cα3 )
+
1
√
6
Ψ(x3, y3, z3, cα3 , x2, y2, z2, cα2 , x1, y1, z1, cα1 )
+
1
√
6
Ψ(x1, y1, z1, cα1 , x3, y3, z3, cα3 , x2, y2, z2, cα2 )
+
1
√
6
Ψ(x2, y2, z2, cα2 , x3, y3, z3, cα3 , x1, y1, z1, cα1 )
+
1
√
6
Ψ(x3, y3, z3, cα3 , x1, y1, z1, cα1 , x2, y2, z2, cα2 ) (6.16)
6.7. SUPPORTING INFORMATION 175
and the sum of the permuted functions multiplied by (−1)n, where n is the number of exchanges, is antisymmetric
Ψa
= +
1
√
6
Ψ(x1, y1, z1, cα1 , x2, y2, z2, cα2 , x3, y3, z3, cα3 )
−
1
√
6
Ψ(x2, y2, z2, cα2 , x1, y1, z1, cα1 , x3, y3, z3, cα3 )
−
1
√
6
Ψ(x3, y3, z3, cα3 , x2, y2, z2, cα2 , x1, y1, z1, cα1 )
−
1
√
6
Ψ(x1, y1, z1, cα1 , x3, y3, z3, cα3 , x2, y2, z2, cα2 )
+
1
√
6
Ψ(x2, y2, z2, cα2 , x3, y3, z3, cα3 , x1, y1, z1, cα1 )
+
1
√
6
Ψ(x3, y3, z3, cα3 , x1, y1, z1, cα1 , x2, y2, z2, cα2 ). (6.17)
The factor 1/
√
6 is a normalization constant, used to obtain |Ψs|2 = |Ψa|2 = |Ψ|2. The symmetry of Ψs and antisymmetry of Ψa can
be checked easily. If we switch any pair of particles, the individual contributions Ψ may change. But the exchange of particles changes the
given Ψ to another Ψ, which is already present in the sum, with the same sign (in Ψs) or with the opposite sign (in Ψa). Therefore, the
exchange of particles does not change Ψs and changes all signs in Ψa.
The minus signs in Eq. 6.17 require that all indistinguishable particles in a system described by an antisymmetric wave must be
in diﬀerent quantum states (Pauli exclusion principle). E.g., if particles 1 and 2 in our three-particle set are in the same state, i.e., if
x1, y1, z1, cα1 = x2, y2, z2, cα2 , the lines 1 and 2, 3 and 6, and 4 and 5 in Eq. 6.17 cancel each other and the ﬁnal result is Ψa = 0.
Consequently, |Ψa|2 = 0 and the probability of ﬁnding the particles anywhere is zero.
Whereas the wave function of a set of indistinguishable particles can change its sign when the particles are exchanged, the Hamiltonian
acting on them must stay the same because the Hamiltonian represents the total energy which does not change if we exchange particles.
And because the evolution of Ψ is given by the Hamiltonian, a symmetric wave function remains symmetric and an antisymmetric wave
function remains antisymmetric during the evolution.
As described in Section 6.1, we usually separate the spatial and spin degrees of freedom:
Ψ = ψnon-spin(x1, y1, z1, x2, y2, z2, x3, y3, z3) · ψspin(cα1 , cα2 , cα3 ). (6.18)
Note that ψnon-spin must be symmetric and ψspin antisymmetric, to obtain an antisymmetric Ψ.
6.7.2 Separation of spin wave function
The separation of the spin wave function is trivial in the case of a free particle in the low-speed (i.e., low-energy) limit, as shown in
Section 5.4:
Ψ =
1
h3
· e
i pxx
· e
i pyy
· e
i pzz
·
cα
cβ
(6.19)
Here, we expressed the wave function as a product of the green vector describing the degree of freedom important in NMR spectroscopy,
and of a function dependent of the irrelevant degrees of freedom, shown in red.
In molecules, we ﬁrst have to be able to separate the nuclear component of the wave function from the electronic one. This is possible
if we assume that motions of the electrons in the orbitals are (i) much faster then evolution of the nuclear spin states3 and (ii) little aﬀected
by the magnetic moments of nuclei (i.e., if we assume that the magnetic ﬁelds of the nuclear magnetic moments are too weak to inﬂuence
motions of electrons). Then, we can use shapes4 of molecular orbitals as a static description of the distribution probability of electron
localization, independent of the actual state of the nuclear spin.
3
In the currently available NMR spectrometers, the frequency of the magnetic moment precession is ∼ 109 s−1. The velocity of the
electrons in atoms is not sharply deﬁned (a consequence of the commutation relation between ˆrj and ˆpj, known as the Heisenberg’s
uncertainty principle). Nevertheless, a rough estimate can be made. In a stationary set of bound particles described by the classical
mechanics, the total kinetic and potential energy are related as follows. Since our set of particles is stationary, the time derivative of the
quantity k(pk ·rk) is equal to zero. The time derivative can be expressed as k(dpk
dt
·rk +pk
drk
dt
) = k(Fk ·rk +mv2
k) = Epot −2Ekin = 0,
where rk is the position vector of the k-th particle, pk is its momentum, vk is its velocity, Fk is the force acting on it, Ekin and Epot are
the total kinetic and potential energy, respectively. In the case of the electron in the hydrogen atom, Epot = −Q2/(4π 0r), where Q is
the elementary charge and r is the electron-proton distance, related to the velocity by the uncertainty principle rjpj ∼ . Therefore,
mv2 ∼ mvQ2/(4π 0 ) ⇒ v ∼ Q2/(4π 0 ) ≈ c/137, where c is the speed of light. Considering the size of the atom (∼ 10−10 m), the
”frequency” of the electron is roughly ∼ 1016 s−1 in hydrogen and higher in heavier atoms.
4
Here, the word ”shape” is a synonym for values of the wave function dependent on the x, y, z coordinates in a coordinate frame
attached to the molecule, independent of the position and orientation of the molecule as a whole.
176
Second, we have to consider how the nuclear spin wave function depends on the coordinates of the nucleus (to see if the degree of
freedom describing the spin state can be separated from the degrees of freedom describing the position). Infrared spectra tell us that
vibrations of nuclei in molecules are much faster (roughly 1014 s−1) than the precession of magnetic moments (∼ 109 s−1). Therefore, we
can safely use coordinates describing averaged positions of nuclei in the molecule. Then, the molecule is deﬁned as a rigid object, and
the average coordinates of nuclei deﬁne the orientation of the molecule, but also the orientation of the cloud of electrons, discussed above.
Instead of investigating the eﬀects of magnetic moments on individual nuclei, it is suﬃcient to ask how the magnetic moments of nuclei
aﬀect the orientation of the molecule. The magnetic ﬁelds of the nuclear magnetic moments are weak (the energy of magnetic moments in
NMR spectrometers is much lower than the kinetic energy of molecules at the ambient temperature), and we can assume that the inﬂuence
of the magnetic moments on the orientation of molecules is negligible.5
At this moment, we have ﬁnished our discussion of the ﬁrst level of the simpliﬁcation of quantum mechanical description of magnetic
moments in molecules. The second level of simpliﬁcation is discussed in Section 6.7.3.
6.7.3 Separation of variables
Our task is to ﬁnd when a wave function ψspin depending on degrees of freedom of many spins6 can be treated as a product of wave
functions of individual spins ψspin = ψ(1) · ψ(2) · ψ(3) . . . , where ψ(j) depends only on the spin degree of freedom of the ﬁrst nucleus etc.
Such separation works if the Hamiltonian can be written as a sum of operators that act only on individual particles (on magnetic moments
of nuclei in individual molecules):
ˆHspin = ˆH(1)
+ ˆH(2)
+ ˆH(3)
+ . . . (6.20)
ˆHspinψspin = ( ˆH(1)
+ ˆH(2)
+ ˆH(3)
+. . . )ψ(1)
·ψ(2)
·ψ(3)
· · · = ψ(2)
·ψ(3)
· · ·· ˆH(1)
ψ(1)
+ψ(1)
·ψ(3)
· · ·· ˆH(2)
ψ(2)
+ψ(1)
·ψ(2)
· · ·· ˆH(3)
ψ(3)
+. . .
(6.21)
Let us assume (see Section 4.9.10)
ˆHspinψspin = Espinψspin. (6.22)
Then, expressing ψspin as the product ψ(1) · ψ(2) · ψ(3) . . . results in
ˆHspinψspin = ψ(2)
· ψ(3)
· · · · ˆH(1)
ψ(1)
+ ψ(1)
· ψ(3)
· · · · ˆH(2)
ψ(2)
+ ψ(1)
· ψ(2)
· · · · ˆH(3)
ψ(3)
+ · · · = Espinψ(1)
· ψ(2)
· ψ(3)
. . . (6.23)
If we divide both sides by ψspin = ψ(1) · ψ(2) · ψ(3) . . . ,
ˆH(1)ψ(1)
ψ(1)
+
ˆH(2)ψ(2)
ψ(2)
+
ˆH(3)ψ(3)
ψ(3)
+ · · · = Espin. (6.24)
The right-hand side is the constant Espin. Therefore, all terms ˆH(j)ψ(j)/ψ(j) must be constant if the equation is true for all values of
the spin degrees of freedom of all nuclei:
ˆH(1)ψ(1)
ψ(1)
= E(1)
,
ˆH(2)ψ(2)
ψ(2)
= E(2)
,
ˆH(3)ψ(3)
ψ(3)
= E(3)
, . . .
⇒ ˆH(1)
ψ(1)
= E(1)
ψ(1)
, ˆH(2)
ψ(2)
= E(2)
ψ(2)
, ˆH(3)
ψ(3)
= E(3)
ψ(3)
,
⇒ E(1)
+ E(2)
+ E(3)
+ · · · = Espin. (6.25)
If the nuclei are indistinguishable (see Section 6.7.1), all equations ˆH(j)ψ(j) = E(j)ψ(j) and the superscripts can be omitted
ˆHψ = Eψ. (6.26)
Nuclear magnetic moments in all molecules are now described by the same spin wave function ψ and by the same Hamiltonian ˆH with
eigenvalues Ej and eigenfunctions ψj. For example, we have shown (Section 5.4) that the Hamiltonian representing energy of a magnetic
moment in a vertical magnetic ﬁeld described by B0 is
−
γB0
2
1 0
0 −1
= ω0
2
1 0
0 −1
, (6.27)
its eigenfunctions are (after separation from the wave functions describing the dependence on x, y, z) the vectors
5
This is a very reasonable assumption in most cases. However, note that it is not true completely: if motions of the magnetic moments
and of the molecules were independent, it would be impossible to explain how the magnetic moments reach their equilibrium distribution.
6
We are now interested in the spin degrees of freedom, but the same arguments can be applied to any variables.
6.7. SUPPORTING INFORMATION 177
1
0
= |α ,
0
1
= |β , (6.28)
and its eigenvalues are
−
γB0
2
= +ω0
2
= Eα, +
γB0
2
= −ω0
2
= Eβ, (6.29)
respectively. This Hamiltonian and its eigenfunctions can be used to describe all nuclear magnetic moments of a macroscopic sample
if all consequences of interactions of individual magnetic moments can be described by modifying only the values Eα, Eβ to some Eα, Eβ
(actually, only the energy diﬀerences Eα − Eβ and Eα − Eβ are relevant). Such modiﬁcation may account for the shielding magnetic ﬁelds
by electrons, variation of the external ﬁeld B0 etc. The modiﬁcation should be general, i.e., we should be able to use a single expression
for Eα − Eβ of any magnetic moment in the sample.
6.7.4 Phases and coherences
The coherence cβc∗
α with the amplitude |cα||cβ| and with a phase Φ describes the transverse polarization of magnetic moments. In order
to analyze coherences explicitly, we use an eigenfunction of the operator representing angular momentum pointing in a general direction,
described by angles ϑ (inclination) and ϕ (azimuth), introduced in Section 5.7.10. The eigenfunction (cf. Eq. 5.210) is the following linear
combination (superposition) of the α and β eigenstates of ˆIz:
|ϑj, ϕj =
cos
ϑj
2
e−i
ϕj
2
sin
ϑj
2
e+i
ϕj
2
=
cα,j
cβ,j
= cα,j|α + cβ,j|β . (6.30)
If states of all magnetic moments in our ensemble are described by an eigenfunction of this form, the density matrix element cβc∗
α is
cβc∗
α = cos
ϑ
2
sin
ϑ
2
e+iϕ =
1
2
sin ϑe+iϕ. (6.31)
If the distributions of the angles ϑ and ϕ are independent,
cβc∗
α =
1
2
sin ϑ · e+iϕ. (6.32)
What is the physical interpretation of such density matrix elements? If the phase ϕ is the same for all magnetic moments of the
ensemble (it is never true in reality), the direction of the transverse polarization is given by Mx = |M⊥| cos ϕ and My = |M⊥| sin ϕ.
E.g., ϕ = 0 describes polarization of magnetic moments in the x direction, ϕ = π/2 describes polarization of magnetic moments in the y
direction, etc.
What deﬁnes the values of ϕj in real samples? In Section 5.7.11, we analyzed how the phases of the cα and cβ coeﬃcients evolve in a
magnetic ﬁeld described by the Hamiltonian ˆH = −γB0
ˆIz = ω0
ˆIz. We have found (Eqs. 5.215–5.216) that the phases of both coeﬃcients
rotate with the frequencies given by the eigenvalues of the Hamiltonian (Eα and Eβ):
cα(t) = cα(t = 0)e+i
γB0
2
t
= cos
ϑ
2
e−i
ϕ(t=0)
2 e+i
γB0
2
t
= cos
ϑ
2
e−i
ϕ(t=0)
2 e−i
ω0
2
t
= cos
ϑ
2
e−i
ϕ(t=0)
2 e+i Eα t
, (6.33)
cβ(t) = cβ(t = 0)e−i
γB0
2
t
= sin
ϑ
2
e+i
ϕ(t=0)
2 e−i
γB0
2
t
= sin
ϑ
2
e+i
ϕ(t=0)
2 e+i
ω0
2
t
= sin
ϑ
2
e+i
ϕ(t=0)
2 e−i
Eβ
t
, (6.34)
where we have used the explicit forms of cα(t = 0) and cβ(t = 0) for |ϑ, ϕ , (cf. Eq. 5.210). Note that the evolution in the magnetic
ﬁeld B0 changes only the azimuth ϕ, not the inclination ϑ.
If all magnetic moments experience the same magnetic ﬁeld B0, the coherence cβc∗
α evolves as
cβc∗
α =
1
2
sin ϑ e+iϕ(t=0) e+iω0t
, (6.35)
i.e., all azimuths ϕj evolve with the same angular frequency ω0.
We have described the evolution of the coherence, but we have not yet speciﬁed what deﬁnes the distributions of ϑj and ϕj(t = 0),
determining cβc∗
α at t = 0, i.e., 1
2
sin ϑ e+iϕ(t=0). The general answer is that the magnetic ﬁeld felt by the magnetic moments determines
the statistical distribution of ϑj and ϕj(t = 0). A quantitative analysis of various magnetic ﬁelds (the external static ﬁeld B0, the inﬂuence
of the electrons, the ﬁeld of the applied radio waves B0) is presented in the next lecture.7 At this moment, we only comment two results
that are derived in the next lecture.
7
Setting the beginning of the time scale is somewhat tricky. Therefore we start the analysis by deﬁning the elements of the density
matrix (the distribution of ϑj and ϕj) for a stationary macroscopic state, when the density matrix does not depend on time. Then we can
start to vary the magnetic ﬁelds and count the time from the ﬁrst applied change.
178
The ﬁrst example is an equilibrium ensemble of magnetic moments in B0. At the thermodynamic equilibrium, there is no preferred
azimuth of magnetic moments in the vertical ﬁeld B0. Therefore, the state of the system is an incoherent superposition of the eigenstates
α and β with e+iϕ(t=0) = 0 and consequently cβc∗
α = 0.
The second example is an ensemble of magnetic moments in B0 after applying a radio-wave pulse that rotated the bulk magnetization
to the direction y (cf. Figure 1.4). In such a case, Mx = |M| cos Φ = 0 and My = |M| sin Φ = |M|, telling us that Φ = π/2 immediately
after the pulse. Then, the phase factor starts to rotate with the frequency ω0 = −γB0:
eiΦ
= e+iϕ = e+iϕ(t=0) e+iω0t
= ei π
2 e+iω0t
= ei(π
2
+ω0t). (6.36)
Now only the magnitude 1
2
sin ϑ remains to be speciﬁed. In the next lecture, we derive (i) that the magnitude of the transverse
polarization after the pulse is equal to the longitudinal polarization before the pulse and (ii) that the longitudinal polarization at the
equilibrium is deﬁned by a statistical relation resembling the Boltzmann’s law of classical statistical mechanics.
6.7.5 From Schr¨odinger to Liouville-von Neumann equation
We start with the Schr¨odinger equation for a single spin in the matrix representation:
i
d
dt
cα
cβ
=
Hα,α Hα,β
Hβ,α Hβ,β
cα
cβ
=
Hα,αcα + Hα,βcβ
Hβ,αcα + Hβ,βcβ
. (6.37)
Note that the Hamiltonian matrix is written in a general form, the basis functions are not necessarily eigenfunctions of the operator.
However, the matrix must be Hermitian, i.e., Hj,k = H∗
k,j:
Hα,β = H∗
β,α Hβ,α = H∗
α,β. (6.38)
If we multiply Eq. 6.37 by the basis functions from left, we obtained the diﬀerential equations for cα and cβ (because the basis functions
are orthonormal):
( 1 0 )i
d
dt
cα
cβ
= i
dcα
dt
= Hα,αcα + Hα,βcβ (6.39)
( 0 1 )i
d
dt
cα
cβ
= i
dcβ
dt
= Hβ,αcα + Hβ,βcβ. (6.40)
In general,
dck
dt
= −
i
l
Hk,lcl (6.41)
and its complex conjugate (using Eq. 6.38) is
dc∗
k
dt
= +
i
l
H∗
k,lc∗
l = +
i
l
Hl,kc∗
l . (6.42)
Elements of the density matrix consist of the products cjc∗
k. Therefore, we must calculate
d(cjc∗
k)
dt
= cj
dc∗
k
dt
+ c∗
k
dcj
dt
=
i
l
Hl,kcjc∗
l −
i
l
Hj,lclc∗
k. (6.43)
For multiple nuclei with the same basis,
d(cj,1c∗
k,1 + cj,2c∗
k,2 + · · · )
dt
= cj,1
dc∗
k,1
dt
+ c∗
k,1
dcj,1
dt
+ cj,2
dc∗
k,2
dt
+ c∗
k,2
dcj,2
dt
+ · · · (6.44)
=
i
l
Hl,k(cj,1c∗
l,1 + cj,2c∗
l,2 + · · · ) −
i
l
Hj,l(cl,1c∗
k,1 + cl,2c∗
k,2 + · · · ). (6.45)
Note that
l
(cj,1c∗
l,1 + cj,2c∗
l,2 + · · · )Hl,k = N
l
ρj,lHl,k (6.46)
is the j, k element of the product N ˆρ ˆH, and
6.7. SUPPORTING INFORMATION 179
l
Hj,l(cl,1c∗
k,1 + cl,2c∗
k,2 + · · · ) = N
l
Hj,lρl,k (6.47)
is the j, k element of the product N ˆH ˆρ. Therefore, we can write the equation of motion for the whole density matrix as
dˆρ
dt
=
i
(ˆρ ˆH − ˆH ˆρ) =
i
[ˆρ, ˆH] = −
i
[ ˆH, ˆρ]. (6.48)
6.7.6 Rotation in operator space
Let us look at an example8 for H = εtIt + ω0Iz and ˆρ = cxIx + cyIy + czIz + ctIt.
Let us ﬁrst evaluate the commutators from the Liouville-von Neumann equation:
It is proportional to a unit matrix ⇒ it must commute with all matrices:
[It, Ij] = 0 (j = x, y, z, t). (6.49)
Commutators of Iz are given by the deﬁnition of angular momentum operators (Eqs. 4.35–4.38):
[Iz, Iz] = [Iz, It] = 0 [Iz, Ix] = iIy [Iz, Iy] = −iIx. (6.50)
Let us write the Liouville-von Neumann equation with the evaluated commutators:
dcx
dt
Ix +
dcy
dt
Iy +
dcz
dt
Iz +
dct
dt
It = i (−iω0cxIy + iω0cyIx) . (6.51)
Written in a matrix representation (noticing that cz and ct do not evolve because the czIz and ctIt components of the density matrix
commute with both matrices constituting the Hamiltonian),
dcx
dt
1
2
0 1
1 0
+
dcy
dt
1
2
0 −i
i 0
+ 0 + 0 = ω0cx
1
2
0 −i
i 0
− ω0cy
1
2
0 1
1 0
, (6.52)
1
2
0 dcx
dt
dcx
dt
0
+
1
2
0 −i
dcy
dt
i
dcy
dt
0
+ 0 + 0 =
i
2
0 −ω0cx
ω0cx 0
+
i
2
0 iω0cy
iω0cy 0
. (6.53)
Adding the matrices,
0
d(cx−icy)
dt
d(cx+icy)
dt
0
= iω0
0 −(cx − icy)
cx + icy 0
. (6.54)
This corresponds to a set of two diﬀerential equations
d(cx − icy)
dt
= −iω0(cx − icy) (6.55)
d(cx + icy)
dt
= +iω0(cx + icy) (6.56)
with the same structure as Eqs. 4.154 and 4.155. The solution is
cx − icy = (cx(0) − icy(0))e−iω0t
= c0e−i(ω0t+φ0)
(6.57)
cx + icy = (cx(0) + icy(0))e+iω0t
= c0e+i(ω0t+φ0)
(6.58)
with the amplitude c0 and phase φ0 given by the initial conditions. It corresponds to
cx = c0 cos(ω0t + φ0) (6.59)
cy = c0 sin(ω0t + φ0). (6.60)
We see that coeﬃcients cx, cy, cz play the same roles as coordinates rx, ry, rz in Eqs. 4.151–4.153, respectively, and operators Ix, Iy,
Iz play the same role as unit vectors ı, , k, deﬁning directions of the axes of the Cartesian coordinate system. Therefore, evolution of ˆρ in
our case can be described as a rotation of a three-dimensional vector consisting of the elements cx, cy, cz in an abstract three-dimensional
space deﬁned by Ix, Iy, and Iz. In our case, if φ = 0, then ˆρ(0) = c0Ix + czIz + ctIt and it evolves as
c0Ix + czIz + ctIt −→ c0Ix cos(ω0t) + c0Iy sin(ω0t) + czIz + ctIt. (6.61)
8
Various Hamiltonians encountered in NMR spectroscopy are discussed in the next lectures. At this moment, take H = εtIt + ω0Iz
just as an example.
180
Lecture 7
Chemical shift, one-pulse experiment
Literature: The operator of magnetization is described in C2.4.1, Hamiltonians discussed in L8,
thermal equilibrium in L11.3, C2.4.1, K6.8.6, relaxation due to the chemical shift in C5.4.4, K9.10
(very brieﬂy, the quantum approach to relaxation is usually introduced using dipole-dipole interactions
as an example). The one-pulse experiment is analyzed in K7.2.1, L11.11 and L11.12.
7.1 Operator of the observed quantity
The quantity observed in the NMR experiment is the bulk magnetization M, i.e., the sum of magnetic
moments of all nuclei divided by volume of the sample, assuming isotropic distribution of the nuclei in
the sample. Technically, we observe oscillations in the plane perpendicular to the homogeneous ﬁeld
of the magnet B0. The associated oscillations of the magnetic ﬁelds of nuclei induce electromotive
force in the detector coil, as described by Eq. 55. Since a complex signal is usually recorded (see
Section 3.2), the operator of complex magnetization M+ = Mx + iMy is used (M− = Mx − iMy can
be used as well).
ˆM+ = Nγ(ˆIx + iˆIy) = Nγ ˆI+, (7.1)
where N is the number of nuclei in the sample per unit volume.
7.2 Hamiltonian of the static ﬁeld B0
The Hamiltonian of the static homogeneous magnetic ﬁeld B0 can be easily derived from the classical
description of energy of a magnetic moment in a magnetic ﬁeld (Eq. 6):
E = −µ · B0. (7.2)
Since B0 deﬁnes direction of the z axis,
E = −µ · B0 = −µzB0 = −γB0Iz. (7.3)
Replacing the value of Iz (z-component of the spin angular momentum) by its operator provides
the Hamiltonian:
181
182
ˆH0,lab = −γB0
ˆIz. (7.4)
7.3 Hamiltonian of the radio ﬁeld B1
Using radio waves in NMR spectroscopy has two consequences. First, frequency of the radio waves
deﬁnes angular frequency of the rotating coordinate frame, used to describe evolution of the distribution
of magnetic moments in the presence, but also in the absence of radio waves. Second, radio
waves allow us to change the distribution of magnetic moments, described by the probability density
matrix ˆρ.
The oscillating magnetic ﬁeld of radio waves irradiating the sample is usually approximated by
a magnetic ﬁeld B1 rotating with the frequency of the radio waves ωradio (validity of such approximation
is discussed in Sections 5.7.13–5.7.17). Evolution of the density matrix is then described
in a coordinate frame rotating with the opposite angular frequency ωrot = −ωradio, as described in
Section 1.5.5. The x axis of the rotating coordinate frame is deﬁned by the direction of the B1 vector.
The phase φrot of this vector is given by the convention described in Section 1.5.5.
In the rotating coordinate system, frequency of the rotation of the coordinate frame1
is subtracted
from the precession frequency and the diﬀerence Ω = ω0 − ωrot = −γB0 − ωrot is the frequency oﬀset
deﬁning the evolution in the rotating frame in the absence of other ﬁelds:2
In the absence of other ﬁelds than B0,
ˆH0,rot = (−γB0 − ωrot)ˆIz = ΩˆIz. (7.5)
During irradiation by the radio wave, the magnetic ﬁeld of the radio wave inﬂuences the distribution
of magnetic moments described by ˆρ. The Hamiltonian contains an additional term ˆH1,rot
describing the eﬀect of the ﬁeld of the radio waves:
ˆH0,rot + ˆH1,rot = (−γB0 − ωrot)ˆIz − γB1
ˆIx = ΩˆIz + ω1
ˆIx. (7.6)
As the radio frequency ωradio (and consequently ωrot) should be close to the precession frequency
of the magnetic moments of the observed nuclei, we can assume |Ω| |γB0|. If the radio frequency
is very close to the resonance, −γB0 ≈ ωrot, Ω ω1, and the ˆIz component of the Hamiltonian can
be neglected.
The above description is suﬃcient for a one-dimensional experiment, discussed in this lecture.
However, radio waves are applied in several pulses in many NMR experiments. During diﬀerent
pulses, the phase of the radio waves is often shifted. In such a case, it is the phase of the ﬁrst pulse
which deﬁnes the x axis of the rotating coordinate frame. In order to be able to analyze the multiple
radio pulses later in our course, we now also describe the form of a Hamiltonian of the magnetic ﬁeld
aﬀecting the magnetic moments during irradiation by a wave shifted by π/2 from the phase of the
ﬁrst pulse:
1
Formally opposite to ωradio.
2
Note that eigenvalues of such Hamiltonian are not values of energy in the ﬁeld B0.
7.4. HAMILTONIAN OF CHEMICAL SHIFT 183
ˆH0,rot + ˆH1,rot = (−γB0 − ωrot)ˆIz − γB1
ˆIy = ΩˆIz + ω1
ˆIy. (7.7)
Note that such a radio wave (phase shifted by π/2 from the ﬁrst pulse) deﬁnes the direction of
the y axis of the rotating frame. Therefore, a pulse of such a wave is referred to as a y-pulse. In a
similar manner, we describe pulses of waves shifted by π or 3π/2 as −x or −y pulses, respectively.
7.4 Hamiltonian of chemical shift
In addition to the external ﬁeld, magnetic moments are also inﬂuenced by magnetic ﬁelds of electrons
in the molecules. In order to describe our ensemble of spin magnetic moments by a 2 × 2 density
matrix, the interactions with the electrons must modify only eigenvalues, not eigenfunctions of the
already introduced Hamiltonians. The concept of the chemical shift tensor, introduced during our
classical treatment of the magnetic ﬁelds of moving electrons in Section 1.4, allows us to include
the chemical shift into the already deﬁned Hamiltonians without changing their eigenfunctions. The
values of µx, µy, and µz in the classical equations are simply replaced by the operators ˆIx, ˆIy, and ˆIz:
ˆHδ = −γ(ˆIxBe,x + ˆIyBe,y + ˆIzBe,z) = −γ( ˆIx
ˆIy
ˆIz )


Be,x
Be,y
Be,z

 =
= −γ( ˆIx
ˆIy
ˆIz )


δxx δxy δxz
δyx δyy δyz
δzx δzy δzz




B0,x
B0,y
B0,z

 = −γ
ˆ
I · δ · B0. (7.8)
As we have also learnt in Section 1.4, we can decompose the chemical shift tensor δ into isotropic,
axially symmetric and asymmetric (rhombic) components. The corresponding decomposition of the
chemical shift Hamiltonian to ˆHδ,i, ˆHδ,a, and ˆHδ,r is presented in Section 7.10.1. The complete
Hamiltonian of a magnetic moment of a nucleus not interacting with magnetic moments of other
nuclei in the presence of the static ﬁeld B0 but in the absence of the radio waves is given by
ˆH = ˆH0,lab + ˆHδ,i + ˆHδ,a + ˆHδ,r. (7.9)
If we insert the explicit forms of ˆHδ,i, ˆHδ,a, and ˆHδ,r (Section 7.10.1) to Eq. 7.9, the Hamiltonian
including the chemical shift becomes very complicated. Fortunately, it can be simpliﬁed in many
cases, as we show in the following sections.
7.5 Secular approximation and averaging
• The components of the induced ﬁelds Be,x and Be,y are perpendicular to B0. The contributions
of ˆHδ,i are constant and the contributions of ˆHδ,a and ˆHδ,r ﬂuctuate with the molecular motions
changing values of ϕ, ϑ, and χ. Since the molecular motions do not resonate (in general) with
the precession frequency −γB0, the components Be,x
ˆIx and Be,y
ˆIy of the Hamiltonian oscillate
184
(in addition to ﬂuctuations due to the molecular motions) rapidly with a frequency close to
−γB0 in the rotating coordinate frame. These oscillations are much faster than the precession
about Be,x and Be,y (because the ﬁeld B0 is much larger than Be) and eﬀectively average to zero
on the timescale longer than 1/(γB0) (typically nanoseconds). Therefore, the Be,x
ˆIx and Be,y
ˆIy
terms can be neglected if the eﬀects on the longer timescales are studied. Such a simpliﬁcation
is known as secular approximation.3
The secular approximation simpliﬁes the (time-averaged)
Hamiltonian to
ˆH = −γB0(1 + δi + δa 3 cos2
ϑ − 1 + δr cos(2χ) sin2
ϑ )ˆIz (7.10)
• If the sample is an isotropic liquid, averaging over all molecules of the sample further simpliﬁes
the Hamiltonian. As no orientation of the molecule is preferred, all values of χ are equally
probable and independent of ϑ. Therefore, the last term in Eq. 7.10 is averaged to zero.
Moreover, average values of Z2
x = cos2
ϕ sin2
ϑ, of Z2
y = sin2
ϕ sin2
ϑ, and of Z2
z = cos2
ϑ must
be the same because none of the directions x, y, z is preferred. The consequence has been already
discussed when we described relaxation classically (Eq. 2.45 in Section 2.6.1): (3 cos2 ϑ − 1) = 0
and the anisotropic and rhombic contributions can be neglected.
The Hamiltonian describing the eﬀects of the static external magnetic ﬁeld and coherent eﬀects
of the electrons in isotropic liquids reduces to
ˆH = −γB0(1 + δi)ˆIz. (7.11)
Note that the described simpliﬁcations can be used only if they are applicable. Eq. 7.11 is valid
only in isotropic liquids, not in liquid crystals, stretched gels, polycrystalline powders, monocrystals,
etc.! Moreover, Eq. 7.11 does not describe relaxation processes, as discussed in Section 7.7.
7.6 Thermal equilibrium as the initial state
Knowledge of the Hamiltonian allows us to derive the density matrix at the beginning of the experiment.
Usually, we start from the thermal equilibrium. If the equilibrium is achieved, phases of
individual magnetic moments are random and the magnetic moments precess incoherently. Therefore,
the oﬀ-diagonal elements (coherences) of the equilibrium density matrix (proportional to Ix
and Iy) are equal to zero. Values of the diagonal elements (populations) are derived in Section 7.10.2
and the complete equilibrium density matrix is
3
In terms of quantum mechanics, eigenfunctions of Be,x
ˆIx and Be,y
ˆIy diﬀer from the eigenfunctions of ˆH0,lab (|α
and |β ). Therefore, the matrix representation of Be,x
ˆIx and Be,y
ˆIy contains oﬀ-diagonal elements. Terms proportional
to ˆIz represent so-called secular part of the Hamiltonian, which does not change the |α and |β states (because they
are eigenfunctions of ˆIz). Terms proportional to ˆIx and ˆIy are non-secular because they change the |α and |β
states (|α and |β are not eigenfunctions of ˆIx or ˆIy). However, eigenvalues of Be,x
ˆIx and Be,y
ˆIy, deﬁning the oﬀdiagonal
elements, are much smaller than the eigenvalues of ˆH0,lab (because the ﬁeld Be is much smaller than B0).
Secular approximation represents neglecting such small oﬀ-diagonal elements in the matrix representation of the total
Hamiltonian and keeping only the diagonal secular terms.
7.7. RELAXATION DUE TO CHEMICAL SHIFT ANISOTROPY 185
ˆρeq
=
1
2
+ γB0
4kBT
0
0 1
2
− γB0
4kBT
=
1
2
1 0
0 1
+
γB0
4kBT
1 0
0 −1
= It + κIz, (7.12)
where
κ =
γB0
2kBT
. (7.13)
Note that we derived the quantum description of a mixed state. The diﬀerence in two diagonal
elements (populations) of the density matrix describes longitudinal polarization of the magnetic
moments (their sum is equal to one by deﬁnition). Populations do not tell us anything about
microscopic states of individual magnetic moments. The two-dimensional density matrix does not
imply that all magnetic moments are in one of two eigenstates!
7.7 Relaxation due to chemical shift anisotropy
The simpliﬁed Eq. 7.11 does not describe the eﬀects of fast ﬂuctuations, resulting in relaxation. In
order to derive quantum description of relaxation caused by the chemical shift, the Liouville-von
Neumann equation must be solved for the complete Hamiltonian including the axial and rhombic
contributions. Bloch, Wangsness, and Redﬁeld developed a theory, described in Section 7.10.3, that
treats the magnetic moments quantum mechanically and their molecular surroundings classically.4
The theory provides the same deﬁnitions of the rate constants describing relaxation due to chemical
shift anisotropy as we derived classically in Section 2.6.1.
R1 =
3
4
b2 1
2
J(ω0) +
1
2
J(−ω0) ≈
3
4
b2
J(ω0), (7.14)
R2 = b2 1
2
J(0) +
3
8
J(ω0) ≈ R0 +
1
2
R1, (7.15)
where b = −2γB0δa and J(ω) is the spectral density function introduced in Section 2.3.
7.8 One-pulse experiment
Having the initial form of the density matrix, the Hamiltonians, and the operator of the measured
quantity, we can proceed and describe a real NMR experiment for a sample consisting of isolated
magnetic moments (not interacting with each other). The basic NMR experiment consists of two
parts. In the ﬁrst part, the radio-wave transmitter is switched on for a short time, needed to rotate
the magnetization to the plane perpendicular to the magnetic ﬁled B0. Such application of the radio
wave is called excitation pulse. In the second part, the radio-wave transmitter is switched oﬀ but the
receiver is switched on in order to detect rotation of the magnetization vector about the direction of
4
The surroundings can be also treated quantum mechanically, as described in Abragam: The principles of nuclear
magnetism, Oxford Press 1961, Chapter VIII, Section II.D.
186
B0. We start by describing the density matrix before the experiment, then we analyze evolution of
the density matrix during these two periods, evaluate the relaxation rate, and ﬁnally we calculate
the magnetization contributing to the detected signal.
Part 1: excitation by radio wave pulses
At the beginning of the experiment, the density matrix describes thermal equilibrium (Eq. 7.12):
ˆρ(0) = It + κIz. (7.16)
The Hamiltonian governing evolution of the system during the ﬁrst part of the experiment consists
of coherent and ﬂuctuating terms. The ﬂuctuating contributions result in relaxation, described by
the relaxation rates R1 and R2. The coherent contributions include
H = −γB0(1 + δi)Iz − γB1(1 + δi) cos(−ωradiot)Ix − γB1(1 + δi) sin(−ωradiot)Iy, (7.17)
where we have chosen the directions x and y so that φradiot) = 0 (see Section 1.5.5).
The Hamiltonian simpliﬁes in a coordinate system rotating with ωrot = −ωradio
H = (−γB0(1 + δi) − ωrot)
Ω
Iz + (−γB1(1 + δi))
ω1
Ix, (7.18)
but it still contains non-commuting terms (Ix vs. Iz). Let us check what can be neglected to
keep only commuting terms, which allows us to solve the Liouville-von Neumann equation using the
simple geometric approach.
• The value of ω1 deﬁnes how much of the magnetization is rotated to the x, y plane. The
maximum eﬀect is obtained for ω1τp = π/2, where τp is the length of the radio-wave pulse.
Typical values of τp for proton are approximately 10 µs, corresponding to frequency of rotation
of 25 kHz (90◦
rotation in 10 µs corresponds to 40 µs for a full circle, 1/40 µs = 25 kHz).
• Typical values of R1 are 10−1
s−1
to 100
s−1
and typical values of R2 are 10−1
s−1
to 102
s−1
for
protons in organic molecules and biomacromolecules. Therefore, eﬀects of relaxations can be
safely neglected during τp.
• When observing a single type of proton (or other nucleus), Ω can be set to zero by the choice of
ωradio. However, variation of Ω is what we observe in real samples, containing protons (or other
nuclei) with various δi. The typical range of proton δi is 10 ppm, corresponding to 5 kHz at a
500 MHz spectrometer.5
The carrier frequency ωradio is often set to the precession frequency of
the solvent. In the case of water, it is roughly in the middle of the spectrum (4.7 ppm at pH
7). So, we need to cover ±2.5 kHz. We see that |Ω| < |ω1|, but the ratio is only 10 % at the
edge of the spectrum.
5
Chosen as a compromise here: spectra of small molecules are usually recored at 300 MHz–500 MHz, while spectra
of biomacromolecules are recorded at ≥ 500 MHz.
7.8. ONE-PULSE EXPERIMENT 187
In summary, we see that we can safely ignore ﬂuctuating contributions, but we must be careful
when neglecting ΩIz. The latter approximation allows us to use the geometric solution of the
Liouville-von Neumann equation, but is deﬁnitely not perfect for larger Ω resulting in oﬀset eﬀects.
Using the simpliﬁed Hamiltonian H = ω1Ix, evolution of ˆρ during τp can be described as a
rotation about the ”Ix axis”:
ˆρ(0) = It + κIz −→ ˆρ(τp) = It + κ(Iz cos(ω1τp) − Iy sin(ω1τp)). (7.19)
For a 90◦
pulse,
ˆρ(τp) = It − κIy. (7.20)
Part 2: evolution of chemical shift after excitation
After switching oﬀ the transmitter, ω1Ix disappears from the Hamiltonian, which now contains
only commuting terms. On the other hand, signal is typically acquired for a relatively long time
(0.1 s to 10 s) to achieve a good frequency resolution. Therefore, the relaxation eﬀects cannot be
neglected.
The coherent evolution can be described as a rotation about the ”Iz axis” with the angular
frequency Ω
ˆρ(t) = It + κ(−Iy cos(Ωt) + Ix sin(Ωt)). (7.21)
The measured quantity M+ can be expressed as (Eq. 4.12)
M+ = Tr{ˆρ(t) ˆM+} = Nγ Tr{(It + κ(−Iy cos(Ωt) + Ix sin(Ωt))I+} (7.22)
= Nγ Tr{ItI+} − Nγ κ cos(Ωt)Tr{IyI+} + Nγ κ sin(Ωt)Tr{IxI+}. (7.23)
The ﬁnal expression includes the following three traces:
Tr{ItI+} = Tr
1
2
0
0 1
2
0 1
0 0
= Tr
0 1
2
0 0
= 0 (7.24)
Tr{IxI+} = Tr
0 1
2
1
2
0
0 1
0 0
= Tr
0 0
0 1
2
=
1
2
(7.25)
Tr{IyI+} = Tr
0 − i
2
i
2
0
0 1
0 0
= Tr
0 0
0 i
2
=
i
2
(7.26)
As mentioned above, relaxation eﬀects should be taken into account when analyzing acquisition
of the NMR signal. Including the exponential relaxation term and expressing κ
M+ =
Nγ2 2
B0
4kBT
e−R2t
(sin(Ωt) − i cos(Ωt)). (7.27)
which can be rewritten as
M+ =
Nγ2 2
B0
4kBT
e−R2t
cos Ωt −
π
2
+ i sin Ωt −
π
2
=
Nγ2 2
B0
4kBT
e−R2t
eiΩt
e−i π
2 . (7.28)
188
We know that in order to obtain purely Lorentzian (absorption) real component of the spectrum
by Fourier transformation, the signal should evolve as e−R2t
eiΩt
. We see that magnetization described
by Eq. 7.28 is shifted from the ideal signal by a phase of −π/2. However, this is true only if the
evolution starts exactly at t = 0. In practice, this is impossible to achieve for various technical
reasons (instrumental delays and phase shifts, evolution starts already during τp, etc.). Therefore,
the rotation has an unknown phase shift φ (including the π/2 shift among other contributions),
which is removed by an empirical correction during signal processing (corresponding to multiplying
Eq. 7.28 by eiπ/2
). It tells us that we can ignore the phase shift and write the phase-corrected signal
as
M+ =
Nγ2 2
B0
4kBT
e−R2t
(cos(Ωt) + i sin(Ωt)) =
Nγ2 2
B0
4kBT
e−R2t
eiΩt
. (7.29)
Knowing the expected magnetization, we can try to describe the one-dimensional NMR spectrum
quantitatively. Factors that should be taken into account are listed and analyzed in Sections 7.10.4
and 7.10.5. The analysis shows that the signal-to-noise ratio is proportional to γ5/2
B
3/2
0 and further
inﬂuenced by relaxation, that strongly depends on the temperature.
7.9 Conclusions
In general, the analysis of an ideal one-pulse experiment leads to the following conclusions:
• The analysis of a one-pulse NMR experiment shows that the density matrix evolves as
ˆρ(t) ∝ Ix cos(Ωt + φ) + Iy sin(Ωt + φ) + terms orthogonal to I+, (7.30)
and that the magnetization rotates during signal acquisition as
M+ = |M+|e−R2t
eiΩt
(7.31)
(with some unimportant phase shift which is empirically corrected).
• Fourier transform gives a complex signal proportional to
Nγ2 2
B0
4kBT
R2
R2
2 + (ω − Ω)2
− i
ω − Ω
R2
2 + (ω − Ω)2
. (7.32)
• The cosine modulation of Ix can be taken as the real component of the signal and the sine
modulation of Iy can be taken as the imaginary component of the signal:
7.9. CONCLUSIONS 189
Signalinchannel1:{y(t)}
t
Signalinchannel2:{y(t)}
t
After Fourier transformation:
{Y(ω)}
ω
Ω
{Y(ω)}
ω
Ω
• The signal-to-noise ratio (without relaxation) is proportional to |γ|5/2
B
3/2
0 , with the optimal
temperature given by relaxation properties (close to room temperatures for proteins in aqueous
solutions).
HOMEWORK
Analyze the One-pulse experiment (Section 7.8) and make sure that you understand the conclusions
presented in Section 7.9.
190
7.10 SUPPORTING INFORMATION
7.10.1 Decomposition of chemical shift Hamiltonian
The Hamiltonian of a homogeneous magnetic ﬁeld aligned with the z-axis of the coordinate frame can be decomposed into
• isotropic contribution, independent of rotation in space:
ˆHδ,i = −γB0δi(ˆIz) (7.33)
• axial component, dependent on ϕ and ϑ:
ˆHδ,a = −γB0δa(3 sin ϑ cos ϑ cos ϕˆIx + 3 sin ϑ cos ϑ sin ϕˆIy + (3 cos2
ϑ − 1)ˆIz)
= −γB0δa(3ZxZz
ˆIx + 3ZyZz
ˆIy + (3Z2
z − 1)ˆIz) (7.34)
• rhombic component, dependent on ϕ, ϑ, and χ:
ˆHδ,r = −γB0δr( (−(2 cos2
χ − 1) sin ϑ cos ϑ cos ϕ + 2 sin χ cos χ sin ϑ cos ϑ sin ϕ)ˆIx +
(−(2 cos2
χ − 1) sin ϑ cos ϑ sin ϕ − 2 sin χ cos χ sin ϑ cos ϑ cos ϕ)ˆIy +
((2 cos2
χ − 1) sin2
ϑ)ˆIz)
= γB0δr((cos(2χ)Zx − sin(2χ)Zy)Zz
ˆIx + (cos(2χ)Zy + sin(2χ)Zx)Zz
ˆIy + cos(2χ)(Z2
z − 1)ˆIz) (7.35)
7.10.2 Density matrix in thermal equilibrium
We use the mixed state approach to deﬁne the state of the sample in thermal equilibrium. In the large ensemble of nuclei observed in
NMR, the equilibrium distribution of magnetic moments is such that orientations in the x and y directions are equally probable, and the
orientation in the z direction (deﬁned by the direction of the magnetic induction of the external homogeneous ﬁeld B0) is slightly favored.
Classically, energy of individual moments depends only on µz:
Ej = −µj · B0 = −µz,jB0, (7.36)
where j identiﬁes the molecule with the observed nuclear magnetic moment, and the overall energy is j Ej.
Quantum mechanically, the ensemble of magnetic moments represents a mixed state and the expected value of the energy is given by
Eq. 6.2, where A = E and ˆA = ˆH. Note that Eq. 6.2 contains an operator (in our case the Hamiltonian) representing the quantity
of interest (in our case the energy) for a single magnetic moment, although we calculate the expected value for the whole ensemble. If
we use eigenfunctions of ˆIz as the basis (the best choice for magnetic moments in the ﬁled with B0 deﬁning the z axis), eigenvalues of
H = −γB0(1 + δi)ˆIz are the diagonal elements of the matrix representation of ˆH:
ˆH = −γB0(1 + δi)ˆIz = −γB0(1 + δi)
2
1 0
0 −1
=
−γB0(1 + δi) 2
0
0 +γB0(1 + δi) 2
=
Eα 0
0 Eβ
. (7.37)
Eq. 6.2 in this case has the form
E = NTr
cαc∗
α cαc∗
β
cβc∗
α cβc∗
β
ˆρ
Eα 0
0 Eβ
ˆH
= N cαc∗
α Eα + cβc∗
β Eβ = N Pα Eα + Pβ Eβ . (7.38)
We see that the expected value of the energy of our mixed state is a weighted average of the energies of the α and β eigenstates of a
single magnetic moment. The oﬀ-diagonal elements of ˆρ, the populations, play a role of statistical weights in the derived relation. At the
equilibrium, the populations can be evaluated using statistical arguments similar to the Boltzmann law in the classical molecular statistics:
Peq
α =
e−Eα/kBT
e−Eα/kBT + e−Eβ /kBT
, (7.39)
Peq
β =
e−Eβ /kBT
e−Eα/kBT + e−Eβ /kBT
, (7.40)
7.10. SUPPORTING INFORMATION 191
where kB = 1.38064852 × 10−23 m2 kg s−2 K−1 is the Boltzmann constant.
The thermal energy at 0 ◦C is more than 10 000 times higher than γB0 /2 for the most sensitive nuclei (protons) at spectrometers
with the highest magnetic ﬁelds (1.2 GHz in 2021). The eﬀect of the chemical shift is four orders of magnitude lower (roughly 10−8kBT).
We see that (i) the eﬀect of the chemical shift δi on Eα and Eβ can be safely neglected, and (ii) that the values in the exponents are much
lesser than unity. Therefore, we can approximate the exponential terms by a linear expansion
e
±
γB0(1+δi)
kBT
≈ 1 ±
γB0
2kBT
(7.41)
and calculate the populations as
Peq
α =
e−Eα/kBT
e−Eα/kBT + e−Eβ /kBT
=
1 + γB0
2kBT
1 + γB0
2kBT
+ 1 − γB0
2kBT
=
1 + γB0
2kBT
2
, (7.42)
Peq
β =
e−Eβ /kBT
e−Eα/kBT + e−Eβ /kBT
=
1 − γB0
2kBT
1 + γB0
2kBT
+ 1 − γB0
2kBT
=
1 − γB0
2kBT
2
. (7.43)
7.10.3 Bloch-Wangsness-Redﬁeld theory
The Liouville-von Neumann equation describing the relaxing system of magnetic moments interacting with moving electrons in a so-called
interaction frame (corresponding to the rotating coordinate frame in the classical description) has the form
d∆ˆρ
dt
= −
i
[ ˆHδ,a + ˆHδ,r, ∆ˆρ], (7.44)
where ˆHδ,a and ˆHδ,r are deﬁned by Eqs. 7.34 and 7.35, respectively, and ∆ˆρ is a diﬀerence (expressed in the interaction frame)
between density matrix at the given time and density matrix in the thermodynamic equilibrium. Writing ∆ˆρ in the same bases as used
for the Hamiltoninan,
∆ˆρ = dt
ˆIt + dz
ˆIz + d+
ˆI+eiω0t
+ d−
ˆI−e−iω0t
. (7.45)
If the chemical shift is axially symmetric and its size or shape do not change,
d(dz
ˆIz + d+
ˆI+eiω0t + d−
ˆI−e−iω0t)
dt
= −
ib
cz ˆIz +
3
8
c+ ˆI+eiω0t
+
3
8
c− ˆI−e−iω0t
, dz
ˆIz + d+
ˆI+eiω0t
+ d−
ˆI−e−iω0t
, (7.46)
where ˆI±e±iω0t are operators ˆI± = ˆIx ± iˆIy in the interaction frame, ω0 = −γB0(1 + δa), b = −2γB0δa, and
cz
=
1
2
(3 cos2
ϑ − 1) = Θ (7.47)
c+
=
3
2
sin ϑ cos ϑe−iϕ
=
2
3
Θ⊥
e−iϕ
(7.48)
c−
=
3
2
sin ϑ cos ϑe+iϕ
=
2
3
Θ⊥
e+iϕ
(7.49)
Analogically to the classical analysis, the evolution can be written as
d∆ˆρ
dt
= −
1
2
∞ˆ
0
[ ˆHδ,a(0), [ ˆHδ,a(t), ∆ˆρ]]dt. (7.50)
The right-hand side can be simpliﬁed dramatically by the secular approximation: all terms with e±iω0t are averaged to zero because
they rapidly oscillate with the angular frequency ω0. Only terms with (cz)2 and c+c− are non zero (both equal to 1/5 at tj = 0).6
These are the terms with [ˆIz, [ˆIz, ∆ˆρ]], [ˆI+, [ˆI−, ∆ˆρ]], and [ˆI−, [ˆI+, ∆ˆρ]]. Moreover, averaging over all molecules makes all three correlation
functions identical in isotropic liquids: cz(0)cz(t) = c+(0)c−(t) = c−(0)c+(t) = c(0)c(t).
In order to proceed, the double commutators must be expressed. We start with
[ˆIz, ˆI±] = [ˆIz, ˆIx] ± i[ˆIz, ˆIy] = ± (ˆIx ± iˆIy) = ± ˆI± (7.51)
6
We have factored out 3/8 in order to make c+c− = (cz)2.
192
and
[ˆI+, ˆI−] = [ˆIx, ˆIx] − i[ˆIx, ˆIy] + i[ˆIy, ˆIx] + [ˆIy, ˆIy] = 2 ˆIz. (7.52)
Our goal is to calculate relaxation rates for the expectation values of components parallel (Mz) and perpendicular (M+ or M−) to
B0.
Let us start with Mz. According to Eq. 4.12,
∆Mz = Tr{∆ˆρ ˆMz} (7.53)
where ∆ Mz is the diﬀerence from the expectation value of Mz in equilibrium. The operator of Mz for one magnetic moment observed
is (Eq. 7.1)
ˆMz = Nγ ˆIz, (7.54)
where N is the number of molecules per volume element detected by the spectrometer. Since the basis matrices are orthogonal,
products of ˆIz with the components of the density matrix diﬀerent from ˆIz are equal to zero and the left-hand side of Eq. 7.50 reduces to
ddz
dt
ˆIz (7.55)
when calculating relaxation rate of Mz . In the right-hand side, we need to calculate three double commutators:
[ˆIz, [ˆIz, ˆIz]] = 0 [ˆI+, [ˆI−, ˆIz]] = 2 2 ˆIz [ˆI−, [ˆI+, ˆIz]] = 2 2 ˆIz (7.56)
After substituting into Eq. 7.50,
ddz
dt
Tr{ˆIz
ˆIz} = −

 3
4
b2
∞ˆ
0
c+(0)c−(t)eiω0t
dt +
3
4
b2
∞ˆ
0
c−(0)c+(t)e−iω0t
dt

 dzTr{ˆIz
ˆIz} (7.57)
d∆ Mz
dt
= −

 3
4
b2
∞ˆ
0
c+(0)c−(t)eiω0t
dt +
3
4
b2
∞ˆ
0
c−(0)c+(t)e−iω0t
dt

 ∆ Mz (7.58)
The relaxation rate R1 for Mz, known as longitudinal relaxation rate in the literature, is the real part of the expression in the
parentheses
R1 =
3
4
b2



∞ˆ
0
c+(0)c−(t)eiω0t
dt +
∞ˆ
0
c−(0)c+(t)e−iω0t
dt



(7.59)
As already discussed in the classical description of relaxation, if the ﬂuctuations are random, they are also stationary: the current
orientation of the molecule is correlated with the orientation in the past in the same manner as it is correlated with the orientation in the
future. Therefore,
∞ˆ
0
c+(0)c−(t)eiω0t
dt =
1
2


∞ˆ
0
c+(0)c−(t)eiω0t
dt +
0ˆ
−∞
c+(0)c−(t)eiω0t
dt

 =
1
2
∞ˆ
−∞
c+(0)c−(t)eiω0t
dt. (7.60)
∞ˆ
0
c−(0)c+(t)e−iω0t
dt =
1
2


∞ˆ
0
c−(0)c+(t)e−iω0t
dt +
0ˆ
−∞
c−(0)c+(t)e−iω0t
dt

 =
1
2
∞ˆ
−∞
c−(0)c+(t)e−iω0t
dt, (7.61)
The right-hand side integrals are identical with the mathematical deﬁnition of the Fourier transform of the correlation functions and
real parts of such Fourier transforms are the spectral density functions J(ω).
The relaxation rate R1 can be therefore written in the same form as derived classically:
R1 =
3
4
b2 1
2
J(ω0) +
1
2
J(−ω0) ≈
3
4
b2
J(ω0) (7.62)
What is the physical interpretation of the obtained equation? Relaxation of Mz is given by the correlation functions (c+(0)c−(t) and
c−(0)c+(t) describing ﬂuctuations of the components of the chemical shift tensor perpendicular to B0 (in the case of an axially symmetric
tensor, of the Zx and Zy components of the vector deﬁning the symmetry axis Z). Such ﬂuctuating ﬁelds resemble the radio waves with
B1 ⊥ B0. If the frequency of such ﬂuctuations matches the precession frequency ω0, the resonance condition is fulﬁlled and, for a short time
(comparable to the frequency of molecular collisions) when a ﬂuctuation accidentally resonates with ω0, the −γBe,x
ˆIx and/or −γBe,x
ˆIy
components of the chemical shift Hamiltonian are not completely removed by the secular approximation. In analogy to Eq. 7.19, the Iz
component of ˆρ (and consequently Mz ) slightly changes due to −γBe,x
ˆIx and/or −γBe,x
ˆIy.
7.10. SUPPORTING INFORMATION 193
If the molecular motions are assumed to be completely random and independent of the distribution of magnetic moments, Mz is
expected to decay to zero, which does not happen in reality. If the coupling between molecular motions and magnetic moment distribution
is described correctly by the quantum theory (see footnote 4), a correlation function is obtained that describes correctly the return of ˆρ to
its equilibrium form.7 This drives the system back to the equilibrium distribution of magnetic moments.
Let us continue with M+. According to Eq. 4.12,
∆ M+ ≡ M+ = Tr{∆ˆρ ˆM+} (7.63)
The expectation value of M+ in equilibrium is zero, this is why we do not need to calculate the diﬀerence for M+ and why we did
not calculate the diﬀerence in the classical analysis.
The operator of M+ for one magnetic moment observed is
ˆM+ = Nγ ˆI+ = Nγ(ˆIx + iˆIy). (7.64)
Due to the orthogonality of basis matrices, the left-hand side of Eq. 7.50 reduces to
dd+
dt
ˆI+eiω0t
(7.65)
when calculating relaxation rate of ∆ M+ ≡ M+ . In the right-hand side, we need to calculate three double commutators:
[ˆIz, [ˆIz, ˆI+]] = 2 ˆI+ [ˆI+, [ˆI−, ˆI+]] = 2 2 ˆI+ [ˆI−, [ˆI+, ˆI+]] = 0. (7.66)
After substituting into Eq. 7.50,
dd+
dt
Tr{ˆI−
ˆI+} = −

b2
∞ˆ
0
cz(0)cz(t)dt +
3
4
b2
∞ˆ
0
c+(0)c−(t)eiω0t
dt

 d+Tr{ˆI−
ˆI+} (7.67)
d M+
dt
= −

b2
∞ˆ
0
cz(0)cz(t)dt +
3
4
b2
∞ˆ
0
c+(0)c−(t)eiω0t
dt

 M+ (7.68)
The relaxation rate R2 for M+, known as transverse relaxation rate in the literature, is the real part of the expression in the
parentheses.
R2 = b2
∞ˆ
0
cz(0)cz(t)dt +



3
4
b2
∞ˆ
0
c+(0)c−(t)eiω0t
dt



. (7.69)
Note that the ﬁrst integral in 7.69 is a real number, equal to R0 derived by the classical analysis.
Using the same arguments as for Mz,
R2 = b2 1
2
J(0) +
3
8
J(ω0) ≈ R0 +
1
2
R1. (7.70)
What is the physical interpretation of the obtained equation? Two terms in Eq. 7.70 describe two processes contributing to the
relaxation of M+. The ﬁrst one is the loss of coherence with the rate R0, given by the correlation function cz(0)cz(t) and describing
ﬂuctuations of the components of the chemical shift tensor parallel with B0 (of Zz). This contribution was analyzed in Section 2.6.1 using
the classical approach. The second contribution is due to ﬂuctuations of the components of the chemical shift tensor perpendicular to
B0 (of Zx and Zy), returning the magnetization vector M to its direction in the thermodynamic equilibrium. These ﬂuctuations renew
the equilibrium value of Mz, as described above, but also make the Mx and My components to disappear. Note however, that only one
correlation function (c+(0)c−(t)) contributes to the relaxation of M+, while both c+(0)c−(t) and c−(0)c+(t) contribute to the relaxation
of Mz. Therefore only R1/2, not R1, contributes to R2. If we deﬁned R2 as a relaxation rate of M−, only c−(0)c+(t) would contribute.8
7
It can be described as J(ω0) = e− ω0/kBT
J(−ω0). In the semi-classical Bloch-Wangsness-Redﬁeld theory, this is
taken into account by working with ∆ˆρ and ∆Mz instead of ˆρ and Mz .
8
Fluctuations with frequency +ω0 aﬀect M+ and ﬂuctuations with frequency −ω0 aﬀect M−, but both aﬀect Mz.
Alternatively, we could deﬁne R2 as a relaxation rate of Mx or My. Fluctuations of the Be,y component aﬀect Mx
but not My, while ﬂuctuations of the Be,x component aﬀect My but not Mx. On the other hand, both ﬂuctuations of
Be,x and Be,y aﬀect Mz. Working with M+, M− or Mx, My, the relaxation of Mz due to Be,x and Be,y is always twice
faster.
194
7.10.4 Thermal noise of electrical circuits
All NMR measurements are inﬂuenced by the thermal noise (also called Johnson noise) generated by random motions of electrons inside
electric conductors. In order to analyze the thermal noise, we use a simple model circuit, consisting of two electric devices connected by a
cable of the length a. As we are interested only in the eﬀect of the thermal noise, which is described by the electric resistance, we can view
the devices as two resistors R1 and R2. For the sake of simplicity, we assume that both resistors have the same resistance R1 = R2 = R
and that the cable does not radiate any energy.9 The thermal motions of electrons in the resistor R1 generate a randomly ﬂuctuating
voltage U1(t) that propagates through the cable to the resistor R2. If the cable does not irradiate any energy, all electric energy associated
with U1(t) is absorbed by R2. The ﬂuctuations of U1(t) travelling through the cable can be decomposed into a series of standing waves
(vibration modes) along the cable. The permitted wavelengths of the standing waves are
λj =
2a
j
, (7.71)
where j is an integer number.10 The corresponding allowed frequencies are
fj =
c
2a
j, (7.72)
where c is the speed of the wave propagation. In NMR spectroscopy, we observe frequency only in a certain window, selected by the
band-pass ﬁlter. Therefore, we are interested only the vibration modes in this window. If the window is deﬁned as a range between f0 and
f0 + ∆f, the number of modes in the window is equal to 2a∆f/c.
Statistical thermodynamics shows that the thermal energy of a single mode is (approximately,11 for suﬃciently low frequencies) equal
to kBT. The thermal energy of all modes within the frequency window ∆f is 2akBT∆f/c. Half of this energy (i.e., akBT∆f/c) is generated
by the resistor R1, transmitted by the cable, and absorbed by R2. The energy arrives at R2 in the time ∆t = a/c. The power transmitted
in this time is
P =
akBT∆f/c
∆t
=
akBT∆f/c
a/c
= kBT∆f. (7.73)
The power can be also described using the resistance. In general, P = UI and, applying the Ohm’s law (U = RI), P = RI2. The
voltage U1 generated by the thermal motions of electrons in R1 produces the current
I =
U1
R1 + R2
=
U1
2R
(7.74)
and consequently the power
P = RI2
= R
U1
2R
2
=
U2
1
4R
. (7.75)
By comparing Eqs. 7.73 and 7.75, U2
1 can be expressed as U2
1 = 4RkBT∆f. As U1 represents the noise voltage, we can deﬁne the
mean-squared noise voltage as
U2
noise = 4RkBT∆f. (7.76)
7.10.5 Spectrum and signal-to-noise ratio
In order to describe the one-dimensional NMR spectrum quantitatively, we need to know
1. how is the detected signal related to the magnetization. Here, we analyze a simple experimental setup with a detector coil
perpendicular to the external ﬁeld, and suﬃciently far from the sample. In this case, the voltage induced in the coil is described
by Eq. 55 (Section 0.5).
2. how is the noise deﬁned. As derived in Section 7.10.4, the mean square of the voltage variance is U2
noise = 4RkBT∆f (Eq. 7.76),
where R is the resistance and ∆f is the frequency bandwidth of the detector (the range of frequencies actually detected).
3. how is the time-dependent signal converted to a frequency spectrum. Here, the answer is described in Lecture 3, the most important
step is the Fourier transformation.
9
Technically, the impedance of the cable matches the impedance of the devices (in our case simply equal to the resistance R). In NMR
spectroscopy, we try to match the impedance by setting the capacitance of the matching capacitor when tuning the spectrometer before
the measurement.
10
In a similar manner, mechanic waves propagate along the plucked string of a guitar or another string instrument. Very similar
arguments led to the formulation of the basic ideas of quantum physics (black body radiation).
11
The exact value, given by the statistics of a quantum harmonic oscillator, is hf/(ehf/(kBT ) − 1).
7.10. SUPPORTING INFORMATION 195
According Eq. 55, describing the voltage induced in the detector coil in our setup, the amplitude of the induced voltage is
|Uinduced| =
µ0
4π
2n|µ|S
r3
|ω0|, (7.77)
where µ0 is the magnetic permeability of vacuum, r is the coil from the measured sample,12 n and S are the number of turns and the
cross-section area of the coil. The amplitude of the magnetic moment µ, rotating with the frequency ω0, is equal to the amplitude of the
transverse magnetization of the sample, multiplied by the volume sensed by the detector coil. Eq. 7.29 derived in Section 7.8 tells us that
the expected value of the magnetization rotating in the plane perpendicular to B0 is (in the laboratory coordinate frame)
M+ =
Nγ2 2B0
4kBT
e−R2t
(cos(ω0t) + i sin(ω0t)) =
Nγ2 2B0
4kBT
e−R2t
eiω0t
. (7.78)
We start our analysis ignoring the relaxation factor e−R2t. In such a case,
|Uinduced| =
µ0
4π
2nS
r3
N γ2 2B0
4kBT
|ω0| =
µ0
4π
2nS
r3
N γ2 2B0
4kBT
|γ|B0 =
µ0
4π 2
2
2nS
r3
N |γ|3B2
0
kBT
, (7.79)
where N is the number of magnetic moments in the volume sensed by the receiver coil.
As described in Section 3.10.1, the coil (serving both as transmitter and receiver coil) is a part of an LC circuit, acting as a resonator.
If the capacitor CT, wired in parallel with the coil, is tuned to the resonance frequency ω2
0 = LCT, than it accumulates the energy given
by 1
2
LI2, where I is the current induced in the coil. On the other hand, the coil has also some resistance Rcoil, and therefore it dissipates
a part of the energy as the Joule heat. Balance of the energy accumulation and dissipation is described by the quality factor Q, deﬁned as
Q = |ω0|
energy stored
power loss
= |ω0|
1
2
LI2
1
2
RcoilI2
= |ω0|L/Rcoil. (7.80)
When calculating the parallel impedance of the circuit, the resistance of the coil Rcoil can be repplaced by a parallel (shunt) resistance
of the circuit R
R = Q|ω0|L (7.81)
The amplitude of the voltage actually measured across the coil terminals is
|Umeasured| = Q|Uinduced| =
µ0
4π 2
2
2nQS
r3
N |γ|3B2
0
kBT
. (7.82)
Now, we move from the signal amplitude to the frequency spectrum and reintroduce relaxation. We derived in Section 3.4 (cf. Eq. 3.5)
that the height of a peak obtained by Fourier transformation of a signal with an amplitude A depends on the relaxation rate R2 and on
the acquisition time tmax as
Ymax = A
1 − e−R2tmax
R2
=
µ0
4π 2
2
2nQS
r3
N |γ|3B2
0
kBT
1 − e−R2tmax
R2
. (7.83)
From the practical point of view, it is not important how large is the detected signal (the measured voltage can be ampliﬁed or
attenuated if needed). The sensitivity of the measurement is given by the signal-to-noise ratio. Therefore, we also need to calculate the
noise in the spectrum. According to Eq. 7.76,
U2
noise = 4RkBT∆f = 4RkBT
∆ω
2π
(7.84)
As the noise voltage ﬂuctuates stochastically, we can describe its correlation function in a similar manner as we described it for the
magnetic moment ﬂuctuations in Sections 2.6.1 and 2.6.5, i.e. as Unoise(0) Unoise(t) , and calculate also the corresponding spectral density
function:
Jnoise(ω) =
∞ˆ
−∞
Unoise(0) Unoise(t) e−iωt
dt. (7.85)
The inverse Fourier transformation allows us to calculate
Unoise(0) Unoise(t) =
1
2π
∞ˆ
−∞
Jnoise(ω)eiωt
dω (7.86)
and by setting t = 0
12
We assume that this distance is large, which is not true in NMR spectrometers, but later we include the distance in a general parameter
deﬁning the geometry.
196
U2
noise = Unoise(0) Unoise(0) =
1
2π
∞ˆ
−∞
Jnoise(ω) dω. (7.87)
When applying a band-pass ﬁlter13 selecting only frequencies in the range from ωlow to ωhigh = ωlow + ∆ω,
U2
noise = Unoise(0) Unoise(0) =
1
2π
∞ˆ
−∞
Jnoise(ω) dω =
1
2π
ωlow+∆ωˆ
ωlow
Jnoise(ω) dω (7.88)
because Jnoise(ω) = 0 outside the limits ωlow and ωhigh = ωlow + ∆ω. Comparison with Eq. 7.84, where 4RkBT is frequency
independent, shows that:
U2
noise =
1
2π
4RkBT∆ω =
1
2π
4RkBT
ωlow+∆ωˆ
ωlow
dω =
1
2π
ωlow+∆ωˆ
ωlow
4RkBT dω =
1
2π
ωlow+∆ωˆ
ωlow
Jnoise(ω) dω (7.89)
and therefore Jnoise(ω) = 4RkBT. This ﬁnding helps us to evaluate how noise enters the signal-to-noise ratio of the frequency
spectrum. The Fourier transform
Ynoise =
tmaxˆ
0
Unoise(t)e−iωt
dt (7.90)
is a random quantity that cannot be evaluated easily. However, its mean square can be related to Jnoise(ω) if tmax is suﬃciently long
(tmax 1/∆ω):
Y 2
noise =
tmaxˆ
0
dt
tmaxˆ
0
Unoise(t)Unoise(t − t ) e−iω(t−t )
dt ≈
1
2
tmaxˆ
0
dt
∞ˆ
−∞
Unoise(t)Unoise(t − t ) e−iω(t−t )
dt
=
1
2
Jnoise(ω)
tmaxˆ
0
dt = 2RkBT
tmaxˆ
0
dt = 2RkBTtmax. (7.91)
We can use Eq. 7.81 to convert R to |ω0|L/Q. Since the inductance of a solenoid is L = µ0n2S/l, where l is the length of the solenoid,
R = Q|ω0|L =
Q|ω0|Sn2
l
=
Q|γ|B0Sn2
l
(7.92)
and
Y 2
noise =
2Q|γ|B0kBTSn2tmax
l
. (7.93)
We can now combine Eqs. 7.83 and 7.93, and calculate the signal-to-noise ratio as
Signal/noise =
Ymax
Y 2
noise
=
µ0
4π 2
2
2nQS
r3
N|γ|3
B2
0
kBT
1−e−R2tmax
R2
2Q|γ|B0kBT Sn2tmax
l
=
µ0
4π 2
2 √
2QVcoil
r3
K
N |γ|5/2B
3/2
0
k
3/2
B T3/2
1 − e−R2(T )tmax
R2(T)t
1/2
max
, (7.94)
where Vcoil = Sl is the coil volume. The signal-to-noise ratio in the spectrum also depends on other tricks applied during signal
processing. When deriving Eq. 7.29, we already assumed that the phase correction was applied. Another factor determining the sensitivity
of the spectrum in practice is apodization, but we ignore it now for the sake of simplicity. The actual sensitivity is also proportional to
square root of the ratio of the time of signal acquisition to the overall time of the experiment.14
Eq. 7.94 contains many factors. The blue geometry and construction factors do not deserve much attention as they depend on the
actual instrumental setup, and can be replaced by a general parameter K. The green factors are most interesting. They show why NMR
spectroscopists like to work with high concentrations (resulting in high N), with high-γ nuclei, and at high-ﬁeld spectrometers. The total
13
Limiting the detected range of frequencies is important. A completely random noise is present at all frequencies. Without the bandpass
ﬁlter, this inﬁnite range of frequencies (representing theoretically an inﬁnite noise power) would be aliased (Section 3.6) into the
spectral width given by the time increment of the digital signal.
14
In many experiments (but not necessarily in the one-dimensional experiment), recycle delay (waiting for the sample to return close to
the equilibrium before the next measurement) is much longer than the actual signal acquisition.
7.10. SUPPORTING INFORMATION 197
acquisition time (purple) and temperature (red) and are set for each experiment. We usually prefer to acquire the signal for tmax R2
in order to avoid truncation artifacts discussed in Section 3.4. However, noise also accumulates in time, it grows proportional to
√
tmax.
Therefore, an optimum tmax should be set (depending on R2) and/or well chosen apodization should be applied (Section 3.9). For example,
if our tmax is substantially longer than R2 and we decide to prolong it further, we accumulate only noise without acquiring any additional
signal. The temperature is also a factor that can be controlled easily. At the ﬁrst glance, lower temperatures seem to be beneﬁcial.
However, the dependence of signal-to-noise ratio on the relaxation rate R2 introduces also additional dependence on the temperature and,
in the case of the relaxation caused by the chemical shift anisotropy, on γB0. The relaxation seriously reduces sensitivity of detection of
magnetic moment precession in large, rigid molecules. In such molecules, the major contribution to R2 is the loss of coherence (we labeled
its rate R0 in Section 2.2). As shown in Section 2.2, in a large rigid spherical molecule,
1
R2
≈
6Drot
b2
=
3kBT
4πr3η(T)b2
. (7.95)
When inserted to Eq. 7.94, 1/R2 may seem to change the temperature dependence to 1/T1/2. However, the temperature dependence of
the water viscosity in Eq. 7.95 inﬂuences 1/R2 more than the linear temperature dependence of the numerator. Therefore, the temperature
dependence of sensitivity on the temperature has a maximum (interestingly close to room temperature for medium-size proteins in aqueous
solutions).
The factor 1/b2 in Eq. 7.95 is equal to 1/(γB0δa)2 for the for chemical shift anisotropy. It suggests that the signal-to-noise ratio
should decrease with increasing B0. However, relaxation in most chemical groups of molecules is dominated by other mechanisms than the
chemical shift anisotropy, in particular by the dipole-dipole interactions with magnetic moments of nearby protons. As the dipole-dipole
interactions do not depend on B0, a high ﬁeld usually increases the signal-to-noise ratio. Nevertheless, Eq. 7.95 warns us that using a high
magnetic ﬁeld does not always improves the sensitivity. For example, the relaxation due to the chemical shift anisotropy reduces sensitivity
at high ﬁelds in the case of 13C nuclei in sp2 hybridization without attached protons (e.g. in carbonyl groups).
It should be stressed that when deriving Eq. 12.4.8, we made many simpliﬁcations. We neglected the eﬀect of the preampliﬁer,
resistance of the sample, and assumed that the receiver coil and sample have the same temperature. In the most sensitive NMR probes,
the motions of the electrons are suppressed by cooling the receiver coil to a very low temperature, approximately 20 K. Therefore, we have
to include the sample and coil temperature separately. If the eﬀect of preampliﬁer is included, we get a bit more complex relation
Signal/noise =
Ymax
Y 2
noise
=
µ0
4π 2
2
K
N |γ|5/2B
3/2
0
k
3/2
B Tsample (Tcoil + TsampleR /R + (1 + R /R) T )
1 − e−R2(T )tmax
R2(T)t
1/2
max
, (7.96)
where R is the resistance of the coil, R is the resistance induced by the sample in the coil (proportional to the conductivity and
therefore to the ionic strength of the sample), and T is so called noise temperature of the ampliﬁer.15
The numerical values given by Eqs. 7.94 and 7.96 are of little practical use. However, it is useful to notice how sensitivity depends on
individual factors (temperature, ﬁeld, magnetogyric ratio of the observed nucleus).
15
The input noise is ampliﬁed by the factor (1 + T /T)G, where G is the gain of the preampliﬁer.
198
Lecture 8
Dipolar coupling, product operators
Literature: The product operator formalism for multi-spin systems is described in B17.4, B18,
C2.5.1, C2.7, L15. The dipole-dipole Hamiltonian is discussed in L9.3. Relaxation is described in
K9, L19–L20, C5 in diﬀerent manners. All texts are excellent. It is very helpful to read them all if you
really want to get an insight. However, relaxation is a diﬃcult topic and absorbing the information
requires a lot of time.
8.1 Dipolar coupling
So far, we analyzed eﬀects of various ﬁelds on nuclear magnetic moments, but we assumed that
individual magnetic moments are independent and their properties can be described by operators
composed of two-dimensional matrices. In this lecture, we take into account also mutual interactions
– interactions with ﬁelds generated by magnetic moments of other nuclei.
As usually, we start by the classical description of the interaction. If spin magnetic moments of
two spin-1/2 nuclei interact with each other, the magnetic moment of nucleus 1 is inﬂuenced by the
magnetic ﬁeld B2 of the magnetic moment of nucleus 2. Analysis presented in Section 8.9.1 shows
that the magnetic ﬁeld of nucleus 2 contributes to the magnetic ﬁeld at the position of nucleus 1 as


B2,x
B2,y
B2,z

 =
µ0
4πr5


3r2
x − r2
3rxry 3rxrz
3rxry 3r2
y − r2
3ryrz
3rxrz 3ryrz 3r2
z − r2

 ·


µ2x
µ2y
µ2z

 , (8.1)
where rj are components of a vector describing mutual positions of the nuclei in space. A graphical
representation of the eﬀect of B2 on nucleus 1 and of its dependence on the orientation of the nuclei
(given by the orientation of the molecule) is presented in Figure 8.1. The matrix in Eq. 8.1 can be
viewed as a representation of the tensor of dipolar interactions. In contrast to the chemical shift
tensor, the tensor of dipolar interactions does not have any isotropic or rhombic component.
Having the classical description of the interaction of two magnetic dipolar moments, derivation
of the quantum mechanical Hamiltonian is easy, as shown in Section 8.9.1. The result of Eq. 8.1 is
inserted into the general relation ED = −µ1 ·B2, the magnetic moments are expressed by the angular
momenta (µ1 = γ1I1, µ2 = γ2I2), and the energy and angular momentum components are replaced
by the corresponding operators. The result is
199
200
A
θ
B
θ
C
θ
Figure 8.1: A, Classical description of interaction of a spin magnetic moment of the observed nucleus (shown in
cyan) with a spin magnetic moment of another nucleus (shown in green). The thick purple arrow represents B0, the
thin green induction lines represent the magnetic ﬁeld B2 of the green nucleus (the small green arrows indicate its
direction). The black line represents the internuclear vector r. As the molecule rotates, the cyan nucleus moves from
a position where the ﬁeld of the spin magnetic moment of the green nucleus B2 has the opposite direction than B0
(A), through a position where B2 is perpendicular to B0 (B), to a position where B2 has the same direction as B0
(C).
ˆHD = −
µ0γ1γ2
4πr5
( ˆI1x
ˆI1y
ˆI1z )


3r2
x − r2
3rxry 3rxrz
3rxry 3r2
y − r2
3ryrz
3rxrz 3ryrz 3r2
z − r2




ˆI2x
ˆI2y
ˆI2z


= −
µ0
4πr3
(3r2
x − r2
)ˆI1x
ˆI2x + (3r2
y − r2
)ˆI1y
ˆI2y + (3r2
z − r2
)ˆI1z
ˆI2z+
3rxry
ˆI1x
ˆI2y + 3rxrz
ˆI1x
ˆI2z + 3ryrz
ˆI1y
ˆI2z + 3ryrx
ˆI1y
ˆI2x + 3rzrx
ˆI1z
ˆI2x + 3rzry
ˆI1z
ˆI2y . (8.2)
After deﬁning the Hamiltonian of the dipole-dipole interaction, we can ask how is the total Hamiltonian
representing energy of the magnetic moment pairs inﬂuenced by the dipolar coupling. In the
absence of radio waves,1
the energy of the magnetic moment pairs depends on B0, on chemical shifts
δ1 and δ2 of the coupled nuclei, and on the dipolar coupling. The corresponding Hamiltonian consists
of the isotropic component ˆH0, and of an anisotropic part including axial and rhombic components
of the chemical shift Hamiltonian and of the Hamiltonian representing the dipolar coupling, ˆHD.
The complete Hamiltonian ˆHD described by Eq. 8.2 is rather complex. However, it can be often
greatly simpliﬁed, as discussed in Section 8.9.2. The secular approximation depends on whether the
precession frequencies of the interacting magnetic moments are identical or diﬀerent. In the former
case, ˆHD simpliﬁes to
ˆHD = −
µ0γ1γ2
4πr3
3 cos2
ϑ − 1
2
2ˆI1z
ˆI2z − ˆI1x
ˆI2x − ˆI1y
ˆI2y , (8.3)
1
We assume that the ﬁeld of irradiating radio waves is much stronger than the dipolar interactions of nuclear
magnetic moments. Therefore, we neglect the eﬀect of dipolar coupling during the short radio wave pulses.
8.2. QUANTUM STATES OF MAGNETIC MOMENT PAIRS 201
in the latter case, to
ˆHD = −
µ0γ1γ2
4πr3
3 cos2
ϑ − 1
2
2ˆI1z
ˆI2z . (8.4)
As derived in Section 8.9.1, ˆHD depends on the orientation of the molecule like the anisotropic
component of the chemical shift. It implies that whole ˆHD averages to zero in isotropic liquids
(Section 8.9.2).
The Hamiltonian representing energy of an ensemble of pairs of directly interacting spin dipolar
magnetic moments in B0 reduces in isotropic liquids to
ˆH = −γ1B0(1 + δi,1)ˆI1z − γ2B0(1 + δi,2)ˆI2z. (8.5)
The simpliﬁed Eq. 8.5 is valid only in isotropic liquids and does not describe relaxation processes.
The eﬀect of ˆHD is huge in solid state NMR and can be also be measured e.g. in liquid crystals or
mechanically stretched gels. Last but not least, dipole-dipole interactions result in strong relaxation
eﬀects, discussed in Section 8.7.
8.2 Quantum states of magnetic moment pairs
We know how to construct the Hamiltonian of the dipole-dipole interactions from the operators
ˆI1x, ˆI1y, ˆI1z, ˆI2x, ˆI2y, ˆI2z, but we still did not describe the explicit forms of these operators or of the
wave function the Hamiltonian acts on. To ﬁll this gap in our knowledge, we look for a vector
representing the wave function representing coupled magnetic moments. Although we are concerned
with direct dipole-dipole interactions in this Lecture, we try to formulate our conclusions so that
they apply to various couplings of nuclear magnetic moments in general.
We ﬁrst describe a quantum state of a pair of non-interacting spin-1/2 nuclei. The wave function
Ψ of such a pair of particles can be decomposed into the spin-part and a part dependent on the other
degrees of freedom (spatial coordinates of the nuclei). The spin part can be further separated into a
product of wave functions dependent on the spin degrees of freedom of the individual nuclei:
Ψ = ψnon-spin(x1, y1, z1, x2, y2, z2) · ψspin(cα,1, cα,2) = ψnon-spin · ψ1,spin · ψ2,spin. (8.6)
Writing explicitly ﬁrst ψ1,spin
Ψ = ψnon-spin ·
cα,1
cβ,1
· ψ2,spin = ψnon-spin ·
cα,1ψ2,spin
cβ,1ψ2,spin
(8.7)
and then ψ2,spin
Ψ = ψnon-spin ·




cα,1
cα,2
cβ,2
cβ,1
cα,2
cβ,2



 = ψnon-spin ·




cα,1cα,2
cα,1cβ,2
cβ,1cα,2
cβ,1cβ,2



 ≡ ψnon-spin ·




cαα
cαβ
cβα
cββ



 , (8.8)
202
we obtain a four-component wave function built as a direct product2
(or Kronecker product) of
two-component wave functions (state vectors) of single spin magnetic moments:
cα,1
cβ,1
⊗
cα,2
cβ,2
=




cα,1
cα,2
cβ,2
cβ,1
cα,2
cβ,2



 =




cα,1cα,2
cα,1cβ,2
cβ,1cα,2
cβ,1cβ,2



 ≡




cαα
cαβ
cβα
cββ



 . (8.9)
A detailed analysis of the four-component wavefunction is presented in Section 8.9.3 (for noninteracting
and interacting magnetic moments). Here, we only summarize the results of the analysis.
In the eigenequation, ψnon-spin is canceled out (see Section 5.4). The introduced four-component
function is written in a basis of vectors that are simultaneous eigenfuctions of the angular momentum
operators ˆI2
1 , ˆI2
2 , ˆI1z, ˆI2z. If the magnetic moments are independent, ˆI2
1 = ˆI2
2 , ˆI1z = ˆI2z, and the
pair can be described in a two-component basis of the eigenfunctions of ˆIz = ˆI1z = ˆI2z (and of
ˆI2
= ˆI2
1 = ˆI2
2 ), as described in Section 6.1.
If the magnetic moments of the pair interact, they cannot be described in the two-component
basis of independent spin magnetic moments. State of the ﬁrst spin depends on the state of the
second spin. Therefore, the probability density matrix describing a large ensemble of pairs that
interact mutually, but are isolated from other pairs, must be four-dimensional, built from coeﬃcients
of the wave function in Eq. 8.8. In other words, we can use the mixed-state approach, but we must
describe the pair of the interacting magnetic moments and its four states as one entity. Furthermore,
the Hamiltonian of dipolar interactions (Eq. 8.2) is built from operators representing products of
individual components of the interacting magnetic moments. Let us now look for a basis that fulﬁls
these requirements.
8.3 Product operators
The wave function (state vector) describing a single interacting pair of magnetic moments is fourdimensional.
Therefore, the density matrix that describes an ensemble of such interacting pairs, and
consists of averaged combinations of the elements of the four-dimensional state vector, is a 4 × 4
matrix
ˆρ =




cααc∗
αα cααc∗
αβ cααc∗
βα cααc∗
ββ
cαβc∗
αα cαβc∗
αβ cαβc∗
βα cαβc∗
ββ
cβαc∗
αα cβαc∗
αβ cβαc∗
βα cβαc∗
ββ
cββc∗
αα cββc∗
αβ cββc∗
βα cββc∗
ββ



 . (8.10)
2
Direct product ˆA ⊗ ˆB is a mathematical operation when each element of the matrix ˆA is multiplied by the whole
matrix ˆB:
ˆA⊗ ˆB =
A11 A12
A21 A22
⊗
B11 B12
B21 B22
=




A11
B11 B12
B21 B22
A12
B11 B12
B21 B22
A21
B11 B12
B21 B22
A22
B11 B12
B21 B22



 =




A11B11 A11B12 A12B11 A12B12
A11B21 A11B22 A12B21 A12B22
A21B11 A21B12 A22B11 A22B12
A21B21 A21B22 A22B21 A22B22




8.3. PRODUCT OPERATORS 203
Basis used for such density matrices and for operators acting on the four-dimensional wave function
must consist of 42
= 16 matrices.3
The four-dimensional wave function (state vector) describing
the interacting pair of magnetic moments was constructed as a direct product of two-dimensional
single-spin state vectors. Not surprisingly,4
the basis of the 4 × 4 matrices can be built from direct
products of 2 × 2 basis matrices used for spins without mutual interactions. For example, Cartesian
single-spin operators can be used to create a basis for two spins (see Tables 8.1 and 8.2) using the
following direct products of normalized basis matrices:
√
2I
(1)
t ⊗
√
2I
(2)
t = I
(12)
t (8.11)
√
2I (1)
x ⊗
√
2I
(2)
t = I
(12)
1x (8.12)
√
2I (1)
y ⊗
√
2I
(2)
t = I
(12)
1y (8.13)
√
2I (1)
z ⊗
√
2I
(2)
t = I
(12)
1z (8.14)
√
2I
(1)
t ⊗
√
2I (2)
x = I
(12)
2x (8.15)
√
2I
(1)
t ⊗
√
2I (2)
y = I
(12)
2y (8.16)
√
2I
(1)
t ⊗
√
2I (2)
z = I
(12)
2z (8.17)
√
2I (1)
x ⊗
√
2I (2)
x = 2I1xI
(12)
2x (8.18)
√
2I (1)
x ⊗
√
2I (2)
y = 2I1xI
(12)
2y (8.19)
√
2I (1)
x ⊗
√
2I (2)
z = 2I1xI
(12)
2z (8.20)
√
2I (1)
y ⊗
√
2I (2)
x = 2I1yI
(12)
2x (8.21)
√
2I (1)
y ⊗
√
2I (2)
y = 2I1yI
(12)
2y (8.22)
√
2I (1)
y ⊗
√
2I (2)
z = 2I1yI
(12)
2z (8.23)
√
2I (1)
z ⊗
√
2I (2)
x = 2I1zI
(12)
2x (8.24)
√
2I (1)
z ⊗
√
2I (2)
y = 2I1zI
(12)
2y (8.25)
√
2I (1)
z ⊗
√
2I (2)
z = 2I1zI
(12)
2z , (8.26)
where the numbers in parentheses specify which nuclei constitute the spin system described by
the given matrix (these numbers are not written in practice). The matrices on the right-hand side
3
In general, the density matrix for n states is a n × n matrix. Basis used for such density matrices must consist of
4n
matrices.
4
The relation between the construction of the state vectors and of operators acting on them is described by the
group theory. It follows from the analysis of rotation of the state vectors and operators acting on them that the
coupling between the state vectors and between the operators is the same.
204
are known as product operators. Note that 2It, equal to5 ˆ1, is not written in the product operators
for the sake of simplicity. Note also that e.g. I
(12)
x and I
(2)
x are the same 2 × 2 matrices, but I
(12)
1x
and I
(12)
2x are diﬀerent 4×4 matrices. Basis matrices for more nuclei are derived in the same manner,
a more detailed discussion is presented in Section 12.4.5.
The basis presented in Eqs. 8.11–8.26 represents one of many possible choices. Other choices are
presented in Section 8.9.4.
8.4 Density matrix of a two-spin system
The introduced formal description of the density matrix would be useless if we did not understand
its physical signiﬁcance. Interpretation of the 4 × 4 density matrix requires more care than the
interpretation of its two-dimensional version. In general, the density matrix ˆρ is a linear combination
of 16 basis matrices Bj (the actual forms of Bj depend on the chosen basis):
ˆρ =
16
j=1
CjBj (8.27)
Each basis matrix Bj describes one feature of the mixed state (e.g., longitudinal polarization of
the ﬁrst magnetic moment) and the coeﬃcients Cj specify how much the given feature contributes
to the mixed state. Below, we interpret the individual matrices of a commonly used Cartesian
basis. Although we discuss direct dipole-dipole interaction in this Lecture, the interpretation of the
Cartesian matrices is general and applicable to other interactions between the magnetic moments.
The description of the matrices is also summarized in Tables 8.1 and 8.2.
The Cartesian basis contains four diagonal matrices. Like in the two-dimensional case, the diagonal
elements of ˆρ and diagonal matrices describe longitudinal polarization of the magnetic moments.
The sum of the diagonal elements is equal to one, like in the two-dimensional density matrix. Therefore,
we have three independent populations. Two of them, corresponding to contributions of matrices
labeled I1z and I2z, describe separately longitudinal magnetic moment polarization of nuclei 1 and 2,
respectively. Contribution of the third diagonal matrix, 2I1zI2z, describes correlation between µ1z
and µ2z, how much the longitudinal polarization of µ1 is inﬂuenced by the longitudinal polarization
of µ2, and vice versa.
Twelve oﬀ-diagonal elements or matrices composed of them are called coherences. Only six oﬀdiagonal
elements are independent because each element below the diagonal has its complex conjugate
above the diagonal. Note, however, that coherences are complex quantities. The six independent oﬀdiagonal
elements thus represent twelve real numbers. Therefore, none of twelve purely real or purely
imaginary matrices in Table 8.2 is redundant. Coherences corresponding to contributions of matrices
I1x and I2x, respectively, describe transverse polarization in the direction x of magnetic moments
of nuclei 1 and 2, regardless of the state of the other nucleus. Contributions of I1y and I2y describe
transverse polarization in the direction y in the same manner. A contribution of 2I1xI2z describes
how the transverse polarization of µ1 in the x direction depends on the longitudinal polarization of
µ2. Dependence of the transverse polarization of µ2 in the x direction on the longitudinal polarization
5ˆ1 is a 2 × 2 unit matrix in the case of I
(1)
t or I
(2)
t , and a 4 × 4 unit matrix in the case of I
(1)
t .
8.5. COMMUTATORS OF PRODUCT OPERATORS 205
Table 8.1: Contributions to the two-spin density matrix describing uniform distribution and longitudinal polarizations
of spin magnetic moments µ1 and µ2. In the graphical representation, the left and right distribution corresponds to of
superimposed µ1 and µ2, respectively. The uniform distribution is shown in black. In order to visualize correlation of
the longitudinal polarization, the following color-coding is used. In the case of longitudinal polarization of µ1, magnetic
moments of nucleus 1 in 10 % molecules with most polarized µ1 are shown in cyan, and magnetic moments of nucleus 2
in the same molecules are shown in green. In the case of longitudinal polarization of µ2, magnetic moments of nucleus
2 in 10 % molecules with most polarized µ2 are shown in green, and magnetic moments of nucleus 1 in the same
molecules are shown in cyan. The chosen distributions of orientation symbolize the trend of polarization represented
by the given matrix, the depicted degree of polarization is lower than the degree corresponding to the actual matrices:
basis matrices describe either no polarization (uniform distribution of orientations) or complete polarization (identical
orientations, i.e., a single arrow in the plot).
Matrix graph description
It = 1
2




+1 0 0 0
0 +1 0 0
0 0 +1 0
0 0 0 +1



 no polarization of µ1 or µ2
I1z = 1
2




+1 0 0 0
0 +1 0 0
0 0 −1 0
0 0 0 −1



 longitudinal polarization of µ1 regardless of µ2
I2z = 1
2




+1 0 0 0
0 −1 0 0
0 0 +1 0
0 0 0 −1



 longitudinal polarization of µ2 regardless of µ1
2I1zI2z = 1
2




+1 0 0 0
0 −1 0 0
0 0 −1 0
0 0 0 +1



 correlation of longitudinal polarizations of µ1 and µ2
of µ1 is given by the contribution 2I1zI2x. The same applies to 2I1yI2z, 2I1zI2y and to direction
y. Finally, contributions of 2I1xI2x, 2I1yI2y, 2I1xI2y, and 2I1yI2x describe mutual correlation
of transverse polarizations of µ1 and µ2.
8.5 Commutators of product operators
The Liouville-von Neumann equation can be written for coupled magnetic moments in the same form
as for spins without mutual interactions (Eq. 6.8):
dˆρ
dt
= i(ˆρH − H ˆρ) = i[ˆρ, H ] = −i[H , ˆρ], (8.28)
but the density matrix and Hamiltonian are now6
4 × 4 matrices. Also Eqs. 6.9 and 6.10 can be
generalized to product operators. The same simple geometric solution of the Liouville-von Neumann
equation is possible if the Hamiltonian does not vary in time and consists of commuting matrices only.
However, the operator space is now 16-dimensional. Therefore, the appropriate three-dimensional
subspace must be selected for each rotation. The subspaces are deﬁned by the commutation relations
6
In general, Eq. 6.8 is valid for n × n matrices describing ensembles of n mutually interacting nuclei.
206
Table 8.2: Contributions to the two-spin density matrix describing coherences (see Table 8.1 for color coding).
Matrix graph description
I1x = 1
2




0 0 +1 0
0 0 0 +1
+1 0 0 0
0 +1 0 0



 transverse polarization of µ1 in direction x, regardless of µ2
2I1xI2z = 1
2




0 0 +1 0
0 0 0 −1
+1 0 0 0
0 −1 0 0




correlation between transverse polarization of µ1 in direction x
and longitudinal polarization of µ2
I1y = i
2




0 0 −1 0
0 0 0 −1
+1 0 0 0
0 +1 0 0




transverse polarization of µ1 in direction y, regardless of µ2
2I1yI2z = i
2




0 0 −1 0
0 0 0 +1
+1 0 0 0
0 −1 0 0




correlation between transverse polarization of µ1 in direction y
and longitudinal polarization of µ2
I2x = 1
2




0 +1 0 0
+1 0 0 0
0 0 0 +1
0 0 +1 0



 transverse polarization of µ2 in direction x, regardless of µ1
2I1zI2x = 1
2




0 +1 0 0
+1 0 0 0
0 0 0 −1
0 0 −1 0




correlation between transverse polarization of µ2 in direction x
and longitudinal polarization of µ1
I2y = i
2




0 −1 0 0
+1 0 0 0
0 0 0 −1
0 0 +1 0



 transverse polarization of µ2 in direction y, regardless of µ1
2I1zI2y = i
2




0 −1 0 0
+1 0 0 0
0 0 0 +1
0 0 −1 0




correlation between transverse polarization of µ2 in direction y
and longitudinal polarization of µ1
2I1xI2x = 1
2




0 0 0 +1
0 0 +1 0
0 +1 0 0
+1 0 0 0




correlation between transverse polarization of µ1 and µ2
in direction x
2I1yI2y = 1
2




0 0 0 −1
0 0 +1 0
0 +1 0 0
−1 0 0 0




correlation between transverse polarization of µ1 and µ2
in direction y
2I1xI2y = i
2




0 0 0 −1
0 0 +1 0
0 −1 0 0
+1 0 0 0




correlation between transverse polarization of µ1 in direction x
and transverse polarization of µ2 in direction y
2I1yI2x = i
2




0 0 0 −1
0 0 −1 0
0 +1 0 0
+1 0 0 0




correlation between transverse polarization of µ1 in direction y
and transverse polarization of µ2 in direction x
8.6. OPERATOR OF THE OBSERVED QUANTITY FOR MORE NUCLEI 207
derived in Section 8.9.5. The relations (applicable to any set of n2
operators of spin systems consisting
of n spin-1/2 nuclei) are described by the following equations:
[Inx, Iny] = iInz [Iny, Inz] = iInx [Inz, Inx] = iIny (8.29)
[Inj, 2InkIn l] = 2[Inj, Ink]In l (8.30)
[2InjIn l, 2InkIn m] = [Inj, Ink]δlm + [In l, In m]δjk, (8.31)
where n and n specify the nucleus, j, k, l ∈ {x, y, z}, and δjk = 1 for l = m and δjk = 0 for j = k.
Since the dipolar interactions do not have coherent eﬀects in isotropic liquids, we postpone discussion
of the rotations in the product operator space to Section 10.4, where we discuss interactions that are
not averaged to zero in isotropic samples.
8.6 Operator of the observed quantity for more nuclei
In order to describe the observed signal for a system of n diﬀerent nuclei, Eq. 7.1, deﬁning the
operator of complex magnetization, must be slightly modiﬁed
ˆM+ =
n
Nγn(ˆInx + iˆIny) =
n
Nγn
ˆIn+, (8.32)
where the index n distinguishes diﬀerent types of nuclei. In the case of magnetic moment pairs
discussed in this Lecture, n = 2.
8.7 Dipolar relaxation
As mentioned above, dipole-dipole interactions do not have coherent eﬀects (do not inﬂuence the
measured values of precession frequencies) in isotropic liquids. On the other hand, the dipole-dipole
interactions represent a very important source of relaxation.
Rotation of the molecule (and internal motions) change the orientation of the inter-nuclear vector
and cause ﬂuctuations of the ﬁeld of the magnetic moment µ2 sensed by the magnetic moment µ1. It
leads to the loss of coherence in the same manner as described for the anisotropic part of the chemical
shift (cf. Eqs 1.86 and 8.56). However, the relaxation eﬀects of the dipole-dipole interactions are
more complex, reﬂecting the higher complexity of the Hamiltonian of the dipolar coupling. A detailed
analysis is presented in Section 8.9.6. The analysis shows how molecular motions determine constants
Ra1, Ra2, Rx, R2,1, and R2,2 in the following relaxation equations:
208
d∆ M1z
dt
= −Ra1∆ M1z − Rx∆ M2z , (8.33)
d∆ M2z
dt
= −Ra2∆ M2z − Rx∆ M1z , (8.34)
d M1+
dt
= −R2,1 M1+ , (8.35)
d M2+
dt
= −R2,2 M2+ . (8.36)
The values of the relaxation rates and their dependence are discussed in Section 8.9.7. Here we
mention only two features that directly aﬀect the experiment described in the next Lecture.
• Eqs. 8.33 and 8.34 reveal an important feature of the relaxation due to dipole-dipole interactions.
Return to the equilibrium polarization of nucleus 1 depends also on the actual
polarization of nucleus 2. This eﬀect, resembling chemical kinetics of a reversible reaction, is
known as cross-relaxation, or nuclear Overhauser eﬀect (NOE), and described by the crossrelaxation
constant Rx. The value of Rx is proportional to r−6
and thus provides information
about inter-atomic distances. NOE is a useful tool in analysis of small molecules and the most
important source of structural information for large biological molecules.
• Eqs. 8.35 and 8.36 have a similar form as those describing transverse relaxation due to the
chemical shift anisotropy (Section 7.7). In real samples, contributions to relaxation due to
the chemical shift anisotropy and due to dipole-dipole interactions (often with several spin
magnetic moments close in space) are combined. The constants R1 and R2 (and other) are
therefore sums of the relaxation rate constants described here and in Section 7.7. At moderate
B0 ﬁelds (up to 15–20 T, depending on the molecule), relaxation of 1
H in unlabeled molecules
and of 13
C and 15
N in CHn and NHn groups of 13
C/15
N labeled molecules is usually dominated
by dipole-dipole interactions with protons.
8.8 Thermal equilibrium with dipolar coupling
As shown in Section 8.9.8, if we neglect the chemical shifts (δi,1 1, δi,2 1), the density matrix
describing two diﬀerent nuclei coupled only through dipolar interactions is
ˆρeq
=





1
4
+ γ1B0
8kBT
+ γ2B0
8kBT
0 0 0
0 1
4
+ γ1B0
8kBT
− γ2B0
8kBT
0 0
0 0 1
4
− γ1B0
8kBT
+ γ2B0
8kBT
0
0 0 0 1
4
− γ1B0
8kBT
− γ2B0
8kBT





(8.37)
=
1
4




1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1



 +
γ1B0
8kBT




+1 0 0 0
0 +1 0 0
0 0 −1 0
0 0 0 −1



 +
γ2B0
8kBT




+1 0 0 0
0 −1 0 0
0 0 +1 0
0 0 0 −1



 (8.38)
8.8. THERMAL EQUILIBRIUM WITH DIPOLAR COUPLING 209
=
1
2
(It + κ1I1z + κ2I2z) , (8.39)
where
κj =
γjB0
2kBT
. (8.40)
HOMEWORK
To prepare for the next lecture, analyze evolution of the density matrix described in Section 9.2.
210
8.9 SUPPORTING INFORMATION
8.9.1 Tensor and Hamiltonian of dipolar coupling
As shown in Section 4.9.2, magnetic induction can be expressed as a curl (rotation) of the vector potential (B = × A). Therefore, the
magnetic induction of the ﬁeld of nucleus B2 is given by the classical electrodynamics as
B2 = × A2, (8.41)
where
≡
∂
∂x
,
∂
∂y
,
∂
∂z
. (8.42)
Let us assume (classically) that the source of the magnetic moment of nucleus 2 is a current loop. It can be derived from Maxwell
equations7 that the vector potential A2 in a distance much larger than radius of the loop is
A2 =
µ0
4π
µ2 × r
r3
, (8.43)
where r is a vector deﬁning the mutual position of nuclei 1 and 2 (inter-nuclear vector). The individual components of A2 are
A2,x =
µ0
4π
µ2y
rz
r3
− µ2z
ry
r3
, (8.44)
A2,y =
µ0
4π
µ2z
rx
r3
− µ2x
rz
r3
, (8.45)
A2,z =
µ0
4π
µ2x
ry
r3
− µ2y
rx
r3
. (8.46)
Calculation of B2 thus includes two vector products
B2 =
µ0
4π
×
µ2 × r
r3
. (8.47)
As a consequence, each component of B2 depends on all components of µ2:
B2,x =
µ0
4π
∂A2,z
∂ry
−
∂A2,y
∂rz
=
µ0
4π
µ2x
∂
∂ry
ry
r3
+
∂
∂rz
rz
r3
− µ2y
∂
∂ry
rx
r3
− µ2z
∂
∂rz
rx
r3
, (8.48)
B2,y =
µ0
4π
∂A2,x
∂rz
−
∂A2,z
∂rx
=
µ0
4π
µ2y
∂
∂rz
rz
r3
+
∂
∂rx
rx
r3
− µ2z
∂
∂rz
ry
r3
− µ2x
∂
∂rx
ry
r3
, (8.49)
B2,z =
µ0
4π
∂A2,y
∂rx
−
∂A2,x
∂ry
=
µ0
4π
µ2z
∂
∂rx
rx
r3
+
∂
∂ry
ry
r3
− µ2x
∂
∂rx
rz
r3
− µ2y
∂
∂ry
rz
r3
. (8.50)
To proceed, we have to evaluate the partial derivatives ∂
∂rj
rj
r3 and ∂
∂rj
rk
r3 :
∂
∂rj
rj
r3
=
∂
∂rj
rj
r2
x + r2
y + r2
z
3
=
∂
∂rj
rj
r2
x + r2
y + r2
z
3/2
=
1 · r3 − rj · 3
2
r · 2rj
r6
=
1
r3
−
3r2
j
r5
, (8.51)
∂
∂rj
rk
r3
=
∂
∂rj
rk
r2
x + r2
y + r2
z
3
∂
∂rj
rk
r2
x + r2
y + r2
z
3/2
=
0 · r3 − rk · 3
2
r · 2rj
r6
= −
3rjrk
r5
(8.52)
After inserting the partial derivatives from Eqs. 8.51 and 8.52 to Eqs. 8.48–8.50,
B2,x =
µ0
4πr5
((3r2
x − r2
)µ2x + 3rxryµ2y + 3rxrzµ2z) (8.53)
B2,y =
µ0
4πr5
(3rxryµ2x + (3r2
y − r2
)µ2y + 3ryrzµ2z) (8.54)
B2,z =
µ0
4πr5
(3rxrzµ2x + 3ryrzµ2y + (3r2
z − r2
)µ2z), (8.55)
7
The derivation is presented in The Feynman Lectures on Physics, Vol. 2, Chapter 14 (the general description is presented in Section 14.2.
and the current loop is discussed in Section 14.5), using an analogy with the description of the electric dipole in Section 14.3. of Vol. 2.
8.9. SUPPORTING INFORMATION 211
which can by described by a matrix equation


B2,x
B2,y
B2,z

 =
µ0
4πr5


3r2
x − r2 3rxry 3rxrz
3rxry 3r2
y − r2 3ryrz
3rxrz 3ryrz 3r2
z − r2

 ·


µ2x
µ2y
µ2z

 . (8.56)
The matrix in Eq. 8.56 represents a tensor describing the geometric relations of the dipolar coupling and has the same form as the
matrix in Eq. 1.86, describing the anisotropic contribution to the chemical shift tensor: the vector deﬁning the symmetry axis of the
chemical shift tensor Z is just replaced with the inter-nuclear vector r in Eq. 8.56. Like the anisotropic part of the chemical shift tensor,
the matrix in Eq. 8.56 simpliﬁes to
µ0
4πr3


−1 0 0
0 −1 0
0 0 2

 (8.57)
in a coordinate system with axis z r. Rotation to the laboratory frame is described by angles ϕ and ϑ deﬁning orientation of r in
the laboratory frame


−1 0 0
0 −1 0
0 0 2

 −→
1
r2


3r2
x − r2 3rxry 3rxrz
3rxry 3r2
y − r2 3ryrz
3rxrz 3ryrz 3r2
z − r2

 , (8.58)
where rx = r sin ϑ cos ϕ, ry = r sin ϑ sin ϕ, and rz = r cos ϑ.
As usually, Hamiltonian of the dipolar coupling can be obtained using the classical description of the energy. Classical electrodynamics
tells us that the energy of the interaction of the magnetic moment of nucleus 1 with the ﬁeld generated by the magnetic moment of nucleus
2, described by Eq. 8.56 is
ED = −µ1 · B2 = −
µ0
4πr3
( 3r2
x − r2
)µ1xµ2x + (3r2
y − r2
)µ1yµ2y + (3r2
z − r2
)µ1zµ2z+
+ 3rxryµ1xµ2y + 3rxrzµ1xµ2z + 3ryrzµ1yµ2z
+ 3ryrxµ1yµ2x + 3rzrxµ1zµ2x + 3rzryµ1zµ2y) . (8.59)
Describing the magnetic moments by the operators ˆµ1jγ1
ˆI1j and ˆµ2,jγ1
ˆI2,j, where j is x, y, and z, the Hamiltonian of dipolar coupling
ˆHD can be written as
ˆHD = −
µ0
4πr3
( (3r2
x − r2
)ˆI1x
ˆI2x + (3r2
y − r2
)ˆI1y
ˆI2y + (3r2
z − r2
)ˆI1z
ˆI2z+
+ 3rxry
ˆI1x
ˆI2y + 3rxrz
ˆI1x
ˆI2z + 3ryrz
ˆI1y
ˆI2z
+ 3ryrx
ˆI1y
ˆI2x + 3rzrx
ˆI1z
ˆI2x + 3rzry
ˆI1z
ˆI2y
= −
µ0γ1γ2
4πr5
( ˆI1x
ˆI1y
ˆI1z )


3r2
x − r2 3rxry 3rxrz
3rxry 3r2
y − r2 3ryrz
3rxrz 3ryrz 3r2
z − r2




ˆI2x
ˆI2y
ˆI2z

 =
ˆ
I1 · D ·
ˆ
I2, (8.60)
where D is the tensor of direct dipole-dipole interactions (dipolar coupling).
The Hamiltonian can be written in spherical coordinates as
ˆHD = −
µ0γ1γ2
4πr3
(3 sin2
ϑ cos2
ϕ − 1)ˆI1x
ˆI2x + (3 sin2
ϑ sin2
ϕ − 1)ˆI1y
ˆI2y + (3 cos2
ϑ − 1)ˆI1z
ˆI2z+
+ 3 sin2
ϑ sin ϕ cos ϕˆI1x
ˆI2y + 3 sin ϑ cos ϑ cos ϕˆI1x
ˆI2z + 3 sin ϑ cos ϑ sin ϕˆI1y
ˆI2z
+ 3 sin2
ϑ sin ϕ cos ϕˆI1y
ˆI2x + 3 sin ϑ cos ϑ cos ϕˆI1z
ˆI2x + 3 sin ϑ cos ϑ sin ϕˆI1z
ˆI2y . (8.61)
8.9.2 Secular approximation and averaging of dipolar Hamiltonian
Like the chemical-shift Hamiltonian, the Hamiltonian of dipolar coupling can be simpliﬁed in many cases.
212
• Magnetic moments with the same γ and chemical shift precess about the z axis with the same precession frequency. In addition
to the precession, the magnetic moments move with random molecular motions, described by re-orientation of r. In a coordinate
system rotating with the common precession frequency, r quickly rotates about the z axis in addition to the random molecular
motions. On a time scale longer than nanoseconds, the rapid oscillations of rx, ry, and rz are neglected (secular approximation).
The values of r2
x and r2
y do not oscillate about zero, but about a value r2
x = r2
y , which is equal to8 (r2 − r2
z )/2 because
r2
x + r2
y + r2
z = r2 = r2. Therefore, the secular approximation (i.e., neglecting the oscillations and keeping the average values)
simpliﬁes the Hamiltonian to
ˆHD = −
µ0γ1γ2
4πr5
3 r2
z − r2 ˆI1z
ˆI2z −
1
2
ˆI1x
ˆI2x −
1
2
ˆI1y
ˆI2y = −
µ0γ1γ2
4πr3
3 cos2 ϑ − 1
2
2ˆI1z
ˆI2z − ˆI1x
ˆI2x − ˆI1y
ˆI2y . (8.62)
• Magnetic moments with diﬀerent γ and/or chemical shift precess with diﬀerent precession frequencies. Therefore, the x and y
components of µ2 rapidly oscillate in a frame rotating with the precession frequency of µ1 and vice versa. When neglecting the
oscillating terms (secular approximation), the Hamiltonian reduces to
ˆHD = −
µ0γ1γ2
4πr5
3 r2
z − r2 ˆI1z
ˆI2z = −
µ0γ1γ2
4πr3
3 cos2 ϑ − 1
2
2ˆI1z
ˆI2z. (8.63)
• Averaging over all molecules in isotropic liquids has the same eﬀect as described for the anisotropic part of the chemical shielding
tensor because both tensors have the same form. Terms with diﬀerent coordinates average to zero because they contain products
of sine and cosine functions of 2ϑ, ϕ and 2ϕ. As the angles ϑ and ϕ are independent, their functions average independently. And
as 2ϑ and ϕ can have in isotropic liquids any value in the interval (0, 2π) with equal probability, the averages of their sine and
cosine functions are equal to zero
rxry = 3 sin2 ϑ sin ϕ cos ϕ =
3
2
(1 − cos(2ϑ)) ·
1
2
sin(2ϕ) =
3
4
sin(2ϕ) −
3
4
cos(2ϑ) · sin(2ϕ) = 0 − 0 · 0 = 0, (8.64)
rxrz = 3 sin ϑ cos ϑ cos ϕ =
3
2
(sin(2ϑ)) · cos ϕ =
3
4
sin(2ϑ) · cos ϕ = 0 · 0 = 0, (8.65)
ryrz = 3 sin ϑ cos ϑ sin ϕ =
3
2
(sin(2ϑ)) · sin ϕ =
3
4
sin(2ϑ) · sin ϕ = 0 · 0 = 0. (8.66)
The terms with the same coordinates are identical because no direction is preferred:
r2
x = r2
y = r2
z. (8.67)
Finally,
r2
x + r2
y + r2
z = r2
⇒ r2
x + r2
y + r2
z = 3r2
j = r2
⇒ 3r2
j − r2 = 0. (8.68)
8.9.3 Interacting and non-interacting magnetic moments
We have decomposed a wave function of a pair of magnetic moments to (Eq. 8.8)
Ψ = ψnon-spin ·





cα,1
cα,2
cβ,2
cβ,1
cα,2
cβ,2





= ψnon-spin ·




cα,1cα,2
cα,1cβ,2
cβ,1cα,2
cβ,1cβ,2



 ≡ ψnon-spin ·




cαα
cαβ
cβα
cββ



 , (8.69)
What tells us if we can describe the state of the individual magnetic moments in the two-dimensional basis |α , |β ? We inspect
eigenfunctions and eigenvalues of the Hamiltonian including the inﬂuence of B0, chemical shifts, and dipolar coupling, in the secular
approximation:
ˆH = −γ1B0(1 + δi,1)ˆI1z − γ2B0(1 + δi,2)ˆI2z −
µ0γ1γ2
4πr3
3 cos2 ϑ − 1
2
2ˆI1z
ˆI2z − ˆI1x
ˆI2x − ˆI1y
ˆI2y
= ω0,1
ˆI1z + ω0,2
ˆI2z + D 2ˆI1z
ˆI2z − ˆI1x
ˆI2x − ˆI1y
ˆI2y (8.70)
8
Note that r2
x = r2
y = r2
z in general.
8.9. SUPPORTING INFORMATION 213
If the magnetic moments are too distant to interact mutually (r → ∞ ⇒ D → 0), the Hamiltonian simpliﬁes to a sum of two operators
acting separately on each magnetic moment
ˆH = ω0,1
ˆI1z + ω0,2
ˆI2z. (8.71)
As discussed in Section 6.7.3, action of such Hamiltonian can be described by two independent eigenequations
ω0,1
ˆI1zψ(1)
= E
(1)
D ψ(1)
ω0,2
ˆI2zψ(2)
= E
(2)
D ψ(2)
. (8.72)
The eigenfunctions can be found immediately:
ω0,1
2




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 ·





1
cα,2
cβ,2
0
cα,2
cβ,2





=
ω0,1
2





1
cα,2
cβ,2
0
cα,2
cβ,2





ω0,2
2




1 0 0 0
0 −1 0 0
0 0 1 0
0 0 0 −1









cα,1
1
0
cβ,1
1
0





=
ω0,2
2





cα,1
1
0
cβ,1
1
0





ω0,1
2




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 ·





0
cα,2
cβ,2
1
cα,2
cβ,2





= −
ω0,1
2





0
cα,2
cβ,2
1
cα,2
cβ,2





ω0,2
2




1 0 0 0
0 −1 0 0
0 0 1 0
0 0 0 −1









cα,1
0
1
cβ,1
0
1





= −
ω0,2
2





cα,1
0
1
cβ,1
0
1





,
(8.73)
or, using direct products,
ω0,1
2
1 0
0 −1
⊗
1 0
0 1
·
1
0
⊗
cα,2
cβ,2
=
ω0,1
2
1
0
⊗
cα,2
cβ,2
ω0,1
2
1 0
0 −1
⊗
1 0
0 1
·
0
1
⊗
cα,2
cβ,2
= −
ω0,1
2
0
1
⊗
cα,2
cβ,2
, (8.74)
ω0,1
2
1 0
0 −1
·
1
0
ψ(2)
= +
ω0,1
2
1
0
ψ(2)
(8.75)
ω0,1
2
1 0
0 −1
·
0
1
ψ(2)
= −
ω0,1
2
0
1
ψ(2)
, (8.76)
ω0,1
2
1 0
0 −1
·
1
0
= +
ω0,1
2
1
0
ω0,1
2
1 0
0 −1
·
0
1
= −
ω0,1
2
0
1
(8.77)
for the ﬁrst magnetic moment, and
ω0,2
2
1 0
0 1
⊗
1 0
0 −1
·
cα,1
cβ,1
⊗
1
0
=
ω0,2
2
cα,1
cβ,1
⊗
1
0
ω0,2
2
1 0
0 1
⊗
1 0
0 −1
·
cα,1
cβ,1
⊗
0
1
= −
ω0,2
2
cα,1
cβ,1
⊗
0
1
, (8.78)
ω0,2
2
1 0
0 −1
·
1
0
ψ(1)
= +
ω0,2
2
1
0
ψ(1)
ω0,2
2
1 0
0 −1
·
0
1
ψ(1)
= −
ω0,2
2
0
1
ψ(1)
, (8.79)
ω0,2
2
1 0
0 −1
·
1
0
= +
ω0,2
2
1
0
ω0,2
2
1 0
0 −1
·
0
1
= −
ω0,2
2
0
1
(8.80)
for the second magnetic moment.
We see that the eigenfunctions of Eq. 8.77 are
1
0
and
0
1
for any ψ(2) =
cα,2
cβ,2
, and that the eigenfunctions of Eq. 8.80 are
also
1
0
and
0
1
for any ψ(1) =
cα,1
cβ,1
. The energy diﬀerences, given by the diﬀerences of the eigenvalues, are ω0,1 and ω0,2 . As
214
the left equation does not depend on ψ(2) and the right equation does not depend on ψ(1), the original set of four equations, represented
by the 4-dimensional matrices, was redundant. If the nuclei are identical, the left and right equations can be replaced by a single equation
with ω0,1 = ω0,2 = ω0 (cf. treatment of indistinguishable nuclei in Section 6.7.3). Such case is equivalent to the mixed state described by
the 2 × 2 density matrix in Section 6.1.
If the magnetic moments interact (D = 0) and the Hamiltonian cannot be simpliﬁed to Eq. 8.71, we have to work with four-dimensional
matrices and state vectors. The Hamiltonian then has the following matrix representation
ˆH =





ω0,1+ω0,2
2
+ D
2
0 0 0
0
ω0,1−ω0,2
2
− D
2
−D 0
0 −D −
ω0,1−ω0,2
2
+ D
2
0
0 0 0 −
ω0,1+ω0,2
2
+ D
2





. (8.81)
If ω0,1 and ω0,2 diﬀer substantially, secular approximation allows us to neglect also the −ˆI1x
ˆI2x − ˆI1y
ˆI2y terms and to obtain a
diagonal Hamiltonian matrix
ˆH ≈





ω0,1+ω0,2
2
+ D
2
0 0 0
0
ω0,1−ω0,2
2
− D
2
0 0
0 0 −
ω0,1−ω0,2
2
+ D
2
0
0 0 0 −
ω0,1+ω0,2
2
+ D
2





(8.82)
with four-dimensional eigenvectors




1
0
0
0



 ,




0
1
0
0



 ,




0
0
1
0



 ,




0
0
0
1



 . (8.83)
If ω0,1 and ω0,2 are similar, the oﬀ-diagonal elements warn us that the vectors listed above (direct products of |α and |β ) are no
longer eigenfunctions of the Hamiltonian in Eq 8.82. Note that the analysis presented in this Lecture and in the following Lectures cannot
be applied to such spin systems. We return to the interacting magnetic moments with very similar ω0,1 and ω0,2 in the end of our course
(Section 12.2).
8.9.4 Product operator bases
The basis presented in Tables 8.1 and 8.2 is used most frequently in NMR spectroscopy, but other choices are better suited for certain
applications. Here, we brieﬂy review several commonly used orthonormal bases of two-spin systems and comment their advantages.
• Cartesian product operator basis is presented in Tables 8.1 and 8.2, and in a condensed version in Table 8.3. The basis matrices
are written as 2InjIn k, where n and n specify the nucleus and j, k ∈ {x, y, z, t}, but the unit matrix 2Int is usually not
written in the product. As discussed in Section 8.4, the Cartesian product operator basis is well suited to describe contributions
(populations and coherences) to the probability density matrix ˆρ. Also, certain matrices after multiplication by appropriate physical
constants represent operators of components of spin angular momentum, magnetic moment, and consequently constituents of various
Hamiltonians.
• Coeﬃcients Cj describing contributions of single-element product operator basis to the density matrix (Table 8.4) are equal to
individual elements of the density matrix. The basis matrices are written as I1jI2k = I
(1)
j ⊗ I
(2)
k , where 1 and 2 specify the
nucleus and j, k ∈ {α, β, +, −}. The relationship between Cartesian and single-element product operator bases is given by Eqs. 6.5
and 6.4, applied to each I
(n)
j matrix in the product:
I
(n)
α = I
(n)
t + I
(n)
z I
(n)
β = I
(n)
t − I
(n)
z I
(n)
+ = I
(n)
x + iI
(n)
y I
(n)
− = I
(n)
x − iI
(n)
y (8.84)
I
(n)
x =
1
2
I
(n)
+ + I
(n)
− I
(n)
y = −
i
2
I
(n)
+ − I
(n)
− I
(n)
z =
1
2
I
(n)
α − I
(n)
β I
(n)
t =
1
2
I
(n)
α + I
(n)
β (8.85)
• Shift product operator basis, presented in Table 8.5, can be viewed as a combination of the previous two choices. The basis matrices
are direct products of the normalized matrices I
(n)
+ , I
(n)
− ,
√
2I
(n)
z , and
√
2I
(n)
t . The shift product operators directly reﬂect the
coherence between spin states. The value of j +k in InjIn k distinguishes single-quantum (j +k = ±1), zero-quantum (j +k = 0),
and double-quantum (j + k = ±2) coherences.
• Irreducible spherical tensor operators, presented in Table 8.6, are useful when rotation (of molecules, chemical groups) needs to be
taken into account (e.g., when analyzing stochastic motions resulting in relaxation).
8.9. SUPPORTING INFORMATION 215
I
(00)
0,0 = It (8.86)
I
(10)
1,−1 =
1
√
2
(I1x − iI1y) = I1− (8.87)
I
(10)
1,0 = I1z = I10 (8.88)
I
(10)
1,+1 = −
1
√
2
(I1x + iI1y) = −I1+ (8.89)
I
(02)
1,−1 =
1
√
2
(I2x − iI2y) = I2− (8.90)
I
(02)
1,0 = I2z = I20 (8.91)
I
(02)
1,+1 = −
1
√
2
(I2x + iI2y) = −I2+ (8.92)
I
(12)
0,0 = −
2
√
3
(I1xI2x + I1yI2y + I1zI2z) = −
1
√
3
(I1−I2+ + I10I20 + I1+I2−) (8.93)
I
(12)
1,−1 = −(I1xI2z − I1zI2x − i(I1yI2z − I1zI2y)) = −
1
√
2
(I1−I20 − I10I2−) (8.94)
I
(12)
1,0 =
√
2i(I1xI2y − I1yI2x) = −
1
√
2
(I1+I2− − I1−I2+) (8.95)
I
(12)
1,+1 = −(I1xI2z − I1zI2x + i(I1yI2z − I1zI2y)) = −
1
√
2
(I1+I20 − I10I2+) (8.96)
I
(12)
2,−2 = I1xI2x − I1yI2y − i(I1xI2y + I1yI2x) = I1−I2− (8.97)
I
(12)
2,−1 = +(I1xI2z + I1zI2x − i(I1yI2z + I1zI2y)) = +
1
√
2
(I1−I20 + I10I2−) (8.98)
I
(12)
2,0 = −
2
3
(I1xI2x + I1yI2y − 2I1zI2z) = −
1
√
6
(I1−I2+ − 2I10I20 + I1+I2−) (8.99)
I
(12)
2,+1 = −(I1xI2z + I1zI2x + i(I1yI2z + I1zI2y)) = −
1
√
2
(I1+I20 + I10I2+) (8.100)
I
(12)
2,+2 = I1xI2x − I1yI2y + i(I1xI2y + I1yI2x) = I1+I2+ (8.101)
8.9.5 Deriving commutators of product operators
The product operators are direct products of 2 × 2 matrices Ix, Iy, Iz, It,. Therefore, commutators of product operators can be derived
from the their relations and from the general properties of the direct product of matrices. In general expressions used in this section letters
j, k, l, m replace one of the subscript x, y, z (but not t), n, n distinguish nuclei (1 or 2), and δjk = 1 for j = k, and δjk = 1 for j = k.
Products of the 2 × 2 matrices Ix, Iy, Iz are related in the following manner (cf Eqs. 4.35–4.38)
Ix · Iy − Iy · Ix = [Ix, Iy] = iIz, (8.102)
Iy · Iz − Iz · Iy = [Iy, Iz] = iIx, (8.103)
Iz · Ix − Ix · Iz = [Iz, Ix] = iIy, (8.104)
Ij · Ik + Ik · Ij = δjkIt. (8.105)
The following properties of the direct (Kronecker) products allow us to ﬁnd the commutation relation also for the product operators.
216
Table 8.3: Cartesian basis of product operators for a pair of spin-1
2 nuclei
It = 1
2




+1 0 0 0
0 +1 0 0
0 0 +1 0
0 0 0 +1



 I1z = 1
2




+1 0 0 0
0 +1 0 0
0 0 −1 0
0 0 0 −1



 I2z = 1
2




+1 0 0 0
0 −1 0 0
0 0 +1 0
0 0 0 −1



 2I1zI2z = 1
2




+1 0 0 0
0 −1 0 0
0 0 −1 0
0 0 0 +1




I1x = 1
2




0 0 +1 0
0 0 0 +1
+1 0 0 0
0 +1 0 0



 2I1xI2z = 1
2




0 0 +1 0
0 0 0 −1
+1 0 0 0
0 −1 0 0



 I1y = 1
2




0 0 −i 0
0 0 0 −i
+i 0 0 0
0 +i 0 0



 2I1yI2z = 1
2




0 0 −i 0
0 0 0 +i
+i 0 0 0
0 −i 0 0




I2x = 1
2




0 +1 0 0
+1 0 0 0
0 0 0 +1
0 0 +1 0



 2I1zI2x = 1
2




0 +1 0 0
+1 0 0 0
0 0 0 −1
0 0 −1 0



 I2y = 1
2




0 −i 0 0
+i 0 0 0
0 0 0 −i
0 0 +i 0



 2I1zI2y = 1
2




0 −i 0 0
+i 0 0 0
0 0 0 +i
0 0 −i 0




2I1xI2x = 1
2




0 0 0 +1
0 0 +1 0
0 +1 0 0
+1 0 0 0



 2I1yI2y = 1
2




0 0 0 −1
0 0 +1 0
0 +1 0 0
−1 0 0 0



 2I1xI2y = 1
2




0 0 0 −i
0 0 +i 0
0 −i 0 0
+i 0 0 0



 2I1yI2x = 1
2




0 0 0 −i
0 0 −i 0
0 +i 0 0
+i 0 0 0




Table 8.4: Single-element basis of product operators for a pair of spin-1
2 nuclei
I1αI2α =




1 0 0 0
0 0 0 0
0 0 0 0
0 0 0 0



 I1αI2β =




0 0 0 0
0 1 0 0
0 0 0 0
0 0 0 0



 I1βI2α =




0 0 0 0
0 0 0 0
0 0 1 0
0 0 0 0



 I1βI2β =




0 0 0 0
0 0 0 0
0 0 0 0
0 0 0 1




I1αI2+ =




0 1 0 0
0 0 0 0
0 0 0 0
0 0 0 0



 I1αI2− =




0 0 0 0
1 0 0 0
0 0 0 0
0 0 0 0



 I1βI2+ =




0 0 0 0
0 0 0 0
0 0 0 1
0 0 0 0



 I1βI2− =




0 0 0 0
0 0 0 0
0 0 0 0
0 0 1 0




I1+I2α =




0 0 1 0
0 0 0 0
0 0 0 0
0 0 0 0



 I1−I2α =




0 0 0 0
0 0 0 0
1 0 0 0
0 0 0 0



 I1+I2β =




0 0 0 0
0 0 0 1
0 0 0 0
0 0 0 0



 I1−I2β =




0 0 0 0
0 0 0 0
0 0 0 0
0 1 0 0




I1+I2+ =




0 0 0 1
0 0 0 0
0 0 0 0
0 0 0 0



 I1+I2− = −




0 0 0 0
0 0 1 0
0 0 0 0
0 0 0 0



 I1−I2+ = −




0 0 0 0
0 0 0 0
0 1 0 0
0 0 0 0



 I1−I2− =




0 0 0 0
0 0 0 0
0 0 0 0
1 0 0 0




8.9. SUPPORTING INFORMATION 217
Table 8.5: Shift product operator basis for a pair of spin-1
2 nuclei
It = 1
2




1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1




I1+ = 1√
2




0 0 1 0
0 0 0 1
0 0 0 0
0 0 0 0



 I2+ = 1√
2




0 1 0 0
0 0 0 0
0 0 0 1
0 0 0 0




I10 = 1
2




+1 0 0 0
0 +1 0 0
0 0 −1 0
0 0 0 −1



 I20 = 1
2




+1 0 0 0
0 −1 0 0
0 0 +1 0
0 0 0 −1




I1− = 1√
2




0 0 0 0
0 0 0 0
1 0 0 0
0 1 0 0



 I2− = 1√
2




0 0 0 0
1 0 0 0
0 0 0 0
0 0 1 0




I1+I2+ =




0 0 0 1
0 0 0 0
0 0 0 0
0 0 0 0




I1+I20 = 1√
2




0 0 +1 0
0 0 0 −1
0 0 0 0
0 0 0 0



 I10I2+ = 1√
2




0 +1 0 0
0 0 0 0
0 0 0 −1
0 0 0 0




I1+I2− =




0 0 0 0
0 0 1 0
0 0 0 0
0 0 0 0



 I10I20 = 1
2




+1 0 0 0
0 −1 0 0
0 0 −1 0
0 0 0 +1



 I1−I2+ =




0 0 0 0
0 0 0 0
0 1 0 0
0 0 0 0




I10I2− = 1√
2




0 0 0 0
+1 0 0 0
0 0 0 0
0 0 −1 0



 I1−I20 = 1√
2




0 0 0 0
0 0 0 0
+1 0 0 0
0 −1 0 0




I1−I2− =




0 0 0 0
0 0 0 0
0 0 0 0
1 0 0 0




218
Table 8.6: Basis of irreducible spherical operators for a pair of spin-1
2 nuclei
I
(10)
1,−1 = 1√
2




0 0 0 0
0 0 0 0
+1 0 0 0
0 +1 0 0



 I
(02)
1,−1 = 1√
2




0 0 0 0
+1 0 0 0
0 0 0 0
0 0 +1 0




I
(00)
0,0 = 1
2




+1 0 0 0
0 +1 0 0
0 0 +1 0
0 0 0 +1



 I
(10)
1,0 = 1
2




+1 0 0 0
0 +1 0 0
0 0 −1 0
0 0 0 −1



 I
(02)
1,0 = 1
2




+1 0 0 0
0 −1 0 0
0 0 +1 0
0 0 0 −1




I
(10)
1,+1 = − 1√
2




0 0 +1 0
0 0 0 +1
0 0 0 0
0 0 0 0



 I
(02)
1,+1 = − 1√
2




0 +1 0 0
0 0 0 0
0 0 0 +1
0 0 0 0




I
(12)
2,−2 =




0 0 0 0
0 0 0 0
0 0 0 0
+1 0 0 0




I
(12)
1,−1 = 1
2




0 0 0 0
+1 0 0 0
−1 0 0 0
0 +1 −1 0



 I
(12)
2,−1 = 1
2




0 0 0 0
+1 0 0 0
+1 0 0 0
0 −1 −1 0




I
(12)
0,0 = 1
2
√
3




+1 0 0 0
0 −1 +2 0
0 +2 −1 0
0 0 0 +1



 I
(12)
1,0 = 1√
2




0 0 0 0
0 0 −1 0
0 +1 0 0
0 0 0 0



 I
(12)
2,0 = 1√
6




+1 0 0 0
0 −1 −1 0
0 −1 −1 0
0 0 0 +1




I
(12)
1,+1 = 1
2




0 +1 −1 0
0 0 0 +1
0 0 0 −1
0 0 0 0



 I
(12)
2,+1 = 1
2




0 −1 −1 0
0 0 0 +1
0 0 0 +1
0 0 0 0




I
(12)
2,+2 =




0 0 0 +1
0 0 0 0
0 0 0 0
0 0 0 0




8.9. SUPPORTING INFORMATION 219
( ˆA ⊗ ˆB) + ( ˆA ⊗ ˆC) = ˆA ⊗ ( ˆB + ˆC), (8.106)
( ˆA ⊗ ˆB) · ( ˆC ⊗ ˆD) = ( ˆA · ˆC) ⊗ ( ˆB · ˆD). (8.107)
First, we derive commutation relations among operators of the form Inj. Eq. 8.106 shows that
2It ⊗ (Ij · Ik) ± 2It ⊗ (Ik · Ij) = 2It ⊗ (Ij · Ik ± Ik · Ij), (8.108)
2(Ij · Ik) ⊗ It ± 2(Ij · Ik) ⊗ It = 2(Ij · Ik ± Ik · Ij) ⊗ It. (8.109)
Therefore, the relations among I1x, I1y, I1z and I2x, I2y, I2z can be obtained simply by replacing subscripts x, y, z in Eqs. 8.102–
8.105 by the subscripts 1x, 1y, 1z and 2x, 2y, 2z. This is written in a concise form in Eq. 8.29.
Second, we derive commutation relations between operators Inj and 2InkIn l. Their commutator is
[Inj, 2InkIn l] = 2InjInkIn l − 2InkIn lInj. (8.110)
Eq. 8.107 implies
I1jI2k = (Ij ⊗ It) · (It ⊗ Ik) = (Ij · It) ⊗ (It · Ik) =
1
4
Ij ⊗ Ik, (8.111)
I2kI1j = (It ⊗ Ik) · (Ij ⊗ It) = (It · Ij) ⊗ (Ik · It) =
1
4
Ij ⊗ Ik. (8.112)
Therefore, I1jI2k − I2kI1j = 0, i.e., I1j and I2k (operators of magnetic moment components of diﬀerent nuclei) commute and
can be applied in any order:
I1jI2k = I2kI1j (8.113)
This allows us to switch the last two operators in Eq. 8.110 and obtain the relation described by Eq. 8.30:
2InjInkIn l − 2InkIn lInj = 2InjInkIn l − 2InkInjIn l = 2[Inj, Ink]In l. (8.114)
Third, we derive commutation relations between operators 2InjIn l and 2InkIn m
[2InjIn l, 2InkIn m] = 4InjIn lInkIn m − 4InkIn mInjIn l. (8.115)
We start by switching the commuting operators of magnetic moment components of diﬀerent nuclei In l, Ink and In mInj.
[2InjIn l, 2InkIn m] = 4InjInkIn lIn m − 4InkInjIn mIn l. (8.116)
Then we use Eqs. 8.102–8.105 to express
2InjInk = (InjInk − InkInj) + (InjInk + InkInj) = [Inj, Ink] + δjkIt, (8.117)
−2InkInj = (InjInk − InkInj) − (InjInk + InkInj) = [Inj, Ink] − δjkIt, (8.118)
2In lIn m = (In lIn m − In mIn l) + (In lIn m + In mIn l) = [In l, In m] + δlmIt, (8.119)
−2In mIn l = (In lIn m − In mIn l) − (In lIn m + In mIn l) = [In l, In m] − δlmIt. (8.120)
Inserting the obtained expressions into Eq. 8.116 results in Eq. 8.31
[2InjIn l, 2InkIn m] = 4InjInkIn lIn m − 4InkInjIn mIn l =
([Inj, Ink] + δjkIt)([In l, In m] + δlmIt) − ([Inj, Ink] − δjkIt)([In l, In m] − δlmIt) = [Inj, Ink]δlm + [In l, In m]δjk. (8.121)
Note that
j = k ⇒ [Inj, Ink] = 0, δjk = 1 (8.122)
l = m ⇒ [In l, In m] = 0, δlm = 1. (8.123)
220
8.9.6 Dipole-dipole relaxation: derivation
The Bloch-Wangsness-Redﬁeld theory (see Section 7.10.3) describes also the relaxation due to the dipole-dipole interactions. The Liouvillevon
Neumann equation has the same form as Eq. 7.44, only the chemical shift Hamiltonian is replaced by the Hamiltonian describing the
interactions of spin magnetic moments:
d∆ˆρ
dt
= −
i
[ ˆHD, ∆ˆρ], (8.124)
In order to describe the dipole-dipole relaxation on the quantum level, it is useful to work in spherical coordinates and to convert the
product operators constituting the Hamiltonian ˆHD to a diﬀerent basis. The operators ˆI1x
ˆI2z, ˆI1y
ˆI2z, ˆI1z
ˆI2x, ˆI1z
ˆI2y are transformed using
the relation ˆI± = ˆIx ± iˆIy):
ˆI1x
ˆI2z =
1
2
(+ˆI1+
ˆI2z + ˆI1−
ˆI2z), (8.125)
ˆI1y
ˆI2z =
i
2
(−ˆI1+
ˆI2z + ˆI1−
ˆI2z), (8.126)
ˆI1z
ˆI2x =
1
2
(+ˆI1z
ˆI2+ + ˆI1z
ˆI2−), (8.127)
ˆI1z
ˆI2y =
i
2
(−ˆI1z
ˆI2+ + ˆI1z
ˆI2−). (8.128)
Since
cos ϕ + i sin ϕ = eiϕ
, (8.129)
cos ϕ − i sin ϕ = e−iϕ
, (8.130)
3 sin ϑ cos ϑ(ˆI1x
ˆI2z cos ϕ + ˆI1y
ˆI2z sin ϕ + ˆI1z
ˆI2x cos ϕ + ˆI1z
ˆI2y sin ϕ).
=
3
2
sin ϑ cos ϑ(ˆI1+
ˆI2ze−iϕ
+ ˆI1−
ˆI2zeiϕ
+ ˆI1z
ˆI2+e−iϕ
+ ˆI1z
ˆI2−eiϕ
) (8.131)
The ˆI1x
ˆI2x, ˆI1y
ˆI2y, ˆI1x
ˆI2y, ˆI1y
ˆI2x are transformed in a similar fashion
ˆI1x
ˆI2y =
i
4
(+ˆI1+
ˆI2− − ˆI1−
ˆI2+ − ˆI1+
ˆI2+ + ˆI1−
ˆI2−),
ˆI1y
ˆI2x =
i
4
(−ˆI1+
ˆI2− + ˆI1−
ˆI2+ − ˆI1+
ˆI2+ + ˆI1−
ˆI2−),
ˆI1x
ˆI2x =
1
4
(+ˆI1+
ˆI2− + ˆI1−
ˆI2+ + ˆI1+
ˆI2+ + ˆI1−
ˆI2−),
ˆI1y
ˆI2y =
1
4
(+ˆI1+
ˆI2− + ˆI1−
ˆI2+ − ˆI1+
ˆI2+ − ˆI1−
ˆI2−),
and
3 sin2
ϑ(ˆI1x
ˆI2x cos2
ϕ + ˆI1y
ˆI2y sin2
ϕ + ˆI1x
ˆI2y sin ϕ cos ϕ + ˆI1y
ˆI2x sin ϕ cos ϕ) − (ˆI1x
ˆI2x + ˆI1y
ˆI2y)
=
3
4
sin2
ϑ ( ˆI1+
ˆI2−(cos2
ϕ + sin2
ϕ + i sin ϕ cos ϕ − i sin ϕ cos ϕ)
+ˆI1−
ˆI2+(cos2
ϕ + sin2
ϕ − i sin ϕ cos ϕ + i sin ϕ cos ϕ)
+ˆI1+
ˆI2+(cos2
ϕ − sin2
ϕ − i sin ϕ cos ϕ − i sin ϕ cos ϕ)
+ˆI1−
ˆI2−(cos2
ϕ − sin2
ϕ + i sin ϕ cos ϕ + i sin ϕ cos ϕ) )
−
1
4
(2ˆI1+
ˆI2− + 2ˆI1−
ˆI2+)
=
1
4
ˆI1+
ˆI2−(3 sin2
ϑ − 2) +
1
4
ˆI1−
ˆI2+(3 sin2
ϑ − 2)
+
3
4
ˆI1+
ˆI2+ sin2
ϑe−i2ϕ
+
3
4
ˆI1−
ˆI2− sin2
ϑei2ϕ
= −
1
4
ˆI1+
ˆI2−(3 cos2
ϑ − 1) −
1
4
ˆI1−
ˆI2+(3 cos2
ϑ − 1)
+
3
4
ˆI1+
ˆI2+ sin2
ϑe−i2ϕ
+
3
4
ˆI1−
ˆI2− sin2
ϑei2ϕ
. (8.132)
8.9. SUPPORTING INFORMATION 221
Using Eqs. 8.131 and 8.132 and moving to the interaction frame (ˆIn± → ˆIn±e±iω0,nt), Eq. 8.61 is converted to
ˆHI
D = −
µ0γ1γ2
4πr3
ˆI1z
ˆI2z(3 cos2
ϑ − 1)
−
1
4
ˆI1+
ˆI2−(3 cos2
ϑ − 1)ei(ω0,1−ω0,2)t
−
1
4
ˆI1−
ˆI2+(3 cos2
ϑ − 1)e−i(ω0,1−ω0,2)t
+
3
2
ˆI1+
ˆI2z sin ϑ cos ϑe−iϕ
ei(ω0,1)t
+
3
2
ˆI1−
ˆI2z sin ϑ cos ϑeiϕ
e−i(ω0,1)t
+
3
2
ˆI1z
ˆI2+ sin ϑ cos ϑe−iϕ
ei(ω0,2)t
+
3
2
ˆI1z
ˆI2− sin ϑ cos ϑeiϕ
e−i(ω0,2)t
+
3
4
ˆI1+
ˆI2+ sin2
ϑe−i2ϕ
ei(ω0,1+ω0,2)t
+
3
4
ˆI1−
ˆI2− sin2
ϑei2ϕ
e−i(ω0,1+ω0,2)t
= −b 2czz ˆI1z
ˆI2z −
1
2
c+− ˆI1+
ˆI2− −
1
2
c−+ ˆI1−
ˆI2+
+
3
2
c+z ˆI1+
ˆI2z + c−z ˆI1−
ˆI2z + cz+ ˆI1z
ˆI2+ + cz− ˆI1z
ˆI2− + c++ ˆI1+
ˆI2+ + c−− ˆI1−
ˆI2− . (8.133)
The diﬀerence of the density matrix from its equilibrium form, written in a bases including the operators used to deﬁne ˆHD, is in
general
∆ˆρ = dt
ˆIt + d1z
ˆI1z + d1+
ˆI1+ + d1−
ˆI1− + d2z
ˆI2z + d2+
ˆI2+ + d2−
ˆI2−
+ dzz
ˆI1z
ˆI2z + d+−
ˆI1+
ˆI2− + d−+
ˆI1−
ˆI2+ + d+z
ˆI1+
ˆI2z + d−z
ˆI1−
ˆI2z + dz+
ˆI1z
ˆI2+ + dz−
ˆI1z
ˆI2− + d++
ˆI1+
ˆI2+ + d−−
ˆI1−
ˆI2−.
(8.134)
However, here we analyze only evolution of d1z
ˆI1z, d2z
ˆI2z, d1+
ˆI1+, needed to describe relaxation of ∆ M1z , ∆ M1z , and M1+ .
Similarly to Eq. 7.50, the dipole-dipole relaxation is described by
d∆ˆρ
dt
= −
1
2
∞ˆ
0
[ ˆHD(0), [ ˆHD(t), ∆ˆρ]]dt. (8.135)
The right-hand side can be simpliﬁed dramatically by the secular approximation as in Eq. 7.50: all terms with e±iω0,nt are averaged
to zero. Only terms with (czz)2, cz+cz−, c+zc−z, c+−c−+, and c++c−− are non zero (all equal to 1/5 at tj = 0).9 This reduces the
number of double commutators to be expressed from 81 to 9 for each density matrix component. The double commutators needed to
describe relaxation rates of the contributions of the ﬁrst nucleus to the magnetization M1z and M1+ are
ˆI1z
ˆI2z, [ˆI1z
ˆI2z, ˆI1z] = 0, (8.136)
ˆI1−
ˆI2+, [ˆI1+
ˆI2−, ˆI1z] = 2
(ˆI1z − ˆI2z), (8.137)
ˆI1+
ˆI2−, [ˆI1−
ˆI2+, ˆI1z] = 2
(ˆI1z − ˆI2z), (8.138)
ˆI1+
ˆI2z, [ˆI1−
ˆI2z, ˆI1z] =
1
2
2 ˆI1z, (8.139)
ˆI1−
ˆI2z, [ˆI1+
ˆI2z, ˆI1z] =
1
2
2 ˆI1z, (8.140)
ˆI1z
ˆI2+, [ˆI1z
ˆI2−, ˆI1z] = 0, (8.141)
ˆI1z
ˆI2−, [ˆI1z
ˆI2+, ˆI1z] = 0, (8.142)
ˆI1+
ˆI2+, [ˆI1−
ˆI2−, ˆI1z] = 2
(ˆI1z + ˆI2z), (8.143)
ˆI1−
ˆI2−, [ˆI1+
ˆI2+, ˆI1z] = 2
(ˆI1z + ˆI2z), (8.144)
9
Averaging over all molecules makes all correlation functions identical in isotropic liquids.
222
ˆI1z
ˆI2z, [ˆI1z
ˆI2z, ˆI2z] = 0, (8.145)
ˆI1−
ˆI2+, [ˆI1+
ˆI2−, ˆI2z] = 2
(ˆI2z − ˆI1z), (8.146)
ˆI1+
ˆI2−, [ˆI1−
ˆI2+, ˆI2z] = 2
(ˆI2z − ˆI1z), (8.147)
ˆI1+
ˆI2z, [ˆI1−
ˆI2z, ˆI2z] =
1
2
2 ˆI2z, (8.148)
ˆI1−
ˆI2z, [ˆI1+
ˆI2z, ˆI2z] =
1
2
2 ˆI2z, (8.149)
ˆI1z
ˆI2+, [ˆI1z
ˆI2−, ˆI2z] = 0, (8.150)
ˆI1z
ˆI2−, [ˆI1z
ˆI2+, ˆI2z] = 0, (8.151)
ˆI1+
ˆI2+, [ˆI1−
ˆI2−, ˆI2z] = 2
(ˆI2z + ˆI1z), (8.152)
ˆI1−
ˆI2−, [ˆI1+
ˆI2+, ˆI2z] = 2
(ˆI2z + ˆI1z), (8.153)
ˆI1z
ˆI2z, [ˆI1z
ˆI2z, ˆI1+] =
1
4
2 ˆI1+, (8.154)
ˆI1+
ˆI2−, [ˆI1−
ˆI2+, ˆI1+] = 2 ˆI1+, (8.155)
ˆI1−
ˆI2+, [ˆI1+
ˆI2−, ˆI1+] = 0, (8.156)
ˆI1+
ˆI2z, [ˆI1−
ˆI2z, ˆI1+] =
1
2
2 ˆI1+, (8.157)
ˆI1−
ˆI2z, [ˆI1+
ˆI2z, ˆI1+] = 0, (8.158)
ˆI1z
ˆI2+, [ˆI1z
ˆI2−, ˆI1+] =
1
2
2 ˆI1+, (8.159)
ˆI1z
ˆI2−, [ˆI1z
ˆI2+, ˆI1+] =
1
2
2 ˆI1+, (8.160)
ˆI1+
ˆI2+, [ˆI1−
ˆI2−, ˆI1+] = 0, (8.161)
ˆI1−
ˆI2−, [ˆI1+
ˆI2+, ˆI1+] =
1
2
2 ˆI1+. (8.162)
The relaxation rates can be then derived as described for the relaxation due to the chemical shift in Section 7.10.3.
For ∆M1z,
∆M1z = Tr{∆ˆρ ˆM1z} = NγTr{∆ˆρˆI1z}. (8.163)
As discussed in Section 7.10.3, the orthogonality of basis matrices reduces the left-hand side of Eq. 8.135 to
dd1z
dt
ˆI1z. (8.164)
Expressing the terms with the non-zero double commutators in the right-hand side of Eq. 8.135 results in six integrals
dd1z
dt
Tr{ˆI1z
ˆI1z} = −

1
4
b2
∞ˆ
0
c+−(0)c−+(t)ei(ω0,1−ω0,2)t
dt +
1
4
b2
∞ˆ
0
c−+(0)c+−(t)e−i(ω0,1−ω0,2)t
dt

 d1z(Tr{ˆI1z
ˆI1z} − Tr{ˆI2z
ˆˆI2z})
−

3
4
b2
∞ˆ
0
c+z(0)c−z(t)eiω0,1t
dt +
3
4
b2
∞ˆ
0
c−z(0)c+z(t)e−iω0,1t
dt

 d1zTr{ˆI1z
ˆI1z}
−

3
2
b2
∞ˆ
0
c++(0)c−−(t)ei(ω0,1+ω0,2)t
dt +
3
2
b2
∞ˆ
0
c−−(0)c++(t)e−i(ω0,1+ω0,2)t
dt

 (d1z(Tr{ˆI1z
ˆI1z} + Tr{ˆI2z
ˆI2z}).
(8.165)
8.9. SUPPORTING INFORMATION 223
As both sides of the equation contain the same coeﬃcients, dnzTr{ˆInz
ˆInz} can be converted to ∆ Mnz :
d∆ M1z
dt
= −

 1
4
b2
∞ˆ
0
c+−(0)c−+(t)ei(ω0,1−ω0,2)t
dt +
1
4
b2
∞ˆ
0
c−+(0)c+−(t)e−i(ω0,1−ω0,2)t
dt

 (∆ M1z − ∆ M2z )
−

 3
4
b2
∞ˆ
0
c+z(0)c−z(t)eiω0,1t
dt +
3
4
b2
∞ˆ
0
c−z(0)c+z(t)e−iω0,1t
dt

 ∆ M1z
−

 3
2
b2
∞ˆ
0
c++(0)c−−(t)ei(ω0,1+ω0,2)t
dt +
3
2
b2
∞ˆ
0
c−−(0)c++(t)e−i(ω0,1+ω0,2)t
dt

 (∆ M1z + ∆ M2z ). (8.166)
If the ﬂuctuations are random and consequently stationary, the current orientation of the molecule is correlated with the orientation
in the past in the same manner as it is correlated with the orientation in the future (see Section 7.10.3), and the bounds of the integrals
can be changed
d∆ M1z
dt
= −

 1
8
b2
∞ˆ
−∞
c+−(0)c−+(t)ei(ω0,1−ω0,2)t
dt +
1
8
b2
∞ˆ
−∞
c−+(0)c+−(t)e−i(ω0,1−ω0,2)t
dt

 (∆ M1z − ∆ M2z )
−

 3
8
b2
∞ˆ
−∞
c+z(0)c−z(t)eiω0,1t
dt +
3
8
b2
∞ˆ
−∞
c−z(0)c+z(t)e−iω0,1t
dt

 ∆ M1z
−

 3
4
b2
∞ˆ
−∞
c++(0)c−−(t)ei(ω0,1+ω0,2)t
dt +
3
4
b2
∞ˆ
−∞
c−−(0)c++(t)e−i(ω0,1+ω0,2)t
dt

 (∆ M1z + ∆ M2z ). (8.167)
Collecting the real parts of integrals preceding ∆ Mz of the same nucleus, noting that they are identical with the deﬁnitions of the
spectral density functions, and assuming J(ω) ≈ J(−ω),
d∆ M1z
dt
= −
1
8
b2
(2J(ω0,1 − ω0,2) + 6J(ω0,1) + 12J(ω0,1 + ω0,2))∆ M1z
+
1
8
b2
(2J(ω0,1 − ω0,2) − 12J(ω0,1 + ω0,2))∆ M2z
= − Ra1∆ M1z − Rx∆ M2z . (8.168)
The corresponding expression for relaxation of ∆ M2z is obtained in the same manner (or simply by switching subscripts 1 and 2 in
the result):
d∆ M2z
dt
= −
1
8
b2
(2J(ω0,2 − ω0,1) + 6J(ω0,2) + 12J(ω0,2 + ω0,1))∆ M2z
+
1
8
b2
(2J(ω0,2 − ω0,1) − 12J(ω0,2 + ω0,1))∆ M1z
= − Ra2∆ M2z − Rx∆ M1z . (8.169)
The same approach is applied to M1+.
∆ M1+ ≡ M1+ = Tr{∆ˆρ ˆM1+}. (8.170)
The operator of M1+ for one magnetic moment observed is
ˆM1+ = Nγ1
ˆI1+ = Nγ1(ˆI1x + iˆI1y). (8.171)
Due to the orthogonality of basis matrices, the left-hand side of Eq. 8.135 reduces to
dd1+
dt
ˆI1+eiω0,1t
(8.172)
The terms with the non-zero double commutators in the right-hand side of Eq. 8.135 give six integrals
224
dd1+
dt
Tr{ˆI1−
ˆI1+} = −b2


∞ˆ
0
czz(0)czz(t)dt +
3
4
∞ˆ
0
cz+(0)cz−(t)eiω0,2t
dt +
3
4
∞ˆ
0
cz−(0)cz+(t)e−iω0,2t
dt
+
1
4
∞ˆ
0
c+−(0)c−+(t)ei(ω0,1−ω0,2)t
dt +
3
4
∞ˆ
0
c+z(0)c−z(t)eiω0,1t
dt +
3
2
∞ˆ
0
c−−(0)c++(t)e−i(ω0,1+ω0,2)t
dt

 d1+Tr{ˆI1−
ˆI1+}.
(8.173)
The same coeﬃcients in both sides of the equation allow us to replace d1+Tr{ˆI1−
ˆI1+} by M1+ :
M1+
dt
= −b2


∞ˆ
0
czz(0)czz(t)dt +
3
4
∞ˆ
0
cz+(0)cz−(t)eiω0,2t
dt +
3
4
∞ˆ
0
cz−(0)cz+(t)e−iω0,2t
dt
+
1
4
∞ˆ
0
c+−(0)c−+(t)ei(ω0,1−ω0,2)t
dt +
3
4
∞ˆ
0
c+z(0)c−z(t)eiω0,1t
dt +
3
2
∞ˆ
0
c−−(0)c++(t)e−i(ω0,1+ω0,2)t
dt

 M1+ .
(8.174)
Like in the expression for ∆ M1z , the bounds of the integrals can be changed
M1+
dt
= −b2

1
2
∞ˆ
−∞
czz(0)czz(t)dt +
3
8
∞ˆ
−∞
cz+(0)cz−(t)eiω0,2t
dt +
3
8
∞ˆ
−∞
cz−(0)cz+(t)e−iω0,2t
dt
+
1
8
∞ˆ
−∞
c+−(0)c−+(t)ei(ω0,1−ω0,2)t
dt +
3
8
∞ˆ
−∞
c+z(0)c−z(t)eiω0,1t
dt +
3
4
∞ˆ
−∞
c−−(0)c++(t)e−i(ω0,1+ω0,2)t
dt

 M1+
(8.175)
and the real parts of the integrals can be identiﬁed with the spectral density values (assuming J(ω) ≈ J(−ω)), providing the ﬁnal
equation describing relaxation of the transverse magnetization of the ﬁrst nucleus:
d M1+
dt
= −
1
8
b2
(4J(0)+6J(ω0,2)+J(ω0,1−ω0,2)+3J(ω0,2)+6J(ω0,1+ω0,2)) M1+ = −R2,1 M1+ = − R0,1 +
1
2
Ra1 M1+ . (8.176)
8.9.7 Dipole-dipole relaxation: discussion
The following equations, derived in Section 8.9.6, describe relaxation due to the dipole-dipole interactions in a pair of nuclei separated by
a constant distance r:
d∆ M1z
dt
= −
1
8
b2
(2J(ω0,1 − ω0,2) + 6J(ω0,1) + 12J(ω0,1 + ω0,2))∆ M1z
+
1
8
b2
(2J(ω0,1 − ω0,2) − 12J(ω0,1 + ω0,2))∆ M2z
= −Ra1∆ M1z − Rx∆ M2z , (8.177)
d∆ M2z
dt
= −
1
8
b2
(2J(ω0,1 − ω0,2) + 6J(ω0,2) + 12J(ω0,1 + ω0,2))∆ M2z
+
1
8
b2
(2J(ω0,1 − ω0,2) − 12J(ω0,1 + ω0,2))∆ M1z
= −Ra2∆ M2z − Rx∆ M1z , (8.178)
d M1+
dt
= −
1
8
b2
(4J(0) + 6J(ω0,2) + J(ω0,1 − ω0,2) + 3J(ω0,1) + 6J(ω0,1 + ω0,2)) M1+
= −R2,1 M1+ = − R0,1 +
1
2
Ra1 M1+ , (8.179)
8.9. SUPPORTING INFORMATION 225
where
b = −
µ0γ1γ2
4πr3
. (8.180)
The relaxation rate R1 of the dipole-dipole relaxation is the rate of relaxation of the z-component of the total magnetization Mz =
M1z + M2z . R1 is derived by solving the set of Eqs. 8.177 and 8.178. The solution is simple if J(ω0,1) = J(ω0,2) = J(ω0) ⇒ Ra1 =
Ra2 = Ra (this is correct e.g. if both nuclei have the same γ, if the molecule rotates as a sphere, and if internal motions are negligible or
identical for both nuclei).10 Then,
d∆ Mz
dt
= −
1
8
b2
(6J(ω0) + 24J(2ω0))∆ Mz = − (Ra + Rx)
R1
∆ Mz . (8.181)
There are several remarkable diﬀerences between relaxation due to the chemical shift anisotropy and dipole-dipole interactions:
• The relaxation constant R0,1, describing the loss of coherence of the ﬁrst magnetic moment, depends on J(0) like in the case of
the relaxation due to the chemical shift anisotropy. It is a result of changing position of µ1 in the magnetic ﬁeld of µ2 as the
molecule moves (Figure 8.2A). However, R0,1 contains also an additional term, depending on the frequency of the other magnetic
moment, 3b2J(ω0,2)/4. This term has the following physical signiﬁcance. The ﬁeld generated by the second magnetic moment
depends on its state. For example, µ2 in a pure |α state11 reduces the ﬁeld and consequently precession frequency of µ1 if the
internuclear vector is horizontal (Figure 8.1A), whereas µ2 in a pure |β state has the opposite eﬀect.12 If the molecule rotates
about the vertical axis with a frequency that is for a short time close to ω0,2, µ1 stays (for the short time) in a place where the
magnetic ﬁeld of µ2 pulls it in the z direction (Figure 8.2B). A macroscopic consequence of such short and stochastic resonance of
the rate of molecular rotation with ω0,2 is redistribution of µ2 changing the average value of µ2,z (longitudinal polarization of µ2).
Fluctuations of ω0,1 due to the stochastic changes of the longitudinal polarization of µ2 (described by J(ω0,2)) contribute to the
loss of coherence of µ1.13
• The rate constants describing the return to the equilibrium polarization is more complex than for the chemical shift anisotropy
relaxation. In addition to the 3b2J(ω0,1)/4 term, describing eﬀect of stochastic molecular motions resonating with the precession
frequency of µ1 (Figure 8.2C), the auto-relaxation rate Ra1 contains terms depending on the sum and diﬀerence of the precession
frequency of µ1 and µ2. These terms account for temporary resonance of random molecular rotation with the mutual diﬀerence in
the precession of µ1 and µ2. For example, if the molecule rotates for a short time about the vertical axis with an angular frequency
ωmol which is accidentally close to ω0,1 + ω0,2, the precession of µ2 combined with the molecular rotation resonates with the
precession of µ1 (see examples in Figure 8.2D,E). Quantum mechanically, such eﬀects are described by the orientation-dependent
coeﬃcients preceding 2ˆI1x
ˆI2x, 2ˆI1y
ˆI2y, 2ˆI1x
ˆI2y, 2ˆI1y
ˆI2x components in ˆHD, contributing to J(ω0,1 ± ω0,2).
• As mentioned in Section 8.7, return to the equilibrium polarization of nucleus 1 depends also on the actual polarization of nucleus
2. The cross-relaxation is a cause of the nuclear Overhauser eﬀect (NOE), and its dependence on r (approximately proportional
to r−6) allow us to measure distances between hydrogen atoms in molecules.
8.9.8 Two magnetic moments in thermal equilibrium
The initial density matrix describing an ensemble of pairs of nuclear magnetic moments is derived in a similar manner as outlined in
Section 7.10.2 for an ensemble of isolated nuclei. Again, we start from the thermal equilibrium and use the Hamiltonian. The diﬀerence
from the case of isolated nuclei is that Hamiltonian must be represented by a 4 × 4 density matrix in order to describe a pair of mutually
interacting nuclei. If secular approximation is applicable, the matrix representation of the Hamiltonian is diagonal. In general, the
Hamiltonian should include eﬀects of the external ﬁeld B0, of chemical shifts of both nuclei, and of their coupling. However, the dipolar
coupling in isotropic liquids is averaged to zero. It is therefore suﬃcient to write the total Hamiltonian as
10
The general solution gives R1 = 1
2
Ra1 + Ra2 + (Ra1 − Ra2)2 + 4R2
x .
11
Note that we mentioned the |α and |β eigenstates as an example, µ2 can be in reality in many superposition states.
12
The interaction is described here for nuclei with positive γ1 and γ2, e.g. protons.
13
Such changes have a similar eﬀect as the chemical or conformational exchange, modifying the size of the chemical shift tensor (the
chemical/conformational exchange was brieﬂy discussed in Section 2.4). Therefore, 3b2J(ω0,2)/4 adds to R0 like the exchange contribution.
226
ˆH = −γ1B0(1 + δi,1)ˆI1z − γ2B0(1 + δi,2)ˆI2z = −γ1B0(1 + δi,1)
2




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 − γ2B0(1 + δi,2)
2




1 0 0 0
0 −1 0 0
0 0 1 0
0 0 0 −1




=
B0
2




−γ1(1 + δi,1) − γ2(1 + δi,2) 0 0 0
0 −γ1(1 + δi,1) + γ2(1 + δi,2) 0 0
0 0 +γ1(1 + δi,1) − γ2(1 + δi,2) 0
0 0 0 +γ1(1 + δi,1) + γ2(1 + δi,2)




=




Eαα 0 0 0
0 Eαβ 0 0
0 0 Eβα 0
0 0 0 Eββ



 , (8.182)
where the diagonal elements (eigenvalues) are the energies of the eigenstates of a single pair of magnetic moments.
As explained for the isolated nuclei, the oﬀ-diagonal elements of the equilibrium density matrix (coherences) are equal to zero. The
four diagonal elements (populations) represent statistical weights in the relation describing the expected energy of the ensemble of pairs of
coupled magnetic moments
E = PααEαα + PαβEαβ + PβαEβα + PββEββ, (8.183)
The values of the populations are obtained as described in Section 7.10.2:
Peq
αα =
e−Eαα/kBT
e−Eαα/kBT + e−Eαβ /kBT
+ e−Eβα/kBT
+ e−Eββ /kBT
≈
1 − Eαα
kBT
4
, (8.184)
Peq
αβ =
e−Eαβ /kBT
e−Eαα/kBT + e−Eαβ /kBT
+ e−Eβα/kBT
+ e−Eββ /kBT
≈
1 −
Eαβ
kBT
4
, (8.185)
Peq
βα =
e−Eβα/kBT
e−Eαα/kBT + e−Eαβ /kBT
+ e−Eβα/kBT
+ e−Eββ /kBT
≈
1 −
Eβα
kBT
4
, (8.186)
Peq
ββ =
e−Eββ /kBT
e−Eαα/kBT + e−Eαβ /kBT
+ e−Eβα/kBT
+ e−Eββ /kBT
≈
1 −
Eββ
kBT
4
, (8.187)
and neglecting the small contributions of chemical shifts (δi,n 1)
Peq
αα ≈
1 − Eαα
kBT
4
=
1
4
+ γ1(1 + δi,1)
B0
8kBT
+ γ2(1 + δi,2)
B0
8kBT
≈
1
4
+ γ1
B0
8kBT
+ γ2
B0
8kBT
, (8.188)
Peq
αβ ≈
1 −
Eαβ
kBT
4
=
1
4
+ γ1(1 + δi,1)
B0
8kBT
− γ2(1 + δi,2)
B0
8kBT
≈
1
4
+ γ1
B0
8kBT
− γ2
B0
8kBT
, (8.189)
Peq
βα ≈
1 −
Eβα
kBT
4
=
1
4
− γ1(1 + δi,1)
B0
8kBT
+ γ2(1 + δi,2)
B0
8kBT
≈
1
4
− γ1
B0
8kBT
+ γ2
B0
8kBT
, (8.190)
Peq
ββ ≈
1 −
Eββ
kBT
4
=
1
4
− γ1(1 + δi,1)
B0
8kBT
− γ2(1 + δi,2)
B0
8kBT
≈
1
4
− γ1
B0
8kBT
− γ2
B0
8kBT
. (8.191)
(8.192)
8.9. SUPPORTING INFORMATION 227
t=2.5 ns t=4.0 nst=2.0 nst=0.0 ns
E
D
C
B
A
y
y
x
xz
t=1.5 ns t=3.0 ns
z
y
t=1.0 ns
xz
z
yx
z
yx
ωH
ωC
−ωCωH
t=0.5 ns t=3.5 ns
C+ωHω
Figure 8.2: Illustration how random molecular motions resonating with ﬁve characteristic frequencies aﬀect µ1 = µC in the CH bond in
a 500 MHz spectrometer (B0 = 11.75 T). The ﬁrst diagram in each row shows positions of the nuclei (13C in green, 1H in cyan), the second
diagram shows orientation of the coordinate frame, the following diagrams are snapshots of the orientations of magnetic moments shown
as arrows (13C in green, 1H in cyan) in a magnetic ﬁeld of the neighbour (shown as force lines). The red arrow indicates the direction of
the change of the orientation of the inﬂuenced magnetic moment, but the actual change is very small (not observable in the pictures) in the
short depicted period of 4 ns. The molecule is assumed to rotate with a constant frequency for 4 ns. A, µC oriented horizontally, µ2 = µH
oriented vertically, 13C in the yz plane, 1H in the center of the coordinate system. Rotation at any frequency about a horizontal axis varies
precession frequency of µC about B0 and results in the loss of coherence of 13C. B, µH oriented horizontally, µC oriented vertically, 1H
above the xy plane, 13C in the center of the coordinate system. Rotation at ωH about a horizontal axis pulls µH up (red arrow pointing up
in the given coordinate frame) in the magnetic ﬁelds of µC. This reorientation of µH (observed only at time much longer than the depicted
period of 4 ns) inﬂuences the eﬀect of the J-coupling. C, µC oriented horizontally, µH oriented vertically, 13C above the xy plane, 1H in
the center of the coordinate system. Rotation at ωC about a horizontal axis pulls µC up (red arrow pointing up in the given coordinate
frame) in the magnetic ﬁelds of µH. D, µC oriented horizontally, µH oriented horizontally, 13C close to the z axis, 1H in the center of the
coordinate system. Rotation at ωH + ωC about a vertical axis pulls µC up (red arrow pointing up in the given coordinate frame) in the
magnetic ﬁelds of µH. E, µC oriented horizontally, µH oriented horizontally, 13C in the yz plane, 1H in the center of the coordinate system.
Rotation at ωH − ωC about a horizontal axis pulls µC up (red arrow pointing mostly up in the given coordinate frame) in the magnetic
ﬁelds of µH. The reorientation of µC in Panels C–E (observed only at time much longer than the depicted period of 4 ns) contributes to
the return of the distribution of the 13C magnetic moments to the thermal equilibrium.
228
Lecture 9
Two-dimensional spectroscopy, NOESY
Literature: A very nice explanation of the principles of two-dimensional spectroscopy can be found
in K8.1–K8.2. The States-Haberkorn-Ruben method and other approaches to frequency discrimination
are explained in K8.12, L5.9.4, and C4.3.4. The idea of 2D spectroscopy, but for a diﬀerent type
of experiment (COSY) is also presented in C4.1, L5.6 and L5.9.
9.1 Two-dimensional spectroscopy
NMR spectroscopy based on application of short radio-wave pulses gives us an opportunity to display
frequencies of multiple magnetic moments in multiple dimensions of a single multidimensional spectrum.
The great advantage of this approach is the possibility to immediately see various correlations
among the observed nuclear magnetic moments and use this information in the structural analysis
of the studied molecule. When working with large molecules (proteins, nucleic acids), spectra with
three and more frequency dimensions are recorded routinely. In our course, we analyze only twodimensional
experiments, but we try to understand in detail how various correlations of interacting
magnetic moments are encoded in the spectra. Before we reach this point, we have to learn the basic
principle.
In order to explain principles of 2D spectroscopy, we ﬁrst analyze an experiment consisting of
three 90◦
pulses and two delays preceding the data acquisition. Later we learn that this experiment
is abbreviated NOESY and serves as a source of information about interatomic distances, but now
we use it just as a simple example. Application of three radio-wave pulse is already an advanced
experimental approach, deserving a clear formal presentation. The experiment can be described as
a(π/2)xb − t1 −c (π/2)xd − τm −e (π/2)xf − t2(acquire).
However, a pictorial representation shown in Figure 9.1 is more usual and practical.
In the drawing presented in Figure 9.1, each application of radio waves is represented by a black
rectangle. In our experiments, all rectangles have the same width because all pulses have the same
duration. Later we discuss experiments that combine 90◦
and 180◦
. In schemes of such experiments,
90◦
and 180◦
pulses are represented by narrow and wide rectangles, respectively. Durations of the
delays between the pulses are described by time variables t1 and τm, the time-dependence of the
acquired signal is labeled t2. In our analysis, we describe the density matrix just before and after
pulses, as indicated by red letters ”a” to ”f” in Figure 9.1.
229
230
t2
t1 τ
y/x
m
ba ec d f
Figure 9.1: Schematic drawing of a two-dimensional NMR experiment. The symbols are explain in the text.
9.2 Evolution in the absence of dipolar coupling
We start with an analysis for two non-interacting magnetic moments, e.g. of two protons that have
diﬀerent chemical shift δi,1 and δi,2, and are far from each other in a molecule.1
The pair of protons
is an example of a homonuclear system, where all nuclei have the same magnetogyric ratio γ. Before
we analyze evolution of the density matrix in a 2D experiment, we must deﬁne its initial form. Like
in the case of the isolated nuclear magnetic moments, we assume that the experiments starts from
thermal equilibrium. Therefore, we use ˆρeq
, derived in Sections 8.8 and 8.9.8, as ˆρ(t = 0) If we
neglect the very small eﬀect of diﬀerent chemical shifts in Eq. 8.39, the values of κ are also the same
for both protons. As in the one-pulse experiment, we follow the coherent evolution of ˆρ step-by-step,
and add the eﬀect of relaxation ad hoc.
• ˆρ(a) = 1
2
It + 1
2
κ(I1z + I2z)
We start from the thermal equilibrium described by Eq. 8.39. Note that the matrices are
diﬀerent than for the single-spin mixed state, but the constant is the same. Moreover, only
It, I1z, I2z contribute to ˆρ(a). If the magnetic moments do not interact, no 2I1jI2k operator
(where j, k ∈ x, y, z) contributes to any Hamiltonian. As a consequence, the I1z and I2z
components of the density matrix evolve separately, following the same rules as described for
Iz. Therefore, we can use Eq. 6.10 to analyze the evolution, we just repeat the analysis twice
for I1z and I2z, treating both as Iz in Eq. 6.10.
• ˆρ(b) = 1
2
It + 1
2
κ(−I1y − I2y)
Here we describe the eﬀect of the 90◦
pulse. For detailed analysis, see the one-pulse experiment.
• ˆρ(c) = 1
2
It + 1
2
κ (− cos(Ω1t1)I1y + sin(Ω1t1)I1x − cos(Ω2t1)I2y + sin(Ω2t1)I2x)
Here we describe evolution during t1 exactly as in the one-pulse experiment. To keep the equations
short, we replace the trigonometric terms describing the evolution by (time-dependent)
coeﬃcients c11, c21, s11, and s21:
ˆρ(c) = 1
2
It + 1
2
κ (−c11I1y + s11I1x − c21I2y + s21I2x)
The coeﬃcients c11, c21, s11, and s21 deserve some attention. First, note that the ﬁrst subscript
1
Protons in propynal (H–C≡C–CO–H) may serve as an example.
9.3. SIGNAL MODULATION IN A TWO-DIMENSIONAL EXPERIMENT 231
speciﬁes the nucleus and the second subscript speciﬁes the time period (so-far, it is always 1
because we have analyzed only evolution during t1). Second, we include the eﬀect of relaxation
into the coeﬃcients:
c11 → e−R2,1t1
cos(Ω1t1) s11 → e−R2,1t1
sin(Ω1t1)
c21 → e−R2,2t1
cos(Ω2t1) s21 → e−R2,2t1
sin(Ω2t1)
• ˆρ(d) = 1
2
It + 1
2
κ (−c11I1z + s11I1x − c21I2z + s21I2x)
Here we analyze the eﬀect of the second 90◦
pulse, similarly to the step a → b. The x-pulse
does not aﬀect the x magnetization, but rotates the −y magnetization further to −z. The ﬁnal
magnetization is parallel with B0, but the equilibrium polarization is inverted.
• ˆρ(e) =?
This is a new case, it should be analyzed carefully. Here we perform the analysis for a large
molecule such as a small protein: In proteins, Mx, My relax with R2 > 10 s−1
and Mz with
R1 ≈ 1 s−1
. The delay τm is usually longer than 0.1 s. Let us assume τm = 0.2 s and R2 =
20 s−1
. After 0.2 s, e−R2τm
= e−20×0.2
= e−4
≈ 0.02. We see that Mx, My relaxes almost
completely. Therefore, I1x, I1y, I2x, I2y can be neglected. On the other hand, e−R1τm
=
e−1×0.2
= e−0.2
≈ 0.82. We see that Mz does not relax too much. Therefore, we continue
analysis with I1z, I2z. The I1z, I2z terms do not evolve because they commute with H =
Ω1I1z + Ω2I2z. Consequently,
ˆρ(e) = 1
2
It + 1
2
κ −e−R1,1τm
c11I1z − e−R1,2τm
c21I2z = 1
2
It − A1I1z − A2I2z.
We further simpliﬁed the notation by introducing the factors A1 and A2. Again, we include
the relaxation eﬀects into A1 and A2 when we express the measurable signal:
A1 → κ
2
e−R1,1τm
c11 = κ
2
e−R1,1τm
e−R2,1t1
cos(Ω1t1)
A2 → κ
2
e−R1,2τm
c21 = κ
2
e−R1,2τm
e−R2,2t1
cos(Ω2t1)
• ˆρ(f) = 1
2
It + A1I1y + A2I2y
Here we analyze the eﬀect of the third pulse, in the same manner as we analyzed the ﬁrst pulse.
• ˆρ(t2) = 1
2
It + A1(cos(Ω1t2)I1y − sin(Ω1t2)I1x) + A2(cos(Ω2t2)I2y − sin(Ω2t2)I2x)
In the last step, we analyze evolution during the data acquisition.
9.3 Signal modulation in a two-dimensional experiment
Having ˆρ(t2), we can calculate M+ . As the size of the matrices increased, it is more convenient
to use the orthonormality of the basis than to calculate all matrix products.2
It follows from the
deﬁnition of orthonormal matrices that for the two-spin matrices
2
Orthonormality for a set of matrices ˆAj is deﬁned as Tr{ ˆA†
j
ˆAk} = δjk, where δjk = 1 for j = k, δjk = 0 for j = k,
and ˆA†
j is an adjoint matrix of ˆAj, i.e., matrix obtained from ˆAj by exchanging rows and columns and replacing all
numbers with their complex conjugates.
232
Tr {Inx(Inx + iIny)} = 1, (9.1)
Tr {Iny(Inx + iIny)} = i, (9.2)
and traces of products with other matrices are zero. Applying the orthonormality relations to
the product of ˆM+ with the obtained ˆρ(t2) and introducing relaxation, we get
M+ = Tr{ˆρ(t2) ˆM+}
= Nγ A1(e−R2,1t2
cos(Ω1t2)Tr {I1y(I1+ + I2+)} − e−R2,1t2
sin(Ω1t2)Tr {I1x(I1+ + I2+)})
+A2(e−R2,2t2
cos(Ω2t2)Tr {I2y(I1+ + I2+)} − e−R2,2t2
sin(Ω2t2)Tr {I2x(I1+ + I2+)})
= Nγ A1 ie−R2,1t2
cos(Ω1t2) − e−R2,1t2
sin(Ω1t2)
+ Nγ A2 ie−R2,2t2
cos(Ω2t2) − e−R2,2t2
sin(Ω2t2) . (9.3)
Note that the resulting phase is shifted by π/2 similarly to Eq. 7.27, but in the opposite direction.
After applying the phase correction, Fourier transformation of the signal provides spectrum in the
form (cf. Eq. 7.32)
{Y (ω)}
Nγ
A1R2,1
R2
2,1 + (ω − Ω1)2
+
A2R2,2
R2
2,2 + (ω − Ω2)2
−
{Y (ω)}
iNγ
A1(ω − Ω1)
R2
2,1 + (ω − Ω1)2
+
A2(ω − Ω2)
R2
2,2 + (ω − Ω2)2
.
(9.4)
Ω1 Ω2
{Y(ω)}
ω
Ω1 Ω2
{Y(ω)}
ω
In the one-dimensional experiment, A1 and A2 just scale the peak height. However, they depend
on the length of the delay t1 in our two-dimensional experiment. If the measurement is repeated
many times and t1 is increased by an increment ∆t each time, the obtained series of 1D spectra is
amplitude modulated by c11 = e−R2,1t2
cos(Ω1t1) and c21 = e−R2,2t2
cos(Ω2t1). Since the data are stored
in a computer in a digital form, they can be treated as a two-dimensional array (table), depending
on the real time t2 in one direction and on the length of the incremented delay t1 in the other
directions. These directions are referred to as the direct dimension and the indirect dimension. The
9.4. NOESY 233
Transmitter on
Transmitter oﬀ
Receiver on
Receiver oﬀ
t1 τm t2
t2 f2 f2
t1
f1
Figure 9.2: Principle of two-dimensional spectroscopy (experiment NOESY). The acquired signal is shown in red,
the signal after the Fourier transformation in the direct dimension is shown in magenta, and the signal after the Fourier
transformation in both dimensions is shown in blue.
Fourier transformation can be performed in each dimension providing direct frequency dimension and
indirect frequency dimension.
Since we acquire signal as a series of complex numbers, it is useful to introduce the complex
numbers in the indirect dimension as well. The advantage of such spectrum is that the positive and
negative values of the frequency oﬀset can be distinguished in both dimensions. Several protocols
that discriminate the positive and negative frequency oﬀsets in the indirect dimension are routinely
used. The frequency discrimination approaches are based on shifting phases of certain radio-wave
pulses. The method introduced by States, Haberkorn, and Ruben is described in Section 9.5.1. It
should be mentioned that phases are also changed in another manner, called phase cycling, in NMR
experiments. We discuss phase cycling later and describe recording two-dimensional signal arrays in
its full complexity in Section 11.7.
9.4 NOESY
If the two-dimensional spectra looked exactly as described in the preceding section, they would not
be very useful because they would not bring any new information. The same frequencies would be
measured in the direct and indirect dimension and all peaks would be found along the diagonal of the
spectrum. What makes the experiment really useful is the interaction between magnetic moments
during τm. Such approach is known as nuclear Overhauser eﬀect spectroscopy (NOESY) and is used
frequently to measure distances between protons in molecules.
234
As described by Eq. 8.33, relaxation of nucleus 1 is inﬂuenced by the state of nucleus 2 (and vice
versa):
−
d∆ M1z
dt
= Ra1∆ M1z + Rx∆ M2z (9.5)
−
d∆ M2z
dt
= Ra2∆ M2z + Rx∆ M1z . (9.6)
This set of equations is solved and the solution is analyzed in Section 9.5.2. The analysis shows
that the amplitudes A1 and A2 depend on both frequencies Ω1 and Ω2 (contain both c11 and c21).
Therefore, the spectrum contains both diagonal peaks (with the frequencies of the given magnetic
moment in both dimensions) and oﬀ-diagonal cross-peaks (with the frequencies of the given magnetic
moment in the direct dimension and the frequency of its interaction partner in the indirect dimension).
The presence of the cross-peaks provides very useful qualitative information about the studied
molecules. It tells us which nuclei are close in space. Such knowledge of spatial proximity often allows
us to assign measured frequencies to the hydrogen atoms in the studied molecule. But we often go
further and analyze the intensities of the cross-peaks quantitatively. As shown in Section 9.5.3, the
height of the NOESY cross-peaks Ymax depends on two factors: on the dynamics of the molecule
and on the distance of the interacting nuclei. Depending on the motions on the molecules, the peak
height can be positive or negative. If the molecular motions are slow, the cross-peaks have the same
sign as diagonal peaks. However, if the molecular motions are fast (e.g., if the molecule is small),
the sign is opposite. Obviously, there is a range of molecular motions that make the peak height
close to zero. In such case, other NMR techniques than NOESY should be applied. If the dynamics
of the molecule is favorable (suﬃciently fast or slow), the dependence on the distance between the
interacting nuclei can be used to estimate distances in the molecule. For short τm, the cross-peak
height is approximately proportional to r−6
. The studied molecules (especially large molecules like
proteins or nucleic acid fragments) often contain pairs of protons with a well-deﬁned geometry. For
example, the distance between geminal protons in the CH2 group is 0.17 nm, distances between
protons in the ortho- and meta- positions in aromatic rings are 0.25 nm and 0.42 nm, respectively.
Such distances can be used as a reference for the measurement of unknown distances. If we assume
that two protons have a similar dynamics as a reference pair of protons, the ratio of the heights3
of
the cross-peaks of the investigated and reference proton pairs is
Ymax
Ymax,ref
=
rref
r
6
. (9.7)
Therefore, the unknown distance r can be calculated as
r = rref
6 Ymax,ref
Ymax
. (9.8)
It is quite remarkable that the dipole-dipole interaction allows us to measure distances nine orders
of magnitude shorter than the wavelength of the used electromagnetic waves.
3
Volume (integral) of the peak gives more accurate distances because it is not inﬂuenced by the relaxation during
measurement. On the other hand, measurement of peak volumes may be diﬃcult in crowded spectra of large molecules.
9.4. NOESY 235
HOMEWORK
Analyze the intensities of the NOESY cross-peaks (Sections 9.4, 9.5.2, and 9.5.3, using Eqs. 8.33–8.34
from Section 8.7.)
236
9.5 SUPPORTING INFORMATION
9.5.1 States-Haberkorn-Ruben method of processing hypercomplex data
It is possible to introduce the complex numbers in the indirect dimension by repeating the measurement twice for each value of t1, each
time with a diﬀerent phase of the radio waves applied during the second pulse. We describe the procedure for a pair of non-interacting
spin-1/2 nuclei. First we acquire the signal with the second pulse applied with the same phase as the ﬁrst pulse. Such a phase is labeled x
in the NMR literature. Then, we repeat the acquisition with the phase of the radio waves shifted by 90◦ during the second pulse. Such a
phase is labeled y in the literature. The former case was analyzed in Section 9.2. In the latter case, the I1y and I2y components are not
aﬀected and relax during τm, while the I1x and I2x are rotated to −I1z and −I2z, respectively, and converted to the measurable signal
by the third pulse. Because the I1x and I2x coherences are modulated by s11 and s21, the amplitudes (labeled here B1 and B2) oscillate
as a sine function, in the even spectra (unlike the amplitudes A1, A2 of the odd spectra, oscillating as a cosine function). So, we obtain
cosine amplitude modulation in odd spectra and sine amplitude modulation in even spectra:
B1 → κ
2
e−R1,1τm s11 = κ
2
e−R1,1τm e−R2,1t1 sin(Ω1t1)
B2 → κ
2
e−R1,2τm s21 = κ
2
e−R1,2τm e−R2,2t1 sin(Ω2t1).
Processing of the complete data starts by complex Fourier transformation in the direct (t2) dimension providing two sets of 1D spectra,
cosine amplitude-modulated in t1
{y1(t1)} {Y2(ω2)}
Nγ
A1R2,1
R2
2,1 + (ω2 − Ω1)2
+
A2R2,2
R2
2,2 + (ω2 − Ω2)2
−
{y1(t1)} {Y2(ω2)}
iNγ
A1(ω2 − Ω1)
R2
2,1 + (ω2 − Ω1)2
+
A2(ω2 − Ω2)
R2
2,2 + (ω2 − Ω2)2
(9.9)
and sine amplitude-modulated in t1
{y1(t1)} {Y2(ω2)}
Nγ
B1R2,1
R2
2,1 + (ω2 − Ω1)2
+
B2R2,2
R2
2,2 + (ω2 − Ω2)2
−
{y1(t1)} {Y2(ω2)}
iNγ
B1(ω2 − Ω1)
R2
2,1 + (ω2 − Ω1)2
+
B2(ω2 − Ω2)
R2
2,2 + (ω2 − Ω2)2
. (9.10)
The cosine amplitude-modulated and sine amplitude-modulated real components of the obtained 1D spectra, respectively, are then
treated as the real and imaginary component of the complex signal in the indirect dimension.
{y1(t1)} {Y2(ω2)}
Nγ
A1R2,1
R2
2,1 + (ω2 − Ω1)2
+
A2R2,2
R2
2,2 + (ω2 − Ω2)2
+ i
{y1(t1)} {Y2(ω2)}
Nγ
B1R2,1
R2
2,1 + (ω2 − Ω1)2
+
B2R2,2
R2
2,2 + (ω2 − Ω2)2
(9.11)
= Nγ
(A1 + iB1)R2,1
R2
2,1 + (ω2 − Ω1)2
+
(A2 + iB2)R2,2
R2
2,2 + (ω2 − Ω2)2
(9.12)
=
Nγ2 2B0
4kBT
e−R1,1τm e−R2,1t1 cos(Ω1t1)
R2,1
R2
2,1 + (ω2 − Ω1)2
+ e−R1,2τm e−R2,2t1 cos(Ω2t1)
R2,2
R2
2,2 + (ω2 − Ω2)2
+ i
Nγ2 2B0
4kBT
e−R1,1τm e−R2,1t1 sin(Ω1t1)
R2,1
R2
2,1 + (ω2 − Ω1)2
+ e−R1,2τm e−R2,2t1 sin(Ω2t1)
R2,2
R2
2,2 + (ω2 − Ω2)2
(9.13)
=
Nγ2 2B0
4kBT
e−R1,1τm e−R2,1t1 eiΩ1t1
R2,1
R2
2,1 + (ω2 − Ω1)2
+ e−R1,2τm e−R2,2t1 eiΩ2t1
R2,2
R2
2,2 + (ω2 − Ω2)2
(9.14)
The complex Fourier transformation in t1 yields a spectrum
Nγ2 2B0
4kBT
e−R1,1τm
R2,1
R2
2,1 + (ω1 − Ω1)2
·
R2,1
R2
2,1 + (ω2 − Ω1)2
+ e−R1,2τm
R2,2
R2
2,2 + (ω1 − Ω2)2
·
R2,2
R2
2,2 + (ω2 − Ω2)2
+ i
Nγ2 2B0
4kBT
e−R1,1τm
ω1 − Ω1
R2
2,1 + (ω1 − Ω1)2
·
R2,1
R2
2,1 + (ω2 − Ω1)2
+ e−R1,2τm
ω1 − Ω2
R2
2,2 + (ω1 − Ω2)2
·
R2,2
R2
2,2 + (ω2 − Ω2)2
, (9.15)
where the positive and negative values of the frequency oﬀset are distinguished in both dimensions.
Schematically, the States-Haberkorn-Ruben method can be summarized as
9.5. SUPPORTING INFORMATION 237
y1(t1) · y2(t2) = ( {y1(t1)} + i {y1(t1)}) · ( {y2(t2)} + i {y2(t2)})
↓ Complex Fourier transformation in t2
( {y1(t1)} + i {y1(t1)}) · ( {Y2(ω2)} + i {Y2(ω2)})
↓ Discard imaginary component in ω2
( {y1(t1)} + i {y1(t1)}) · {Y2(ω2)}
↓ Complex Fourier transformation in t1
( {Y1(ω1)} + i {Y1(ω1)}) · {Y2(ω2)} = {Y1(ω1)} {Y2(ω2)} + i {Y1(ω1)} {Y2(ω2)} (9.16)
Note that without separating the real component in ω2 from the imaginary one prior to the second Fourier transformation a spectrum
with a phase-twisted line-shape is obtained:
y1(t1) · y2(t2) = ( {y1(t1)} + i {y1(t1)}) · ( {y2(t2)} + i {y2(t2)})
↓ Complex Fourier transformation in t2
( {y1(t1)} + i {y1(t1)}) · ( {Y2(ω2)} + i {Y2(ω2)})
↓ Complex Fourier transformation in t1
( {Y1(ω1)} + i {Y1(ω1)}) · ( {Y2(ω2)} + i {Y2(ω2)})
= {Y1(ω1)} {Y2(ω2)} − {Y1(ω1)} {Y2(ω2)}
real
+ i ( {Y1(ω1)} {Y2(ω2)} + {Y1(ω1)} {Y2(ω2)})
imaginary
(9.17)
with the real part having the following form (for each peak)
Nγ2 2B0
4kBT
e−R1,nτm
R2,n
R2
2,n + (ω1 − Ωn)2
·
R2,n
R2
2,n + (ω2 − Ωn)2
−
ω1 − Ωn
R2
2,n + (ω1 − Ωn)2
·
ω2 − Ωn
R2
2,n + (ω2 − Ωn)2
, (9.18)
which is not an absorption Lorentz function (due to the presence of the red term).
9.5.2 Quantitative analysis of cross-relaxation in NOESY
As described by Eqs. 8.33–8.34, relaxation of nucleus 1 is inﬂuenced by the state of nucleus 2 (and vice versa):
−
d∆ M1z
dt
= Ra1∆ M1z + Rx∆ M2z (9.19)
−
d∆ M2z
dt
= Ra2∆ M2z + Rx∆ M1z . (9.20)
The analysis greatly simpliﬁes if the auto-relaxation rates are identical for both magnetic moments.4 Then,
−
d∆ M1z
dt
= Ra∆ M1z + Rx∆ M2z , (9.21)
−
d∆ M2z
dt
= Ra∆ M2z + Rx∆ M1z . (9.22)
Such set of diﬀerential equations can be solved easily e.g. by the substitutions ∆+ = ∆ M1z +∆ M2z and ∆− = ∆ M2z −∆ M1z .
The result is
∆+ = ∆+(0)e−(Ra+Rx)t
, (9.23)
∆− = ∆−(0)e−(Ra−Rx)t
. (9.24)
Returning back to ∆ M1z and ∆ M2z ,
4
This is a reasonable assumption for protons with similar dynamics and in similar chemical environment.
238
∆ M1z = ((1 − ζ)∆ M1z (0) + ζ∆ M2z (0)) e−(Ra+Rx)t
, (9.25)
∆ M2z = ((1 − ζ)∆ M2z (0) + ζ∆ M1z (0)) e−(Ra+Rx)t
, (9.26)
where ζ = (1 − e2Rxt)/2. Therefore, the amplitudes A1 and A2 in our two-dimensional experiment are
A1 =
κ
2
((1 − ζ)c11 + ζc21)e−(Ra+Rx)τm , (9.27)
A2 =
κ
2
((1 − ζ)c21 + ζc11)e−(Ra+Rx)τm . (9.28)
9.5.3 Intensity of NOESY cross-peaks
The intensity (measured as peak height or peak integral, i.e., volume) of the cross-peaks is proportional to the amplitudes A1 and A2.
Here we analyze how A1 and A2 decay during τm. The overall loss of signal (”leakage”) due to the R1 relaxation is given by e−(Ra+Rx)τm
and intensities of the cross-peaks are given by the factor
ζe−(Ra+Rx)τm = −
1
2
eRxτm − e−Rxτm e−Raτm . (9.29)
For short τm, Taylor expansion can be applied. Neglecting higher terms, eRxτm − e−Rxτm ≈ 1 + Rxτm − 1 + Rxτm = 2Rxτm and
e−Raτm is close to one. Therefore, the expression describing the cross-peak intensities can be approximated as
−
1
2
eRxτm − e−Rxτm e−Raτm ≈ −Rxτm (9.30)
and Rx can be expressed explicitly using Eqs. 8.177, 8.178, and 8.180
−
1
2
eRxτm − e−Rxτm e−Raτm ≈ −Rxτm =
µ0
8π
2 γ4 2
r6
(J(0) − 6J(2ω0))τm, (9.31)
where the diﬀerence of the precession frequencies due to diﬀerent chemical shifts was neglected (assuming ω0,1 = ω0,2 = ω0 because
γ1 = γ2 and |ω0,1 − ω0,2| is ∼ 10−5ω0,1 or lower). The obtained result shows that the cross-peak intensity is proportional to r−6 and
to J(0) − 6J(2ω0) in the linear approximation. In order to investigate the impact of the dependence on J(0) − 6J(2ω0), we calculate the
spectral density function for a simple correlation function of a rigid spherical molecule (Eq. 2.3):
J(ω) =



∞ˆ
−∞
1
5
e−t/τC e−iωt
dt



= 2



∞ˆ
0
1
5
e
−
iωτC+1
τC
t
dt



=
2
5
τC
iωτC + 1
=
2
5
τC
1 + iωτC
1 − iωτC
1 − iωτC
=
2
5
τC
1 + (ωτC)2
. (9.32)
Setting ω = 0, we obtain J(0) = 2
5
τC.
If the molecular motions are slow, τC is long and 2ω0τC 1 ⇒ J(2ω0) 2
5
τC ⇒ J(0) > 6J(2ω0). Therefore the cross-peak intensity
proportional to J(0) − 6J(2ω0) is positive (i.e., cross-peaks have the same sign as diagonal peaks).
If the molecular motions are fast, 2ω0τC 1 ⇒ J(2ω0) ≈ 2
5
τC ⇒ J(0) = 2
5
τC < 6J(2ω0) ≈ 6 × 2
5
τC. Therefore the cross-peak
intensity proportional to J(0) − 6J(2ω0) is negative (i.e., cross-peaks and diagonal peaks have the opposite sign).
Lecture 10
J-coupling, spin echoes
Literature: The through-bond coupling (J-coupling) is described in L14 and L15, the Hamiltonian
is presented in L9.4 and J-coupled spins are described in L14.2, L14.3, and L14.5. Spin echoes are
nicely described in K7.8 and also presented in LA.10.
10.1 Through-bond coupling
Magnetic moments of nuclei connected by covalent bonds interact also indirectly, via interactions
with magnetic moments of the electrons of the bonds. This type of interaction is known as Jcoupling,
through-bond coupling, or indirect spin-spin coupling. A magnetic moment µ2 is a source
of a magnetic ﬁeld that perturbs the distribution of electron magnetic moments. Such a distortion
(perturbation of the electron spin states or modiﬁcation of electron orbital magnetic moments by
altering the magnetic ﬁeld felt by the electrons) modiﬁes a magnetic ﬁeld at the site of µ1. The fact
that such indirect interaction exists is itself not surprising. But it is less obvious (and was surprising
when ﬁrst observed) why the indirect interaction is not averaged to zero in isotropic liquids. The
indirect interaction is just a series of direct interactions of diﬀerent magnetic moments (of the nuclei
and of the electrons). We derived in Section 8.9.2 that the eﬀect of a direct interaction of two
magnetic dipoles is averaged to zero in isotropic liquids. Should not the whole series of interactions
be averaged to zero, when each interaction in the series seems to be?
Before we discuss this mystery, we write down a general form of a Hamiltonian representing a
contribution of the coupling to the magnetic energy of a pair of interacting nuclear magnetic moments.
For example, if nucleus 2 generates (indirectly, via interactions with the electrons as descried above)
a ﬁeld B2 at the site of nucleus 1, then coupling with µ2 contributes to the energy of the magnetic
moment µ1 by −µ1 · B2. We ca assume that each component of the ﬁeld felt by magnetic moment
1 (e.g. of 1
H) somehow depends on all components of the magnetic moment 2 (e.g. of 13
C), in a
similar manner as we described the through-space direct dipole-dipole coupling. Therefore, it should
be possible to describe the indirect interaction by a tensor (like chemical shift or dipolar coupling):
ˆHJ =
2π
( ˆI1x
ˆI1y
ˆI1z )


Jxx Jxy Jxz
Jyx Jyy Jyz
Jzx Jzy Jzz




ˆI1x
ˆI1y
ˆI1z

 =
2π ˆ
I1 · J ·
ˆ
I2. (10.1)
To proceed, we should investigate the physical origin of the interaction. As discussed in Sec-
239
240
tion 10.10.1, the major contribution to the J-coupling in most molecules is an interaction mediated
by electrons occurring at the same positions as the nuclei. Obviously, interaction of such electrons
with the nuclei does not change as the molecule rotates. As a consequence, the J-tensor has a dominant
isotropic (orientation-independent) component, whereas the anisotropic part is usually small
(and diﬃcult to distinguish from the dipolar coupling). Therefore, only the isotropic component
of the tensor is considered and the anisotropic component is neglected in practice. The isotropic
component is deﬁned as described in Section 1.5.6 for the chemical shift tensor:1
2π


JXX 0 0
0 JY Y 0
0 0 JZZ

 = 2π
JXX + JY Y + JZZ
3


1 0 0
0 1 0
0 0 1

 = 2πJ


1 0 0
0 1 0
0 0 1

 . (10.2)
The unit matrix tells us that we can replace the tensor J (represented by a 3 matrix) in the
Hamiltonian by a scalar value (single number) J. Accordingly, the J-coupling is often called scalar
coupling (implying that the anisotropic component is neglected).
The actual coupling between nuclei connected by chemical bonds is due to interactions of magnetic
moments of electrons that (i) can be found with non-zero probabilities at the nuclei, and (ii)
contribute to the covalent bonds between the nuclei. Wave functions of electrons in a simple σ-bond
are discussed in Section 10.10.2.
The value of the constant J can be positive or negative, depends on the actual distribution of
electrons, and its calculation requires advanced quantum chemistry methods. The factor of 2π reﬂects
the convention to express J in the units of Hz. Note that the J-coupling does not depend on the
external magnetic ﬁeld B0. Therefore, it does not make sense to express J in relative units (ppm).
Proton-proton J-coupling is signiﬁcant (exceeding 10 Hz) up to three bonds and observable for 4 or
5 bonds in special cases (planar geometry like in aromatic systems). Interactions of other nuclei are
weaker, but the one-bond couplings are always signiﬁcant (as strong as 700 Hz for 31
P-1
H, 140 Hz to
200 Hz for 13
C-1
H, −90 Hz for 15
N-1
H in amides, 30 Hz to 60 Hz for 13
C-13
C, −10 Hz to −15 Hz for
13
C-15
N). Typical values of two-bond (2
J) and three-bond (3
J) 1
H-1
H couplings are −15 Hz and 0 Hz
to 20 Hz, respectively. As the value of J is given by the distribution of electrons in bonds, it reports
the local geometry of the molecule. In particular, three-bond J-couplings can be used to measure
torsion angles in molecules.
The J-coupling has a quantum origin, but its inﬂuence on evolution of magnetic moments can be
described classically, as shown in Sections 10.10.3 and 10.10.5.
10.2 Secular approximation, averaging, and relaxation
If the anisotropic part of the J-tensor is neglected, the J-coupling does not depend on orientation
(scalar coupling) and no ensemble averaging is needed. The secular approximation is applied like in
the case of the dipolar coupling.
1
Note that it is suﬃcient to consider only the average of the diagonal elements of the tensor J = (JXX +JY Y +JZZ)/3
if the anisotropy (2JZZ − JY Y − JXX)/6 and rhombicity (JXX − JY Y )/2 are equal to zero.
10.2. SECULAR APPROXIMATION, AVERAGING, AND RELAXATION 241
A B C
Figure 10.1: J-coupling. A, probability of ﬁnding an electron in the hydrogen atom at particular coordinates is
described by the probability density ρ. The probability density described by the orbital 1s (depicted as a sphere) has
non-zero value at the position of the nucleus (shown in cyan). Therefore, there is a non-zero probability of ﬁnding
electron (red circle) exactly at the site of the nucleus. The ﬁeld produced at the site of the nucleus by the electron’s
magnetic moment (red arrow) does not depend on the orientation of the atom if the positions of the nucleus and
electron coincide. Therefore, the interaction of the nucleus with the electron is not averaged to zero if the atom
rotates isotropically. B and C, the probability density described by the sigma orbitals (depicted as an ellipsoid) in
molecules has also non-zero values at the sites of nuclei. The spin state of the electrons in the bonding sigma orbital
is a superposition of the |α ⊗ |β and |β ⊗ |α| eigenstates (indicated by the opposite direction of the red arrows),
perturbed by the magnetic moment of the nuclei. The parallel orientations of magnetic moments is energetically
favorable for a nucleus and an electron sharing its position.
The Hamiltonian of scalar coupling, i.e., of J-coupling with the small anisotropic contribution
neglected, has one of the following forms.
• In the case of magnetic moments with the same γ and chemical shift, precessing about
the z axis with the same precession frequency,
ˆHJ =
π
2ˆI1z
ˆI2z + 2ˆI1x
ˆI2x + 2ˆI1y
ˆI2y . (10.3)
This case is called strong coupling and is discussed in Lecture 12.
• In the case of magnetic moments with diﬀerent γ and/or chemical shift, precessing about
the z axis with diﬀerent precession frequencies,
ˆHJ =
2π ˆI1z
ˆI2z =
π
2ˆI1z
ˆI2z . (10.4)
This case is called weak coupling. Only the weak coupling Hamiltonian is considered in
Lectures 10 and 11.
In principle, the anisotropic part of the J-tensor would contribute to relaxation like the anisotropic
part of the chemical shift tensor, but it is small and usually neglected. The scalar coupling (described
by the isotropic part of the J-tensor) does not depend on the orientation. Therefore, it can contribute
to the relaxation only through a conformational or chemical exchange. Conformational eﬀects are
usually small: one-bond and two-bond couplings do not depend on torsion angles and three-bond
242
coupling constants are small. In summary, relaxation due to the J-coupling is rarely observed.
However, the J-coupling inﬂuences relaxation of the sample in another way. As described in Section
10.5, J-coupling creates density matrix components relaxing with diﬀerent rates than I1+ and
I2+, analyzed in Sections 8.7 and 8.9.6.
10.3 Homo- and heteronuclear magnetic moment pairs
So far, we did not distinguish homonuclear pairs of magnetic moments (magnetic moments of the
same type of nuclei, e.g., two protons) and heteronuclear pairs of magnetic moments (magnetic
moments of diﬀerent isotopes, e.g., proton and 13
C). It is useful to distinguish these two cases when
we analyze advanced NMR experiments. Although the density matrix has the same form in both
cases, the Hamiltonians describing the eﬀects of radio waves may diﬀer. The reason is technical.
Diﬀerences in chemical shifts are usually small and allow us to irradiate the sample by a radio
wave with a frequency suﬃciently close to the precession frequencies of both nuclei. Therefore, the
resonance conditions can be matched reasonably well for both nuclei and they are aﬀected by the
radio waves in a similar manner. On the other hand, precession frequencies of diﬀerent isotopes diﬀer
substantially and the frequency of the radio waves can resonate only with one of the isotopes. As
a consequence, each of the magnetic moments of the pair is aﬀected selectively, which is frequently
exploited in the NMR experiments.
In order to distinguish the heteronuclear systems from homonuclear ones in our written notes, we
save the symbols I1j and I2j for homonuclear pairs (most often two protons) and use symbols Ij
and Sj for operators of nucleus 1 and 2, respectively, if γ1 = γ2. Both labeling systems are mixed if
we describe more complex chemical groups. For example, we use symbols I1j, I2j, and Sj for the
operators representing contributions to density matrix describing (mixed) states of nuclear magnetic
moments in the 13
C1
H2 group.
10.4 Density matrix evolution in the presence of J-coupling
In order to extend description of NMR experiments to J-coupled pairs of nuclear magnetic moments,
we should update the analysis of the density matrix evolution derived in the previous lectures. As
always, analysis of the starts by the deﬁnition of the initial density matrix form. Derivation of
the density matrix in the thermal equilibrium, presented in Section 10.10.6, is very similar to that
described for two nuclei interacting through space (dipolar coupling) in Section 8.8. In principle, the
diagonal elements of the density matrix are slightly inﬂuenced by the J-coupling, but this inﬂuence
is at least ﬁve orders of magnitude weaker than the dominant eﬀect of the external magnetic ﬁeld
B0. Therefore, the J-coupling contribution can be neglected together with the eﬀect of the chemical
shifts, and the same equilibrium density matrix can be used as the starting point of the analysis
of NMR experiments in the presence of J-coupling, as it was used for systems with no or dipolar
coupling:
ˆρeq
=
1
2
(It + κ1I1z + κ2I2z) , (10.5)
10.4. DENSITY MATRIX EVOLUTION IN THE PRESENCE OF J-COUPLING 243
H1
H1
H1
H1
N15
C13
or
N15
C13
or
N15
C13
or
N15
C13
or
A
B
C
D
a b c d e
Figure 10.2: Graphical analysis of evolution of density matrix for 1
H (nucleus 1) and 13
C (nucleus 2) in an isolated
–CH– group. In individual rows, evolution of coherences is shown for three protons (distinguished by colors) with
slightly diﬀerent precession frequency due to the diﬀerent chemical shifts δi. The protons are bonded to 13
C. Solid
arrows represent fractions of proton magnetization in 10 % molecules with 13
C magnetic moments most polarized in
the direction of B0. Dashed arrow represent fractions of proton magnetization in 10 % molecules with 13
C magnetic
moments most aligned in the opposite direction. The ﬁrst column shows the arrows at the beginning of the echo (after
the initial 90◦
pulse at the proton frequency), the second column shows the arrows in the middle of the ﬁrst delay
τ, the third and fourth columns show the arrows immediately before and after the 180◦
pulse(s) in the middle of the
echo, respectively, the ﬁfths column shows the arrows in the middle of the second delay τ, the sixth column shows the
arrows at the end of the echo. Row A corresponds to an experiment when no 180◦
pulse is applied, row B corresponds
to the echo with the 180◦
pulse applied at the proton frequency, row C corresponds to the echo with the 180◦
pulse
applied at the 13
C frequency, and row D corresponds to the echo with the 180◦
pulses applied at both frequencies (see
the schematic drawings in left part of the ﬁgure). The x-axis points down, the y-axis points to the right.
where
κj =
γjB0
2kBT
. (10.6)
In the case of a homonuclear pair of magnetic moments (e.g., of two J-coupled protons), κ1 =
κ2 = κ because γ1 = γ2.
Also the second step, the analysis of the eﬀect of the 90◦
radio wave pulse (see the schematic
drawing in Figure 10.2A), gives the same result as for uncoupled systems. Again, the reason is that
the ﬁelds indirectly produced by the coupled magnetic moments are too weak (much weaker than
the radio-frequency ﬁeld) to have a noticeable eﬀect during the short pulse. Therefore, our analysis
of the evolution in the presence of the J-coupling starts from ˆρ(b) = 1
2
It + 1
2
κ(−I1y − I2y), where
the letter ”b” refers to the labeling of the time course in Figure 10.2.
244
In the presence of the J-coupling, the general Hamiltonian describing evolution after a 90◦
pulse
is complicated even in a coordinate system rotating with ωrot = −ωradio
H = −γ1B0(1 + δi,1)
Ω1
I1z −γ2B0(1 + δi,2)
Ω2
I2z + πJ (2I1zI2z + 2I1xI2x + 2I1yI2y) . (10.7)
However, if the precession frequencies diﬀer, the secular approximation simpliﬁes the Hamiltonian
to a form where all components commute:
H = −γ1B0(1 + δi,1)
Ω1
I1z −γ2B0(1 + δi,2)
Ω2
I2z + πJ 2I1zI2z. (10.8)
In such case, Eq. 6.9 can be applied and the Liouville-von Neumann equation can be solved
geometrically as rotations in three-dimensional subspaces of the 16-dimensional operator space. The
relevant subspaces are deﬁned by the commutation relations summarized in Eqs. 8.29–8.31 and
presented graphically in Figure 10.3. Graphical description of rotations in the 16D operator space of
a heterenuclear pair is derived from Figure 10.3 by changing I1j to Ij and I2j to Sj, or vice versa
(see Figure 10.4). Rotations described by diﬀerent components of the Hamiltonian are independent
and can be performed consecutively, in any order.
For a density matrix ˆρ(b) = 1
2
It + 1
2
κ(−I1y − I2y) after a 90◦
pulse, the evolution due to the
chemical shift (described by Ω1 and Ω2) and J-coupling (described by πJ) can be analyzed as follows
It −→ It−→It−→It (10.9)
−I1y −→



−c1I1y−→ − c1I1y−→
−c1cJ I1y
+c1sJ 2I1xI2z
+s1I1x−→ + s1I1x−→
+s1cJ I1x
+s1sJ 2I1yI2z
(10.10)
−I2y −→ −I2y−→



−c2I2y−→
−c2cJ I2y
+c2sJ 2I2xI1z
+s2I2x−→
+s2cJ I2x
+s2sJ 2I2yI1z
(10.11)
where the ﬁrst (cyan) arrows represent rotation ”about” I1z or I2z by the angle Ω1t or Ω2t, the
second (green) arrows represent rotation ”about” , the third (red) arrows represent rotation ”about”
2I1zI2z by the angle πJt, and
c1 = cos(Ω1t) s1 = sin(Ω1t) (10.12)
c2 = cos(Ω2t) s2 = sin(Ω2t) (10.13)
cJ = cos(πJt) sJ = sin(πJt). (10.14)
10.4. DENSITY MATRIX EVOLUTION IN THE PRESENCE OF J-COUPLING 245
A
B
C
D
ω1
ω1 ω1
ω1
I1z
−I1x
−I1z
−I1x
I1z
−I1x
−I1z
I1z
−I1z
I1z
−I1x
−I1z
Ω1 Ω1
πJ πJ
−2I1xI2z
2I1zI2z
−2I1xI2z
−2I1zI2z
2I1zI2z
−2I1zI2z
2I1zI2z
−2I1xI2z
−I1y
I1x
I1y
2I1yI2z
I1x
−I1y I1y
I1x
I1y−I1y
I1x
2I1xI2z
−I1y I1y
2I1xI2z
2I1zI2z
2I1yI2z
2I1xI2z
2I1yI2z−2I1yI2z
−I1y I1y
I1x
−I1x
−2I1zI2z
−2I1zI2z
−2I1yI2z
−2I1yI2z
Figure 10.3: Rotations in product operator space. A, eﬀects of the Hamiltonian describing the chemical shift; B,
eﬀects of the Hamiltonian describing the J coupling; C, eﬀects of the Hamiltonian describing the radio wave pulses
with the phase 0 (x); D eﬀects of the Hamiltonian describing the radio wave pulses with the phase π/2 (y). The
rotations are shown for the magnetic moment 1, a similar diagram for the magnetic moment 2 can be obtained by
switching the subscripts 1 and 2 of the operators I1j and I2k.
246
A
B
C
D
ωI
ωI ωI
ωI
Iz
−Ix
−Iz
−Ix
Iz
−Ix
−Iz
Iz
−Iz
Iz
−Ix
−Iz
ΩI ΩI
πJ πJ
−2IxSz
2IzSz
−2IxSz
−2IzSz
2IzSz
−2IzSz
2IzSz
−2IxSz
−Iy
Ix
Iy
2IySz
Ix
−Iy Iy
Ix
Iy−Iy
Ix
2IxSz
−Iy Iy
2IxSz
2IzSz
2IySz
2IxSz
2IySz−2IySz
−Iy Iy
Ix
−Ix
−2IzSz
−2IzSz
−2IySz
−2IySz
Figure 10.4: Rotations in heteronuclear product operator space. A, eﬀects of the Hamiltonian describing the chemical
shift; B, eﬀects of the Hamiltonian describing the J coupling; C, eﬀects of the Hamiltonian describing the radio wave
pulses with the phase 0 (x); D eﬀects of the Hamiltonian describing the radio wave pulses with the phase π/2 (y).
10.5. SIGNAL IN THE PRESENCE OF THE J-COUPLING 247
As mentioned above, the same result is obtained if we ﬁrst ”rotate about” 2I1zI2z, and then
”about” I1z or I2z.
The described analysis can be further simpliﬁed if we use pairs of arrows to represent the product
operators contributing to ˆρ. In this representation, direction of the arrows speciﬁes the transverse
polarization of the observed magnetic moment and the style of the arrows (solid or dashed) speciﬁes
the longitudinal polarization of the coupled neighbor. Pairs of arrows with the same orientations
represent coherences I1x, I1y, I2x, I2y, describing transverse polarizations regardless of the neighbor’s
longitudinal polarizations. Pairs of arrows with the opposite orientations represent coherences
2I1xI2z, 2I1yI2z, 2I2xI1z, 2I2yI1z, describing correlations of transverse polarizations with the
neighbor’s longitudinal polarizations. For example,
−I1y =
y
x
, 2I1xI2z =
x
y
, etc.
The evolution due to the chemical shift, described by the Hamiltonian components Ω1I1z and
Ω2I2z is represented by simultaneous rotation of the arrows (solid and dashed arrows rotate by the
same angle Ω1t or Ω2t in the same direction). The evolution due to the J-coupling, described by
the Hamiltonian component πJ 2I1zI2z is represented by mutual rotation of the arrows (solid and
dashed arrows rotate by the same angle πJt in the opposite direction). More details are discussed in
Section 10.10.7.
10.5 Signal in the presence of the J-coupling
The last step is the evaluation of the expectation value of the transverse magnetization. Only I1x,
I1y, I2x, I2y contribute to the expected value of M+, giving non-zero trace when multiplied by ˆI+
(taking advantage of using orthonormal basis, see Section 9.3):
Tr {I1x(I1x + iI1y)} = Tr {I2x(I2x + iI2y)} = 1, (10.15)
Tr {I1y(I1x + iI1y)} = Tr {I2y(I2x + iI2y)} = i, (10.16)
Well-known trigonometric relations cos(a±b) = cos a cos b sin a sin b and sin(a±b) = sin a cos b±
cos a sin b allow us to convert the products cncJ (modulating Iny) and sncJ (modulating Inx) in
Eqs. 10.10–10.11 to sums of cosine and sine functions, respectively:
c1cJ =
1
2
cos((Ω1 + πJ)t) +
1
2
cos((Ω1 − πJ)t) (10.17)
s1cJ =
1
2
sin((Ω1 + πJ)t) +
1
2
sin((Ω1 − πJ)t) (10.18)
c2cJ =
1
2
cos((Ω2 + πJ)t) +
1
2
cos((Ω2 − πJ)t) (10.19)
s2cJ =
1
2
sin((Ω2 + πJ)t) +
1
2
sin((Ω2 − πJ)t) (10.20)
248
The expected value of M+ calculated from the complete density matrix is then
M+ = Tr ˆρ(t) ˆM+ = Nγ Tr {ˆρ(t)(I1x + iI1y + I2x + iI2y)}
= −iNγ
κ
4
(cos((Ω1 + πJ)t) + cos((Ω1 − πJ)t) + cos((Ω2 + πJ)t) + cos((Ω2 − πJ)t))
+ Nγ
κ
4
(sin((Ω1 + πJ)t) + sin((Ω1 − πJ)t) + sin((Ω2 + πJ)t) + sin((Ω2 − πJ)t))
= Nγ
κ
4
(−i) (cos((Ω1 − πJ)t) + i sin((Ω1 − πJ)t) + cos((Ω1 + πJ)t) + i sin((Ω1 + πJ)t))
+ Nγ
κ
4
(−i) (cos((Ω2 − πJ)t) + i sin((Ω2 − πJ)t) + cos((Ω2 + πJ)t) + i sin((Ω2 + πJ)t))
=
Nγ2 2
B0
8kBT
e−i π
2 ei(Ω1−πJ)t
+ ei(Ω1+πJ)t
+ ei(Ω2−πJ)t
+ ei(Ω2+πJ)t
(10.21)
At this moment, we should also include relaxation. We have analyzed relaxation in Sections
7.7, 7.10.3, and 8.7, 8.9.6. However, the density matrix in the presence of the J-coupling evolves
into new terms 2I1xI2z, 2I1yI2z, 2I1zI2x, and 2I1zI2y, and these terms relax diﬀerently. Their
relaxation rates can be derived using the Bloch-Wangsness-Redﬁeld approach, but we do not do it
in this course. If both dipole-dipole interactions and chemical shift anisotropy contribute to the
relaxation, another complication appears: relaxation of I1+ depends on 2I1+I2z and vice versa,
and the same applies to I2+ and 2I1zI2+.2
To keep our analysis as simple as possible, we (i)
assume that the contribution of the chemical shift anisotropy is negligible, (ii) describe relaxation
of the inter-converting ˆρ contributions I1+, 2I1+I2z and I2+, 2I1zI2+ by average rate constants,
and (iii) assume that the average rate constants are identical for both nuclei (we use the symbol R2).
Including relaxation and applying a phase shift by 90 ◦
, we obtain description of the time evolution
of the expected value of M+
M+ =
Nγ2 2
B0
8kBT
e−R2t
ei(Ω1−πJ)t
+ ei(Ω1+πJ)t
+ ei(Ω2−πJ)t
+ ei(Ω2+πJ)t
(10.22)
which gives four peaks in the spectrum after the Fourier transformation:
2
The mutual dependence of relaxation is described by constants known as cross-correlated cross-relaxation rate
constants, resembling Rx in Eqs. 8.33 and 8.34.
10.6. SPIN ECHOES 249
Nγ2 2
B0
8kBT
R2
R
2
2 + (ω − Ω1 + πJ)2
+
R2
R
2
2 + (ω − Ω1 − πJ)2
+
R2
R
2
2 + (ω − Ω2 + πJ)2
+
R2
R
2
2 + (ω − Ω2 − πJ)2
−i
Nγ2 2
B0
8kBT
ω − Ω1 + πJ
R
2
2 + (ω − Ω1 + πJ)2
+
ω − Ω1 − πJ
R
2
2 + (ω − Ω1 − πJ)2
+
ω − Ω2 + πJ
R
2
2 + (ω − Ω2 + πJ)2
+
ω − Ω2 − πJ
R
2
2 + (ω − Ω2 − πJ)2
.
(10.23)
Ω1
2πJ
Ω2
2πJ
{Y(ω)}
ω
Ω1
2πJ
Ω2
2πJ
{Y(ω)}
ω
The four peaks in the spectrum form two doublets, one at an average angular frequency Ω1,
the other one at an average angular frequency Ω2. Both doubles are split by an angular frequency
diﬀerence πJ − (−πJ) = 2πJ, or by the value of J if the frequencies are plotted in Hz.
After describing spectrum of a homonuclear pair of magnetic moments, we should also mention
how spectra of heteronuclear pairs diﬀer from the homonuclear ones. The selective irradiation of
either nucleus 1 or nucleus 2 also implies that the peaks of nuclei 1 and 2 are not observed in the
same spectrum. The signals of nucleus 1 and nucleus 2 are recorded in two experiments with diﬀerent
frequencies (resonating with the precession frequency of nucleus 1 in one spectrum and of nucleus
2 in the other one) of the radio waves, as shown in Figure 10.5. The sensitivities (signal-to-noise
ratios) of the experiments are in the ratio |γ1/γ2|5/2
(Eq. 7.94). For example, sensitivity of 13
C and
15
N spectra is reduced by a factor of 32 (see Figure 10.5) and 300, respectively, compared to proton
spectra, even if the molecules contain 100 % 13
C and 15
N isotopes.
10.6 Spin echoes
In many NMR experiments, the J-coupling is not just detected, but creatively employed to deliberately
change quantum states (mixed states) of the studied system. Such a manipulation resembles
250
Ω1
2πJ
{Y(ω)}
ω
Ω1
2πJ
{Y(ω)}
ω
Ω2
2πJ
{Y(ω)}
ω
Ω2
2πJ
{Y(ω)}
ω
Figure 10.5: Spectra of a heteronuclear pair. Top, real and imaginary component of a spectrum recorded after
applying a radio wave pulse close to the precession frequency of nucleus 1. Bottom, real and imaginary component of
a spectrum recorded after applying a radio wave pulse close to the precession frequency of nucleus 2. Note that the
frequency oﬀsets Ω1 and Ω2 are measured from diﬀerent carrier frequencies (close to ω0,1 and ω0,2, respectively). The
spectra are plotted so that the noise is the same in both spectra, the relative intensities correspond to a pair of 1
H
(nucleus 1) and 13
C (nucleus 2). The value of J is the same in the top and bottom spectra.
10.6. SPIN ECHOES 251
the dream of the medieval alchemists, transmutation of chemical elements,3
and is sometimes called
”spin alchemy”.
Spin echoes are basic tools of spin alchemy, consisting of a 180 ◦
(π) radio-wave pulse sandwiched
by two delays of equal duration τ. In the case of a heteronuclear pair, we can apply the 180 ◦
pulse
selectively to magnetic moment 1, to magnetic moment 2, or simultaneously to both (see Figure 10.2).
Such a collection of spin echoes gives us the possibility to control evolution of the chemical shift and
J-coupling separately. In the case of a homonuclear pair, the radio waves aﬀect both magnetic
moments simultaneously, as shown in Figure 10.2D.4
Below, we analyze three types of spin echoes applied to a heteronucler system (1
H and 13
C in
our example). For the sake of simplicity, we do not discuss relaxation eﬀects, although relaxation is
usually observable. On the other hand, we have to extend the analyzed system to see how the echoes
aﬀect evolution due to the chemical shift diﬀerences. Therefore, we analyze three pairs with diﬀerent
chemical shifts of the observed nucleus in.
To see how the echoes inﬂuence polarization of the sample, we should compare the eﬀect of the
echoes with the free evolution. Evolution of a single homonuclear pair of magnetic moments in the
presence of J-coupling was described in Section 10.4. To convert the description to our set of three
heteronuclear pairs, we should follow evolution of a density matrix starting from
ˆρ(a) =
1
25
It +
3
n=1
1
25
κ1Inz +
3
n=1
1
25
κ2Snz. (10.24)
However, complexity of such analysis might obscure the eﬀects of the analyzed spin echoes.
Therefore, we write the evolution for one heteronuclear pair and depict the set of three pairs only in
the graphical analysis, as shown in Figure 10.2A.
• ˆρ(a) = 1
2
It + 1
2
κ1Iz + 1
2
κ2Sz
thermal equilibrium, the constants κ1 and κ2 are diﬀerent because the nuclei have diﬀerent γ.
• ˆρ(b) = 1
2
It − 1
2
κ1Iy + 1
2
κ2Sz
90◦
pulse applied to nucleus I only
• ˆρ(e) = 1
2
It + 1
2
κ1 (−c1cJ Iy + s1cJ Ix + c1sJ 2IxSz + s1sJ 2IySz) + 1
2
κ2Sz
free evolution during 2τ (t → 2τ in c1 etc.)
3
Transmutation of the mercury isotope 197
80 Hg (which can be prepared from the stable isotope 198
80 Hg) to a common
isotope of gold 197
79 Au is a nuclear reaction known as electron capture: a proton in the nucleus absorbs an inner-shell
electron, emits a neutrino νe and changes to neutron. Since proton and neutron can be described as diﬀerent quantum
states of an object called nucleon, the transmutation of mercury to gold can be viewed as a change of the quantum
state. Interestingly, proton and neutron diﬀer in the isospin projection quantum number I3, whereas the quantum
states manipulated in NMR spectroscopy diﬀer in the spin projection quantum number sz. The similar nomenclature
is used to emphasize similar symmetry (the same mathematical description) of two diﬀerent physical phenomena.
4
If the chemical shift of nuclei in a homonuclear pair diﬀer substantially, a selective application of 180 ◦
pulses to
either magnetic moment is possible. In such a case, power of the radio waves should be low, and their amplitude is
often modulated during the pulse to achieve a higher selectivity.
252
The 2IxSz, 2IySz coherences give zero trace when multiplied by I+ (they are not measurable
per se), but cannot be ignored if the pulse sequence continues because they can evolve into measurable
coherences later (note that the J-coupling Hamiltonian 2πJIzSz converts them to Iy, Ix,
respectively).
The graphical ”double-arrow” analysis in Figure 10.2A shows how the coherences evolve with
diﬀerent chemical shifts (arrows of diﬀerent colors rotate with diﬀerent frequency) and how is the
evolution inﬂuenced by the J-coupling (solid arrows rotate slower5
than dashed arrows of the same
color).
10.7 Refocusing echo
The refocusing echo consists of a 90◦
pulse exciting magnetic moment 1 and a 180◦
pulse applied
to the excited nucleus in the middle of the echo (see the schematic drawing in Figure 10.2B). The
middle 180◦
pulse ﬂips all arrows from left to right (rotation about the vertical axis x by 180 ◦
). The
faster arrows start to evolve with a handicap at the beginning of the second delay τ and they reach
the slower arrows at the end of the echo regardless of the actual speed of rotation.
Even without a detailed analysis of product operators, we see that the ﬁnal state of the system
does not depend on chemical shift or J-coupling: the evolution of both chemical shift and J-coupling
is refocused during this echo.
The evolution of the density matrix can be guessed from the graphical analysis. The frequency
of the applied radio waves resonates with proton precession frequency and is far from the precession
frequency of 13
C. Therefore, magnetic moments of 13
C should stay in their equilibrium distribution,
described by It and Sz. The initial state of protons was described (after the 90◦
pulse) by −Iy in
terms of product operators and by three arrows with the same −y orientation. As the arrows only
changed their direction at the end of the experiment (all arrows have the +y orientation at the end
of the echo), we can deduce that the ﬁnal state of protons is +Iy. Taken together, each pair of
magnetic moment ends in the state described by
• ˆρ(e) = 1
2
It + 1
2
κ1Iy + 1
2
κ2Sz
10.8 Decoupling echo
The decoupling echo consists of a 90◦
pulse exciting magnetic moment 1 and a 180◦
pulse applied
to the other nucleus in the middle of the echo (see the schematic drawing in Figure 10.2C). The
graphical analysis is shown in Figure 10.2C. The middle 180◦
is applied at the 13
C frequency. It does
not aﬀect proton coherences, depicted as arrows in Figure 10.2C, but inverts longitudinal polarizations
(populations) of 13
C (solid arrows change to dashed ones and vice versa). The faster arrows become
slower, the slower arrows become faster, and they meet at the end of the echo.
Without a detailed analysis of product operators, we see that the ﬁnal state of the system does
not depend on J-coupling (the diﬀerence between solid and dashed arrows disappeared) but the
5
This is true for nuclei with γ > 0.
10.9. SIMULTANEOUS ECHO 253
evolution due to the chemical shift took place (arrows of diﬀerent colors rotated by diﬀerent angles
2Ω1τ). As the eﬀects of the J-coupling are masked, this echo is known as the decoupling echo.
We again derive the ﬁnal density matrix from the graphical analysis. As the arrows at the end
of the echo have the same orientations as if the nuclei were not coupled at all, we can deduce that
the ﬁnal state of protons is identical to the density matrix evolving due to the chemical shift only.
Magnetic moments of 13
C nuclei were aﬀected only by the middle 180◦
pulse that inverted longitudinal
polarization. The density matrix at the endo of the echo is
• ˆρ(e) = 1
2
It + 1
2
κ1 (c1Iy − s1Ix) − 1
2
κ2Sz
10.9 Simultaneous echo
The last echo consists of a 90◦
pulse exciting magnetic moment 1 and 180◦
pulses applied to both
nuclei in the middle of the echo (see the schematic drawing in Figure 10.2D). As both nuclei are
aﬀected, it can be applied to heteronuclear or homonuclear pairs. The homonuclear version includes
one 180◦
pulses of radio waves with a frequency close to the precession frequency of both magnetic
moments. In the heteronuclear variant, two 180◦
pulses are applied simultaneously to both nuclei.
The graphical analysis of the heteronuclear application is shown in Figure 10.2D. The 180◦
pulses
are applied at 1
H and 13
C frequencies in the middle of the echo, resulting in combination of both
eﬀects described in Figs. 10.2B and C. The proton pulse ﬂips arrows representing proton coherences
and the 13
C pulse inverts longitudinal polarizations (populations) of 13
C nuclei (solid arrows change
to dashed ones and vice versa). As a result, the average direction of dashed and solid arrows is
refocused at the end of the echo but the diﬀerence due to the coupling is preserved (the handicapped
arrows were made slower by the inversion of longitudinal polarization of 13
C).
Without a detailed analysis of product operators, we see that the eﬀect of the chemical shift is
removed (the average direction of arrows of the same color is just reversed), but the ﬁnal state of
the system depends on J-coupling (the solid and dashed arrows collapsed). We can deduce from
the graphical analysis that the ﬁnal state of the density matrix is obtained by rotation ”about”
2IzSz, but not ”about” Iz in the product operator space, and by changing the sign of the resulting
coherences:
• ˆρ(e) = 1
2
It + 1
2
κ1 (cJ Iy − sJ 2IxSz) − 1
2
κ2Sz
HOMEWORK
Analyze the spin echoes (Sections 10.6–10.9).
254
10.10 SUPPORTING INFORMATION
10.10.1 Interaction between nuclei mediated by bond electrons
In principle, both orbital and spin magnetic moments of electrons can mediate the J-coupling, but the contribution of the orbital magnetic
moments is usually negligible (coupling between hydrogen nuclei in water is an interesting exception). In order to describe the mediation
of the J-coupling by the electron spin, we ﬁrst investigate the interaction between electron and proton in the hydrogen atom.
A classical picture of interactions of nuclear and electronic spin magnetic moments is presented in Figure 10.6. Energy of the interaction
between the (spin) magnetic moment of nucleus µn and the magnetic ﬁeld generated by the spin magnetic moment of electron Be is given
by (cf. Eq. 8.47)
E = −µn · Be = −
µ0
4π
µn · ×
µe × r
r3
= −
µ0
4π
µn · × ×
µe
r
. (10.25)
In principle, the interaction with an electron does not diﬀer from an interaction between two nuclear magnetic moments, described in
Sections 8.1 and 8.9.1. Depending on the mutual orientation of the nucleus and electron, the direction of Be varies (Figure 10.6A–C). If the
distribution of electrons is spherically symmetric, or if the molecules tumble isotropically, the interactions of the spin magnetic moments
of the electron and the nucleus average to zero. With one exception, depicted in Figure 10.6D. If the electron is present exactly at the
nucleus, the vector of the electron spin magnetic moment µe has the same direction as Be and E is proportional to the scalar product
−µn · µe. The exact co-localization of electron and nucleus may look strange in the classical physics, but the interaction between the
nucleus and electron inside the nucleus can be simulated by a hypothetical current loop giving the correct magnetic moment when treated
classically. To include the distribution of the electron around the nucleus into our classical model, the total energy of the integration must
be calculated by integrating Eq. 10.25 over the electron coordinates. As mentioned above, the integral tends to zero for r > 0 in isotropic
samples. However, the integral has a non-zero value in the limit r → 0, as discussed e.g. in Abragam: The principles of nuclear magnetism,
Oxford Press 1961, Chapter VI, Section II.A.
Here, we present a quantum-mechanical analysis, following the original paper by Fermi in Z. Phys. 60 (1930) 320–333. Fermi started
from the eigenfunctions of the Dirac Hamiltonian for an electron in an electromagnetic ﬁeld (Eq. 5.151) of nuclei of alkali metals. We
investigate the simplest example, the ground state of hydrogen atom. The 1s atomic orbital of the hydrogen atom is particularly interesting
because it has a non-zero value in the center, at the place of the nucleus (cf. Figure 10.1A). The eigenfunctions describing an electron in
the 1s orbital6 are
Ψ(1s1/2, +1/2) =
1
√
π
µ0
4π
Q2mc2
2
3
2







e
−
µ0
4π
Q2mc2
2 r
0
i
2
µ0
4π
Q2
c
cos ϑ e
−
µ0
4π
Q2mc2
2 r
i
2
µ0
4π
Q2
c
sin ϑeiϕ e
−
µ0
4π
Q2mc2
2 r







=




ψ
0
− i
2
λC
z
r
ψ
− i
2
λC
x+iy
r
ψ



 , (10.26)
Ψ(1s1/2, −1/2) =
1
√
π
µ0
4π
Q2mc2
2
3
2







0
−e
−
µ0
4π
Q2mc2
2 r
− i
2
µ0
4π
Q2
c
sin ϑe−iϕ e
−
µ0
4π
Q2mc2
2 r
i
2
µ0
4π
Q2
c
cos ϑ e
−
µ0
4π
Q2mc2
2 r







=




0
−ψ
+ i
2
λC
x−iy
r
ψ
− i
2
λC
z
r
ψ



 , (10.27)
6
Derivation of the Dirac or Schr¨odinger orbitals is beyond the scope of our course. It can be found in quantum chemistry textbooks.
Here we only use the results, reviewed e.g. by Powel in J. Chem. Educ. 45 (1968) 558–563. Note that we use results of the original
derivation for simple Coulombic potential (published in 1928). We ignore corrections of the interaction of the electron with its own ﬁeld,
that has to be made to achieve a good agreement with the experiment.
A
θ
B
θ
C
θ
D
Figure 10.6: Classical description of interactions of nuclear and electronic spin magnetic moments.
10.10. SUPPORTING INFORMATION 255
where λC = /(mc) is known as the Compton wavelength, ψ is the familiar non-relativistic (Schr¨odinger) orbital 1s, and ψ = dψ/dr.
Note that the 1s orbital is a real wave function, i.e. ψ∗ = ψ.
Contribution of the interaction between magnetic moments of the nucleus and of the electron to the expected energy can be calculated
by applying Eq. 4.8 to the spin magnetic part of the Hamiltonian in Eq. 5.151
E =
ˆ
V
Ψ∗
Qc −An,xˆγ0
ˆγ1
− An,y ˆγ0
ˆγ2
− An,z ˆγ0
ˆγ3
Ψ dx dy dz, (10.28)
where An is the vector potential of the nucleus. Using Eq. 8.43, the vector potential can be expressed in terms of the nuclear magnetic
moment and electron coordinates
E = −
µ0Qc
4π
ˆ
V
1
r3
Ψ∗
(zµn,y − yµn,z)ˆγ0
ˆγ1
+ (xµn,z − zµn,x)ˆγ0
ˆγ2
+ (yµn,x − xµn,y)ˆγ0
ˆγ3
Ψ dx dy dz. (10.29)
The integral with Ψ(1s1/2, +1/2) includes the following three terms
Ψ∗ zµn,y − yµn,z
r3
ˆγ0
ˆγ1
Ψ =
zµn,y − yµn,z
r3
ψ 0 i
2
λC
z
r
ψ i
2
λC
x−iy
r
ψ




0 0 0 1
0 0 1 0
0 1 0 0
1 0 0 0








ψ
0
− i
2
λC
z
r
ψ
− i
2
λC
x+iy
r
ψ




=
zµn,y − yµn,z
r3
ψ 0 i
2
λC
z
r
ψ i
2
λC
x−iy
r
ψ




− i
2
λC
x+iy
r
ψ
− i
2
λC
z
r
ψ
0
ψ




=
zµn,y − yµn,z
r3
−
i
2
λC
x + iy
r
ψψ +
i
2
λC
x − iy
r
ψψ =
zµn,y − yµn,z
r3
λC
y
r
ψψ =
yzµn,y − y2µn,z
r4
λCψψ ,
(10.30)
Ψ∗ zµn,y − yµn,z
r3
ˆγ0
ˆγ2
Ψ =
xµn,z − zµn,x
r3
ψ 0 i
2
λC
z
r
ψ i
2
λC
x−iy
r
ψ




0 0 0 −i
0 0 i 0
0 −i 0 0
i 0 0 0








ψ
0
− i
2
λC
z
r
ψ
− i
2
λC
x+iy
r
ψ




=
xµn,z − zµn,x
r3
ψ 0 i
2
λC
z
r
ψ i
2
λC
x−iy
r
ψ




− 1
2
λC
x+iy
r
ψ
+1
2
λC
z
r
ψ
0
iψ




=
xµn,z − zµn,x
r3
−
1
2
λC
x + iy
r
ψψ −
1
2
λC
x − iy
r
ψψ =
xµn,z − zµn,x
r3
λC
−x
r
ψψ =
xzµn,x − x2µn,z
r3
λCψψ ,
(10.31)
Ψ∗ yµn,x − xµn,y
r3
ˆγ0
ˆγ1
Ψ =
yµn,x − xµn,y
r3
ψ 0 i
2
λC
z
r
ψ i
2
λC
x−iy
r
ψ




0 0 1 0
0 0 0 −1
1 0 0 0
0 −1 0 0








ψ
0
− i
2
λC
z
r
ψ
− i
2
λC
x+iy
r
ψ




=
yµn,x − xµn,y
r3
ψ 0 i
2
λC
z
r
ψ i
2
λC
x−iy
r
ψ




− i
2
λC
z
r
ψ
+ i
2
λC
x+iy
r
ψ
ψ
0




=
yµn,x − xµn,y
r3
−
i
2
λC
z
r
ψψ +
i
2
λC
z
r
ψψ = 0. (10.32)
Inserting results of Eqs. 10.30–10.32 into Eq. 10.29,
256
E =
µ0Qc
4π
λC
ˆ
V
1
r2
x2 + y2
r2
µn,z −
xz
r2
µn,x −
yz
r2
µn,y ψψ dxdydz =
µ0
4π
Q
m
ˆ
V
1
r2
r2 − z2
r2
µn,z −
xz
r2
µn,x −
yz
r2
µn,y ψψ dxdydz.
(10.33)
Expressed in spherical coordinates x = r sin ϑ cos ϕ, y = r sin ϑ sin ϕ, z = r cos ϑ, dV = dxdydz = r2 sin ϑdrdϑdϕ,
E =
µ0
4π
Q
m
µn,z
2πˆ
0
dϕ
πˆ
0
sin ϑdϑ(1 − cos2
ϑ)
∞ˆ
0
ψ(r)ψ (r)dr
−
µ0
4π
Q
m
µn,x
2πˆ
0
dϕ cos ϕ
πˆ
0
sin ϑdϑ sin ϑ
∞ˆ
0
ψ(r)ψ (r)dr
−
µ0
4π
Q
m
µn,y
2πˆ
0
dϕ sin ϕ
πˆ
0
sin ϑdϑ sin ϑ
∞ˆ
0
ψ(r)ψ (r)dr (10.34)
Only the ﬁrst term diﬀers from zero because cos ϕ and sin ϕ are periodic functions and their integrals over the whole period
2πˆ
0
dϕ cos ϕ = 0,
2πˆ
0
dϕ sin ϕ = 0. (10.35)
The ﬁrst term can be evaluated using the substitution u = cos ϑ, du = − sin ϑdϑ, and noting that
dρ(r)
dr
=
d|ψ(r)|2
dr
≡
dψ2(r)
dr
= 2ψ
dψ(r)
dr
= 2ψ(r)ψ (r), (10.36)
where ρ(r) is the probability density of ﬁnding the electron at the distance r from the nucleus.
Therefore,
E =
µ0
4π
Q
m
µn,z
2πˆ
0
dϕ
πˆ
0
sin ϑdϑ(1−cos2
ϑ)
∞ˆ
0
ψ(r)ψ (r)dr =
µ0
4π
Q
m
µn,z2π
1ˆ
−1
(1−u2
)du
ρ(∞)ˆ
ρ(0)
dρ(r)
2
=
µ0
4π
Q
m
µn,z2π u −
u3
3
1
−1
ρ
2
ρ(∞)
ρ(0)
.
(10.37)
As the probability of ﬁnding the electron at an inﬁnite distance from the nucleus tends to zero,
E =
µ0
4π
Q
m
µn,zπ 2 −
23
3
0 −
ρ(0)
2
= −
µ0
4π
Q
m
µn,z
4π
3
ρ(0) = −
µ0Q
3m
µn,zρ(0). (10.38)
Note that Q /2m is the eigenvalue of the component of the magnetic moment of the electron parallel to the magnetic ﬁeld. This time,
it is the magnetic ﬁeld of the nucleus (B0 does not play any role here). If we use the direction of µn as the z-axis of our coordinate system,
E = −
2
3
µ0ρ(0)µn · µe. (10.39)
Accordingly, the corresponding Hamiltonian is
ˆHF = −
2
3
µ0γnγeρ(0)
ˆ
In ·
ˆ
Ie , (10.40)
where
ˆ
In and
ˆ
Ie are operators of the spin of the nucleus and the electron, respectively, γn and γe are magnetogyric ratios of the spin
of the nucleus and the electron, respectively, and the integral is equal to one inside the nucleus and to zero outside the nucleus. This
type of interaction is known as the Fermi contact interaction and does not depend on orientation of the molecule in the magnetic ﬁeld, as
documented by the scalar product in Eq. 10.40.
We can now proceed from the nucleus-electron interactions to interactions between two sigma-bonded nuclei mediated by electrons of
the bond. The electrons in the bonding sigma orbital also have non-zero probability density at the positions of the nuclei (Figure 10.7).
If the nuclei did not have any magnetic moments, the eigenfunction of the electrons is the linear combination 1√
2
|α ⊗ |β − 1√
2
|β ⊗ α|,
as discussed in Section 10.10.2 and shown schematically in Figure 10.7A. Due to the Fermi interaction, parallel orientation of the nuclear
and electron spin magnetic moments (Figure 10.7B) has a lower energy and the opposite orientation (Figure 10.7C) has a higher energy
than the unperturbed stationary state. Thus the orientation of the magnetic moment of the ﬁrst nucleus is indirectly inﬂuenced by the
orientation of the second magnetic moment: the energy is proportional to the scalar product µ1 · µ2, where µ1 and µ2 are the nuclear
magnetic moments. The exact value of the energy depends on the actual distribution of the electrons in the bonding orbital, the calculation
10.10. SUPPORTING INFORMATION 257
A
B
C
Figure 10.7: J-coupling. A, the stationary spin state of the electrons in the bonding sigma orbital without nuclear magnetic moments
is a superposition of the |α ⊗|β and |β ⊗α| eigenstates (indicated by the opposite direction of the red arrows). B, energetically favorable
state of electrons interacting with nuclear magnetic moments (green and cyan arrows). C, energetically unfavorable state of electrons
interacting with nuclear magnetic moments.
258
of the energy requires advanced quantum chemical methods. Such methods can be applied to more complex systems too. In general, the
described indirect interaction is described by the Hamiltonian
ˆHJ =
2π
(ˆI1x
ˆI2x + ˆI1y
ˆI2y + ˆI1z
ˆI2z), (10.41)
where 2πJ is a constant describing the strength of the indirect, electron mediated interaction and ˆInj are operators of the components
of the angular momenta of the nuclei.
10.10.2 Two electrons in a sigma orbital
A wave function describing two electrons must be antisymmetric, as stated in Section 6.7.1. Assuming that the spin degrees of freedom
can be separated (see the discussion in Sections 6.1 and 6.7.3), we can decompose the wave function (i) into a symmetric non-spin part σs
and an antisymmetric spin part ψa, or (ii) to an antisymmetric non-spin part σa and a symmetric spin part ψs. We try to express the spin
wave function in a suitable basis. In the case of a single particle in a ﬁeld described by the Hamiltonian −γB0
ˆIz, we used a basis consisting
of eigenfunctions of the operator ˆIz, i.e., the eigenvectors |α =
1
0
and |β =
0
1
. These eigenvectors are also eigenfunctions of the
operator of I2 because the matrix representation of ˆI2 is proportional to the unit matrix (see Eq. 5.11) and ˆ1ψ = ψ for any ψ. For a pair
of two electrons, we could use the eigenfunctions of ˆI1z, ˆI2
1 , ˆI2z, and ˆI2
2 (i.e., eigenvectors listed in Eq. 8.83). However, it is more useful
to chose eigenfunctions of operators representing the z-component and the square of the total spin angular momentum I = I1 + I2, in
combination with ˆI2
1 and ˆI2
2 . Note that all operators of the set ˆI2
1 , ˆI2
2 , ˆI2, and ˆIz commute (the ﬁrst two operators are proportional to the
unit matrix that commutes with any matrix of the same size, commutation of the last two operators is given by Eq. 4.38). The explicit
forms of the chosen operators are obtained using the matrix representations of the product operators in Tables 8.1 and 8.2:
ˆI2
1 ψk =
3 2
4




1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1








c1k
c2k
c3k
c4k



 , (10.42)
ˆI2
2 ψk =
3 2
4




1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1








c1k
c2k
c3k
c4k



 , (10.43)
ˆI2
ψk =
ˆ
I1 +
ˆ
I2
2
ψk = ˆI2
1 + ˆI2
2 + 2
ˆ
I1 ·
ˆ
I2 ψk =
= ˆI2
1 + ˆI2
2 + 2ˆI1x
ˆI2x + 2ˆI1y
ˆI2y + 2ˆI1z
ˆI2z ψk = 2




2 0 0 0
0 1 1 0
0 1 1 0
0 0 0 2








c1k
c2k
c3k
c4k



 , (10.44)
ˆIzψk = ˆI1z + ˆI2z ψk =




1 0 0 0
0 0 0 0
0 0 0 0
0 0 0 1








c1k
c2k
c3k
c4k



 . (10.45)
The eigenfunctions of ˆI1z, ˆI2
1 , ˆI2z, and ˆI2
2 clearly cannot be eigenfunctions of the operator ˆI2, represented by a non-diagonal matrix.
Therefore, we have to look for a new basis, where the operator ˆI2 is represented by a diagonal matrix ˆI 2. For this purpose, we use a
procedure that is not very elegant, but does not require any special approaches of matrix algebra.
From the mathematical point of view, we have to ﬁnd a transformation matrix ˆT so that
ˆT ˆI 2
= ˆI2 ˆT. (10.46)
Then, the diagonalized matrix ˆI 2 representing the ˆI2 operator is obtained by multiplying the equation from left by a matrix ˆT−1,
inverse to ˆT (i.e., ˆT−1 ˆT = ˆ1):
H = ˆT−1
H ˆT. (10.47)
Multiplying by ˆT from left gives
ˆTH = H ˆT. (10.48)
The desired eigenvalues are diagonal elements of the diagonalized matrix
10.10. SUPPORTING INFORMATION 259




λ1 0 0 0
0 λ2 0 0
0 0 λ3 0
0 0 0 λ4



 . (10.49)
The eigenvalues λk and eigenvectors |ψk can be obtained by comparing the eigenvalue equation
H |ψk = ωk|ψk (10.50)
with the left-hand side of Eq. 10.48
ˆTH =




T11 T12 T13 T14
T21 T22 T23 T24
T31 T32 T33 T34
T41 T42 T43 T44








λ1 0 0 0
0 λ2 0 0
0 0 λ3 0
0 0 0 λ4



 =




λ1T11 λ2T12 λ3T13 λ4T14
λ1T21 λ2T22 λ3T23 λ4T24
λ1T31 λ2T32 λ3T33 λ4T34
λ1T41 λ2T42 λ3T43 λ4T44



 . (10.51)
The eigenvalue equation can be written as a set of four equations for k = 1, 2, 3, 4
H |ψk = 2




2 0 0 0
0 1 1 0
0 1 1 0
0 0 0 2








T1k
T2k
T3k
T4k



 = 2




2T1k
T2k + T3k
T2k + T3k
2T4k



 = λk




T1k
T2k
T3k
T4k



 = λk|ψk . (10.52)
The ﬁrst row of the middle equality allows us to identify
λ1 = 2 2
(10.53)
if we set T21 = T31 = T41 = 0, i.e.,
|ψ1 =




T11
0
0
0



 . (10.54)
Similarly,
λ4 = 2 2
(10.55)
for
|ψ4 =




0
0
0
T44



 . (10.56)
The λ2 and λ3 values can be calculated from the equations
λkT2k = 2
(T2k + T3k) (10.57)
λkT3k = 2
(T2k + T3k), (10.58)
(setting T12 = T42 = T13 = T43 = 0).
T3k can be expressed from the ﬁrst equation
T3k =
λk − 2
2
T2k (10.59)
and inserted into the second equation
λk
λk − 2
2
T2k = (λk − 2
)T2k + 2
T2k = λkT2k, (10.60)
(λk)2
− 2 2
λk = λk(λk − 2 2
) = 0, (10.61)
directly giving
λ2 = 0, λ3 = 2 2
. (10.62)
We have identiﬁed all diagonal elements of the diagonalized operator
260
ˆI 2
= 2 2




1 0 0 0
0 0 0 0
0 0 1 0
0 0 0 1



 . (10.63)
The new basis is given by Eqs. 10.57, 10.58, and the normalization condition
ψk|ψk = 1 ⇒
4
j=1
T2
jk = 1. (10.64)
The normalization condition immediately deﬁnes T11 = T44 = 1.
Substituting λ2 into Eqs. 10.57 and 10.58 gives
T22 + T32 = 0 ⇒ T22 = −T32. (10.65)
The normalization condition 1 = T2
22 + T2
32 = 2T2
22 requires
T22 =
1
√
2
, T32 = −
1
√
2
. (10.66)
Substituting λ3 into Eqs. 10.57 and 10.58 gives
2 2
T23 = 2
(T23 + T33) (10.67)
2 2
T33 = 2
(T23 + T33) (10.68)
⇒ T23 = T33. (10.69)
⇒ T23 = T23. (10.70)
The normalization condition 1 = T2
23 + T2
33 = 2T2
23 requires
T23 =
1
√
2
, T33 =
1
√
2
. (10.71)
Taken together, the new basis consists of the following eigenvectors
|ψ1 =




1
0
0
0



 = |α ⊗|α , |ψ2 =





0
1√
2
− 1√
2
0





=
1
√
2
(|α ⊗|β −|β ⊗|α ), |ψ3 =





0
1√
2
1√
2
0





=
1
√
2
(|α ⊗|β +|β ⊗|α ), |ψ4 =




0
0
0
1



 = |β ⊗|β .
(10.72)
Among them, |ψ1 , |ψ3 , and |ψ4 , are symmetric and are multiplied by the antisymmetric σa, whereas |ψ2 is antisymmetric and is
multiplied by the symmetric σs. Calculations of the non-spin functions σa and σs is not easy7 and requires advanced quantum chemistry.
The result of such calculation is the bonding sigma orbital σs with lower energy and the antibonding sigma orbital σs with higher energy.
Therefore, we are interested in σs|ψ2 = σs(|α ⊗|β −|β ⊗|α )/
√
2 if we study ground state of the molecule. The corresponding eigenvalues
are 3 2/4 for ˆI2
1 and ˆI2
1 , zero for ˆI2 and ˆIz.
10.10.3 Classical analysis of two J-coupled polarizations
Although the physical origin of the J-coupling is a consequence of quantum behavior of electrons, the evolution of the macroscopic
magnetization can be described classically. We have described precession of a magnetic moment µ in a magnetic ﬁeld B as
dµ
dt
= ω × µ = −γB × µ (10.73)
(Eq. 51). As µ = γL,
dL
dt
= ω × L = −γ × B. (10.74)
7
The major diﬃculty is a mutual interactions of the electron charges.
10.10. SUPPORTING INFORMATION 261
We assume that the magnetic moment is placed in a strong homogeneous magnetic ﬁeld B0 deﬁning the direction of the axis z (a
typical case in NMR). On suﬃciently long time scales, eﬀects of other, weaker ﬁelds, average to zero, unless the weak ﬁelds are oriented
along B0 or rotate with a frequency close to −γB0 (secular approximation). Therefore, orbital magnetic moments of electrons modify the
ﬁeld B0 only by adding a small chemical shift, to (1 + δ)B0. If two magnetic moments are coupled, they are inﬂuenced not only by the
external ﬁeld B0, but also by a ﬁeld proportional to the neighbor magnetic moment
dL1
dt
= −γ1(1 + δ1)B0 × L1 + ζL2 × L1 = −γ1(1 + δ1)B0 × L1 − ζL1 × L2 (10.75)
dL2
dt
= −γ2(1 + δ2)B0 × L2 + ζL1 × L2, (10.76)
where ζ is a so far undeﬁned coupling constant. As magnitudes of nuclear magnetic moments diﬀer only in γ1, γ2, but |L1| = |L2| = L,
we can divide both sides of the equations by L and describe rotation of dimensionless vectors r1 = (x1, y1, z1) and r2 = (x2, y2, z2),
describing orientations of µ1 and µ2, respectively, and write
dr1
dt
= −γ1(1 + δ1)B0 × r1 − ζLr1 × r2 = ω0,1 × r1 − πJr1 × r2 (10.77)
dr2
dt
= −γ2(1 + δ2)B0 × r2 + ζLr1 × r2 = ω0,2 × r2 + πJr1 × r2, (10.78)
where we introduced the traditional symbols ω0,1, ω0,a, and πJ.
Writing the vector products explicitly,
dx1
dt
= −ω0,1y1 − πJy1z2 + πJz1y2 (10.79)
dy1
dt
= +ω0,1x1 + πJx1z2 − πJz1x2 (10.80)
dz1
dt
= −πJx1y2 + πJy1x2 (10.81)
and
dx2
dt
= −ω0,2y2 + πJy1z2 − πJz1y2 (10.82)
dy2
dt
= +ω0,2x2 − πJx1z2 + πJz1x2 (10.83)
dz2
dt
= +πJx1y2 − πJy1x2. (10.84)
In the case of the weak J-coupling, evolution on suﬃciently long time scales is inﬂuenced only by vertical magnetic ﬁelds. Therefore,
it is suﬃcient to analyze rotations about the z axis. The equations simplify to
dx1
dt
= −ω0,1y1 − πJy1z2 (10.85)
dy1
dt
= +ω0,1x1 + πJx1z2 (10.86)
dz1
dt
= 0 (10.87)
and
dx2
dt
= −ω0,2y2 − πJz1y2 (10.88)
dy2
dt
= +ω0,2x2 + πJz1x2 (10.89)
dz2
dt
= 0. (10.90)
262
In NMR spectroscopy, we do not observe individual magnetic moments, but the bulk magnetization deﬁned as
Mn = Nµn = NγnLn = NγnLrn, (10.91)
where the bar indicates an ensemble average. Therefore, the ensemble-averaged terms xn, xnzn , yn, ynzn should be followed to
describe evolution of weakly coupled magnetizations:
dx1
dt
= −ω0,1y1 − πJy1z2 (10.92)
dy1
dt
= +ω0,1x1 + πJx1z2 (10.93)
and
dx2
dt
= −ω0,2y2 − πJz1y2 (10.94)
dy2
dt
= +ω0,2x2 + πJz1x2. (10.95)
We will describe rotation of µ1 and µ2 in complex representation, introduced in Section 1.5.4. We express r1 and r2 in spherical
coordinates
x1 = sin ϑ1 cos ϕ1 x2 = sin ϑ2 cos ϕ2,
y1 = sin ϑ1 sin ϕ1 y2 = sin ϑ2 sin ϕ2,
z1 = cos ϑ1 z2 = cos ϑ2
(10.96)
and represent the orientations by two-dimensional spinors according to Eq. 1.52
a1
b1
=
cos ϑ1
2
e−i
ϕ1
2
sin ϑ1
2
e+i
ϕ1
2
a2
b2
=
cos ϑ2
2
e−i
ϕ2
2
sin ϑ2
2
e+i
ϕ2
2
. (10.97)
So far, we have described individual vectors. Now we extend the description to a coupled pair of µ1 and µ2. Inspired by representation
of the coupled quantum states, we write a direct product of the orientation spinors
a1
b1
⊗
a2
b2
=




a1a2
a1b2
b1a2
b1b2



 . (10.98)
We check if the four components of the direct product really represent orientation vectors. Using Eq. 10.97, we evaluate
ana∗
n + bnb∗
n = cos2 ϑn
2
+ sin2 ϑn
2
= 1, (10.99)
ana∗
n − bnb∗
n = cos2 ϑn
2
− sin2 ϑn
2
= sin ϑn = zn, (10.100)
anb∗
n + bna∗
n = sin
ϑn
2
cos
ϑn
2
e−iϕn + e+iϕn = sin ϑn cos ϕn = xn, (10.101)
anb∗
n − bna∗
n = sin
ϑn
2
cos
ϑn
2
e−iϕn − e+iϕn = −i sin ϑn sin ϕn = −iyn, (10.102)
where n = 1 or n = 2. Combining components of r1 and r2, we express
x1 = x1 · 1 = (a1b∗
1 + b1a∗
1)(a2a∗
2 + b2b∗
2) = a1a2b∗
1a∗
2 + b1a2a∗
1a∗
2 + a1b2b∗
1b∗
2 + b1b2a∗
1b∗
2 = u1 + u∗
1 + v1 + v∗
1 , (10.103)
−iy1 = −iy1 · 1 = (a1b∗
1 − b1a∗
1)(a2a∗
2 + b2b∗
2) = a1a2b∗
1a∗
2 − b1a2a∗
1a∗
2 + a1b2b∗
1b∗
2 − b1b2a∗
1b∗
2 = u1 − u∗
1 + v1 − v∗
1 , (10.104)
x1z2 = x1 · z2 = (a1b∗
1 + b1a∗
1)(a2a∗
2 − b2b∗
2) = a1a2b∗
1a∗
2 + b1a2a∗
1a∗
2 − a1b2b∗
1b∗
2 − b1b2a∗
1b∗
2 = u1 + u∗
1 − v1 − v∗
1 , (10.105)
−iy1z2 = −iy1 · z2 = (a1b∗
1 − b1a∗
1)(a2a∗
2 − b2b∗
2) = a1a2b∗
1a∗
2 − b1a2a∗
1a∗
2 − a1b2b∗
1b∗
2 + b1b2a∗
1b∗
2 = u1 − u∗
1 − v1 + v∗
1 , (10.106)
10.10. SUPPORTING INFORMATION 263
x2 = 1 · x2 = (a1a∗
1 + b1b∗
1)(a2b∗
2 + b2a∗
2) = a1a2a∗
1b∗
2 + b1a2b∗
1b∗
2 + a1b2a∗
1a∗
2 + b1b2b∗
1a∗
2 = u2 + v2 + u∗
2 + v∗
2 , (10.107)
−iy1 = −1 · iy2 = (a1a∗
1 + b1b∗
1)(a2b∗
2 − b2a∗
2) = a1a2a∗
1b∗
2 + b1a2b∗
1b∗
2 − a1b2a∗
1a∗
2 − b1b2b∗
1a∗
2 = u2 + v2 − u∗
2 − v∗
2 , (10.108)
z1x2 = z1 · x2 = (a1a∗
1 − b1b∗
1)(a2b∗
2 + b2a∗
2) = a1a2a∗
1b∗
2 − b1a2b∗
1b∗
2 + a1b2a∗
1a∗
2 − b1b2b∗
1a∗
2 = u2 − v2 + u∗
2 − v∗
2 , (10.109)
−iz1y2 = −iz1 · y2 = (a1a∗
1 − b1b∗
1)(a2b∗
2 − b2a∗
2) = a1a2a∗
1b∗
2 − b1a2b∗
1b∗
2 − a1b2a∗
1a∗
2 + b1b2b∗
1a∗
2 = u2 − v2 − u∗
2 + v∗
2 , (10.110)
Our goal is to describe how the orientations evolve in time. We start by analyzing a single magnetic moment µ in a magnetic ﬁeld B.
Rotation of a spinor about the z axis changes only the azimuths ϕ1 and ϕ2. According to Eq. 1.62, rotation of a two-component spinor
(representing a single vector) about the z axis with the angular velocity ω = dϕ/dt is given by
a
b
=
e−i ωt
2 0
0 e+i ωt
2
a
b
=
e−i ωat
2 0
0 e−i
ωbt
2
a
b
, (10.111)
where we relabeled the frequencies as ω = ωa and −ω = ωb, resulting in the relation ω = (ωa − ωb)/2. The diﬀerential equation
describing evolution of the spinor can be derived by analyzing rotation for a small time increment dt:
a + da
b + db
=
e−i ωadt
2 0
0 e−i
ωbdt
2
a
b
. (10.112)
As dt → 0, the exponential terms can be replaced by 1 − iωdt/2 and 1 + iωdt/2, respectively (Taylor’s expansion), and the equation
can be written as
da
db
=
1 − i ωadt
2
0
0 1 − i ωbdt
2
a
b
−
a
b
= dt
1 − i ωa
2
0
0 1 − i ωb
2
a
b
−
1 0
0 1
a
b
= dt
−i ωa
2
0
0 −i ωb
2
a
b
. (10.113)
Dividing by dt yields
d
dt
a
b
=
−iωa
2
0
0 −i ωb
2
a
b
. (10.114)
The diﬀerential equations describing evolution of spinors representing individual orientations can be easily extended to a pair of
coupled magnetic moments. The spinor includes four combinations of azimuths that rotate with four frequencies, labeled here as ωaa, ωab,
ωba, and ωbb:
d
dt




a1a2
a1b2
b1a2
b1b2



 =




−i ωaa
2
0 0 0
0 −i ωab
2
0 0
0 0 −i ωba
2
0
0 0 0 −i ωbb
2








a1a2
a1b2
b1a2
b1b2



 . (10.115)
The same equation describes evolution of magnetizations, assuming that the distributions of magnetic moments rotate coherently with
constant frequencies (i.e., without relaxation which redistributes magnetic moment orientations). In order to identify the frequencies ωaa,
ωab, ωba, and ωbb, we use Eq. 10.115 to express time derivatives of un and vn.
du1
dt
=
d(a1a2)
dt
b∗
1a∗
2 + a1a2
d(b∗
1a∗
2)
dt
= −i
ωaa
2
a1a2b∗
1a∗
2 + i
ωba
2
a1a2b∗
1a∗
2 = −i
ωaa − ωba
2
u1, (10.116)
dv1
dt
=
d(a1b2)
dt
b∗
1b∗
2 + a1b2
d(b∗
1b∗
2)
dt
= −i
ωab
2
a1b2b∗
1b∗
2 + i
ωbb
2
a1a2b∗
1b∗
2 = −i
ωab − ωbb
2
v1, (10.117)
du2
dt
=
d(a1a2)
dt
a∗
1b∗
2 + a1a2
d(a∗
1b∗
2)
dt
= −i
ωaa
2
a1a2a∗
1b∗
2 + i
ωba
2
a1a2a∗
1b∗
2 = −i
ωaa − ωab
2
u2, (10.118)
dv2
dt
=
d(b1a2)
dt
b∗
1b∗
2 + b1a2
d(b∗
1b∗
2)
dt
= −i
ωba
2
b1a2b∗
1b∗
2 + i
ωbb
2
b1a2b∗
1b∗
2 = −i
ωba − ωbb
2
v2. (10.119)
du∗
1
dt
= +i
ωaa − ωba
2
u∗
1, (10.120)
dv∗
1
dt
= +i
ωab − ωbb
2
v∗
1 , (10.121)
du∗
2
dt
= +i
ωaa − ωab
2
u∗
2, (10.122)
dv∗
2
dt
= +i
ωba − ωbb
2
v∗
2 . (10.123)
264
After combining Eqs. 10.92–10.95 and using Eqs. 10.103–10.110
d(x1 − iy1)
dt
= 2
d(u1 + v1)
dt
= −iω0,1(x1 − iy1) − iπJ(x1z2 − iy1z2) = −2iω0,1(u1 + v1) − 2iπJ(u1 − v1), (10.124)
d(x2 − iy2)
dt
= 2
d(u2 + v2)
dt
= −iω0,2(x2 − iy2) − iπJ(z1x2 − iz1y2) − 2iω0,2(u2 + v2) − 2iπJ(u2 − v2). (10.125)
Substituting the time derivatives of the spinor components Eqs. 10.116–10.123 and dividing the equations by the imaginary unit, we
obtain
d(u1 + v1)
idt
= −
ωaa − ωba
2
u1 −
ωab − ωbb
2
v1 = −ω0,1(u1 + v1) + πJ(u1 − v1) = −(ω0,1 + πJ)u1 + (ω0,1 − πJ)v1, (10.126)
d(u2 + v2)
idt
= −
ωaa − ωab
2
u2 −
ωba − ωbb
2
v2 = −ω0,2(u2 + v2) + πJ(u2 − v2) = −(ω0,2 + πJ)u2 + (ω0,2 − πJ)v2. (10.127)
Comparison of frequency terms multiplying un and vn shows that
ω0,1 + πJ =
ωaa − ωba
2
, ω0,1 − πJ =
ωab − ωbb
2
, (10.128)
ω0,2 + πJ =
ωaa − ωab
2
, ω0,2 − πJ =
ωba − ωbb
2
. (10.129)
Adding and subtracting the ω0,n±πJ terms and noting that the imaginary exponents in the spinor (and therefore their time derivatives,
i.e., frequencies) sum to zero
ωaa + ωba + ωab + ωbb = 0 (10.130)
yields a set of equations
ω0,1 =
ωaa + ωab − ωba − ωbb
4
, (10.131)
ω0,2 =
ωaa − ωab + ωba − ωbb
4
, (10.132)
πJ =
ωaa − ωab − ωba + ωbb
4
, (10.133)
0 =
ωaa + ωab + ωba + ωbb
4
. (10.134)
Combinations of the equations provide
ωaa = +ω0,1 + ω0,2 + πJ, (10.135)
ωab = +ω0,1 − ω0,2 − πJ, (10.136)
ωba = −ω0,1 + ω0,2 − πJ, (10.137)
ωbb = −ω0,1 − ω0,2 + πJ. (10.138)
Knowing the frequencies, we can express explicitly
u1 = u10e−i
ωaa−ωba
2
t
= a10b∗
10a20a∗
20e−i(ω0,1+πJ)t
= sin
ϑ10
2
cos
ϑ10
2
e−iφ10 cos2
ϑ20
2
e−i(ω0,1+πJ)t
, (10.139)
v1 = u10e−i
ωab−ωbb
2
t
= a10b∗
10b20b∗
20e−i(ω0,1−πJ)t
= sin
ϑ10
2
cos
ϑ10
2
e−iφ10 sin2 ϑ20
2
e−i(ω0,1−πJ)t
, (10.140)
where un0, vn0, an0, bn0, ϑn0, and ϕ10 are the initial values of un, vn, an, bn, ϑn, and ϕ1, respectively. Using standard trigonometric
relations and deﬁnitions of spherical coordinates,
10.10. SUPPORTING INFORMATION 265
u1 =
1
4
sin ϑ10(cos φ10 − i sin φ10)(1 + cos ϑ20)e−i(ω0,1+πJ)t
=
1
4
(x10 − iy10)(1 + z20)(cos((ω0,1 + πJ)t) − i sin((ω0,1 + πJ)t))
=
1
4
(x10 − iy10 + x10z20 − iy10z20) cos((ω0,1 + πJ)t) +
1
4
(−ix10 − y10 − ix10z20 − y10z20) sin((ω0,1 + πJ)t) (10.141)
v1 =
1
4
sin ϑ10(cos φ10 − i sin φ10)(1 − cos ϑ20)e−i(ω0,1−πJ)t
=
1
4
(x10 − iy10)(1 − z20)(cos((ω0,1 + πJ)t) − i sin((ω0,1 + πJ)t))
=
1
4
(x10 − iy10 − x10z20 + iy10z20) cos((ω0,1 − πJ)t) +
1
4
(−ix10 − y10 + ix10z20 + y10z20) sin((ω0,1 − πJ)t). (10.142)
Changing the sign of the imaginary unit,
u∗
1 =
1
4
(x10 + iy10 + x10z20 + iy10z20) cos((ω0,1 + πJ)t) +
1
4
(ix10 − y10 + ix10z20 − y10z20) sin((ω0,1 + πJ)t) (10.143)
v∗
1 =
1
4
(x10 + iy10 − x10z20 − iy10z20) cos((ω0,1 − πJ)t) +
1
4
(ix10 − y10 + ix10z20 + y10z20) sin((ω0,1 − πJ)t). (10.144)
Evolution of u2, u∗
2, v2, and v∗
2 can be described in the same fashion.
We are now ready to analyze evolution of magnetization. In a ﬁrst example, we examine magnetization of the ﬁrst nucleus in a
heteronuclear pair after a 90◦ excitation pulse. M1+ = M1x + iM1y is proportional to x1 + iy1 = 2(u∗
1 + v∗
1 ).
After the 90◦ excitation pulse, the magnetic moments are polarized in the −y direction. Therefore, x10 = 0. If we assume that the
equilibrium distributions of µ1 and µ2 are almost independent, then
y10z20 = y10 · z20 y10 (10.145)
because magnetic moments are very little polarized under typical circumstances (z20 1). We can therefore write
x1 + iy1 = 2(u∗
1 + v∗
1 ) =
y10
2
(i cos((ω0,1 + πJ)t) + i cos((ω0,1 − πJ)t) − sin((ω0,1 + πJ)t) − sin((ω0,1 − πJ)t))
= i
y10
2
(cos((ω0,1 + πJ)t) + cos((ω0,1 − πJ)t) + i sin((ω0,1 + πJ)t) + i sin((ω0,1 − πJ)t)) =
y10
2
ei π
2 ei(ω0,1+πJ)t
+ ei(ω0,1−πJ)t
,
(10.146)
which is the same result as obtained by analyzing the density matrix.
In a second example, we assume that M1+ ﬁrst evolved for t = 1/(2J). Then,
x1 + iy1 t =
1
2J
=
y10
2
ei π
2 ei
ω0,1
2J
+i π
2 + ei
ω0,1
2J
−i π
2 =
y10
2
ei
ω0,1
2J eiπ
+ e0
=
y10
2
ei
ω0,1
2J (−1 + 1) = 0, (10.147)
On the other hand,
x1z2 = u1 + u∗
1 − v1 − v∗
1 = −
y10
2
sin((ω0,1 + πJ)t) +
y10
2
sin((ω0,1 − πJ)t) (10.148)
and
x1z2 t =
1
2J
=
y10
2
− sin
ω0,1
2J
+
π
2
+ sin
ω0,1
2J
−
π
2
= −y10 cos
ω0,1
2J
. (10.149)
If we continue in a rotating coordinate frame and M1 is exactly on resonance (ω0,1 → Ω1 = 0),
x1z2 t =
1
2J
= −y10. (10.150)
If we start to count time from this moment, x10 = y10 = 0 but y10z20 = 0. M1+ then evolves as
x1 + iy1 = 2(u∗
1 + v∗
1 ) =
y10z20
2
(i cos((ω0,1 + πJ)t) − i cos((ω0,1 − πJ)t) − sin((ω0,1 + πJ)t) + sin((ω0,1 − πJ)t))
= i
y10z20
2
(cos((ω0,1 + πJ)t) − cos((ω0,1 − πJ)t) + i sin((ω0,1 + πJ)t) − i sin((ω0,1 − πJ)t)) =
y10z20
2
ei π
2 ei(ω0,1+πJ)t
− ei(ω0,1−πJ)t
.
(10.151)
We obtain the same result, corresponding to anti-phase peaks, in Section 11.2 by the analysis of the density matrix.
266
10.10.4 Comparison of classical and quantum analysis of J-coupling
We now try to compare how the classical analysis presented in Section 10.10.3 and quantum mechanical analysis describe relationship
between energy of stationary states and precession frequency. We start by the classical treatment. We assume that a magnetic moment is
placed in a strong homogeneous magnetic ﬁeld B0 deﬁning the direction of the axis z (a typical case in NMR). On suﬃciently long time
scales, eﬀects of other, weaker ﬁelds, average to zero, unless the weak ﬁelds are oriented along B0 or rotate with a frequency close to −γB0
(secular approximation).
Position of µ in a magnetic ﬁeld B, composed of B0 and a z-component of an additional weak ﬁeld, changes as µ precesses about the
z-axis with a frequency (Eq. 20)
ω = −γBz = −γ|B|. (10.152)
The only exceptions are two stationary states, when µ is oriented in the same direction as B or in the direction opposite to B. The
former stationary state of µ (labeled here as µa) corresponds to a spinor with a = 1 and b = 0 (therefore, z = aa∗ − bb∗ = 1) and has an
energy (Eqs. 6 and 19)
Ea = −B · µa = −γB · La = −γBzLa,z = ωLa,z = −ωL. (10.153)
The latter stationary state of µ (labeled here as µb = −µa) corresponds to a spinor with a = 0 and b = 1 (therefore, z = aa∗−bb∗ = −1)
and has an energy
Eb = −B · µb = −γB · Lb = −γBzLb,z = ωLb,z = +ω|L|. (10.154)
If we compare Eqs.10.152, 10.153, and 10.154, we obtain
Eb − Ea = ω(ωLb,z − ωLa,z) = ω∆Lz = ω(2|L|). (10.155)
As we have described in Section 5.5, relativistic quantum mechanics also reveals existence of two quantum states (|α and |β ) of
a spin magnetic moment of an electron (and similar particles) in a homogeneous magnetic ﬁeld. A result of measuring z-component of
spin angular momentum must be one of two eigenvalues of the operator ˆIz, i.e., + /2 or − /2, related to two eigenvalues of the spin
Hamiltonian ˆH = −γBz
ˆIz:
Eα = −γBzIα,z = −γBz
2
, (10.156)
Eβ = −γBzIβ,z = +γBz
2
. (10.157)
The energy diﬀerence Eβ − Eα is related to the precession frequency of a spin magnetic moment in a general (superposition) state as
Eβ − Eα = −γBz(Iβ,z − Iα,z) = ω(Iβ,z − Iα,z) = ω∆Iz = ω . (10.158)
We see that the classical and quantum results diﬀer only in the value of ∆Lz (equal to 2|L|) vs. ∆Iz (equal to = 2|I|/
√
3).
10.10.5 J-coupling compared to classical coupled oscillators
To see relation between J-coupling and coupling discussed in classical physics, we analyze obviously classical coupled systems: two coupled
oscillators or pendulums. Let us consider two equal masses m attached to walls with springs of the stiﬀness k1 = k2 = k and connected
with another spring of the stiﬀness k12:
| · · · · · · k1 · · · · · · (m) · · · · · · k12 · · · · · · (m) · · · · · · k2 · · · · · · |
The horizontal displacements x1 and x2, respectively, of the masses from their equilibrium positions can be calculated from the second
Newton’s law:
F1 = ma1 = m
d2x1
dt2
F2 = ma2 = m
d2x2
dt2
. (10.159)
If both masses move together (x1 = x2), the middle spring does not get stretched or compressed and both masses just experience the
forces of the outer springs
F1 = −k1x1 = −kx1 F2 = −k2x2 = −kx2. (10.160)
As x1 = x2, both masses experience the same force F1 = F2. Therefore,
F1 + F2 = m
d2x+
dt2
= −kx+, F1 − F2 = 0 = m
d2x−
dt2
= −kx− (10.161)
10.10. SUPPORTING INFORMATION 267
where x+ = x1 + x2 and x− = x1 − x2. The second derivatives of sine and cosine functions are proportional to the sine and cosine
functions themselves:
d2
dt2
sin(ω+t) =
d
dt
d sin(ω+t)
dt
=
d
dt
(ω+ cos(ω+t)) = −ω2
+ sin(ω+t), (10.162)
d2
dt2
cos(ω+t) =
d
dt
d cos(ω+t)
dt
=
d
dt
(−ω+ sin(ω+t)) = −ω2
+ cos(ω+t). (10.163)
The solution of our diﬀerential equation can be thus written in a form
x+ = A+ sin(ω+t) + B+ cos(ω+t). (10.164)
The coeﬃcients A+, B+ can be obtained from the initial conditions. If we start measurement from the time when both masses pass
their equilibrium positions (x1 = x2 = 0 at t = 0),
x+(t = 0) = 0 = A+ sin(0) + B+ cos(0) = B+ (10.165)
and
x+ = A+ sin(ω+t). (10.166)
Diﬀerentiation
d2x+
dt2
= A+
d2 sin(ω+t)
dt2
= −A+ω2
+ sin(ω+t) = −ω2
+x+ = −
k
m
x+ (10.167)
shows that
ω+ =
k
m
. (10.168)
Motions of individual masses can be back-calculated from the evaluated x+ and from the fact that x− = 0:
x1 =
1
2
(x+ + x−) =
A+
2
sin(ω+t), x2 =
1
2
(x+ − x−) =
A+
2
sin(ω+t). (10.169)
If the masses move exactly in the opposite directions (anti-phase), the stiﬀness of the middle spring k12 increases the forces F1 = −F2
experienced by both masses
F1 = −(k1 + k12)x1 = −(k + k12)x1 F2 = −F1 = (k2 + k12)x2 = (k + k12)x2 (10.170)
and
F1 − F2 = m
d2x−
dt2
= −(k + 2k12)x−, F1 + F2 = 0 = m
d2x+
dt2
. (10.171)
The solution is
x− = A− sin(ω−t) + B− cos(ω−t). (10.172)
If we start again from equilibrium positions (x1 = x2 = 0 at t = 0),
x−(t = 0) = 0 = A− sin(0) + B− cos(0) = B−, (10.173)
x− = A− sin(ω−t), (10.174)
d2x−
dt2
= A−
d2 sin(ω−t)
dt2
= −A−ω2
− sin(ω−t) = −ω2
−x− = −
k + 2k12
m
x−, (10.175)
and
ω− =
k + 2k12
m
. (10.176)
Motions of individual masses are now calculated from the evaluated x− and from the fact that x+ = 0:
x1 =
1
2
(x+ + x−) = +
A−
2
sin(ω+t), x2 =
1
2
(x+ − x−) = −
A−
2
sin(ω+t). (10.177)
268
The two discussed modes of motions (in-phase and anti-phase oscillations) are stationary. The masses move with constant frequencies
and amplitudes. All other modes (e.g. starting with the left mass in the equilibrium and the right mass displaced) exhibit beats, double
oscillations with combinations of ω+ and ω−.
For example, if we start from
x1(0) = A, x2(0) = 0, (10.178)
the initial conditions are
x+(0) = A+ sin(ω+0)+B+ cos(ω+0) = x1(0)+x2(0) =
A
2
, x−(0) = A− sin(ω−0)+B− cos(ω−0) = x1(0)−x2(0) =
A
2
, (10.179)
telling us that A+ = A− = 0 and B+ = B− = A. The individual masses move as
x1 =
1
2
(x+ + x−) =
A
2
(cos(ω+t) + cos(ω−t)) =
A
2
cos
ω+ + ω−
2
t +
ω+ − ω−
2
t + cos
ω+ + ω−
2
t −
ω+ − ω−
2
t
= A cos
ω+ + ω−
2
t cos
ω+ − ω−
2
t (10.180)
and
x2 =
1
2
(x+ − x−) =
A
2
(cos(ω+t) − cos(ω−t)) =
A
2
cos
ω+ + ω−
2
t +
ω+ − ω−
2
t − cos
ω+ + ω−
2
t −
ω+ − ω−
2
t
= A sin
ω+ + ω−
2
t sin
ω+ − ω−
2
t . (10.181)
If k12 k, ω+ and ω− are similar and
ω+ − ω−
2
ω+ + ω−
2
. (10.182)
The second cosine in equations describing motions of the individual masses represents a second, slow oscillation of the rapidly oscillating
positions.
Analysis of two coupled pendulums (connected with a spring of the stiﬀness k12) yields the same solution (for small swing angles),
k/m is just replaced by g/l (gravitational acceleration divided by the length of the pendulum).
How are the coupled oscillators or pendulums related to J-coupling? If we further diﬀerentiate Eq. 10.115 we obtain second-order
equations of the same form as those describing the second Newton’s law for the coupled oscillators. For example,
d2(a1a2)
dt2
=
d
dt
d(a1a2)
dt
= i
−ω0,1 − ω0,2 − πJ
2
i
−ω0,1 − ω0,2 − πJ
2
a1a2, = −
ω0,1 + ω0,2 + πJ
2
2
a1a2. (10.183)
As described above, the general form of the solution is
a1a2 = Aaa sin
ω0,1 + ω0,2 + πJ
2
+ Baa cos
ω0,1 + ω0,2 + πJ
2
. (10.184)
The coeﬃcients Aaa and Baa can be determined by comparing Eq. 10.184 with the solution obtained earlier (Eq. ??):
a1a2 = a1(0)a2(0)ei
−ω0,1−ω0,2−πJ
2
t
= a1(0)a2(0)
Baa
cos
ω0,1 + ω0,2 + πJ
2
−ia1(0)a2(0)
Aaa
sin
ω0,1 + ω0,2 + πJ
2
. (10.185)
We see that evolution of magnetic moments due to the J-coupling can be described by the same equations as coupled oscillation. The
only diﬀerence is that the coeﬃcients Aaa and Baa are complex numbers (note that the spinor components a1(0) and a2(0) are complex
numbers in general).
10.10. SUPPORTING INFORMATION 269
10.10.6 Two J-coupled nuclei in thermal equilibrium
Before we analyze evolution of the density matrix in a 2D experiment, we must deﬁne its initial form. Again, we start from the thermal
equilibrium and use the Hamiltonian. The diﬀerence from the case of isolated nuclei is that we need to deﬁne a 4 × 4 density matrix in
order to describe a pair of mutually interacting nuclei. As explained above, the oﬀ-diagonal elements of the equilibrium density matrix
(proportional to Ix and Iy) are equal to zero. The four diagonal elements describe average populations of four stationary states of a
system composed of (isolated) nuclear pairs: αα, αβ, βα, and ββ. These populations are:
Peq
αα =
e−Eαα/kBT
e−Eαα/kBT + e−Eαβ /kBT
+ e−Eβα/kBT
+ e−Eββ /kBT
≈
1 − Eαα
kBT
4
, (10.186)
Peq
αβ =
e−Eαβ /kBT
e−Eαα/kBT + e−Eαβ /kBT
+ e−Eβα/kBT
+ e−Eββ /kBT
≈
1 −
Eαβ
kBT
4
, (10.187)
Peq
βα =
e−Eβα/kBT
e−Eαα/kBT + e−Eαβ /kBT
+ e−Eβα/kBT
+ e−Eββ /kBT
≈
1 −
Eβα
kBT
4
, (10.188)
Peq
ββ =
e−Eββ /kBT
e−Eαα/kBT + e−Eαβ /kBT
+ e−Eβα/kBT
+ e−Eββ /kBT
≈
1 −
Eββ
kBT
4
. (10.189)
In principle, the total Hamiltonian also includes the term ˆHJ , which describes the J coupling and which is not averaged to zero.
ˆH = −γ1B0(1 + δi,1)ˆI1z − γ2B0(1 + δi,2)ˆI2z +
2π ˆI1z
ˆI2z = (10.190)
−γ1B0(1 + δi,1)
2




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 − γ2B0(1 + δi,2)
2




1 0 0 0
0 −1 0 0
0 0 1 0
0 0 0 −1



 +
πJ
2 2




1 0 0 0
0 −1 0 0
0 0 −1 0
0 0 0 1



 .
(10.191)
where the diagonal elements (eigenvalues) are the energies of the individual states. Therefore, the populations (diagonal elements of
the density matrix) should be given by
Peq
αα ≈
1 − Eαα
kBT
4
=
1
4
+ γ1(1 + δi,1)
B0
8kBT
+ γ2(1 + δi,2)
B0
8kBT
−
πJ
16kBT
, (10.192)
Peq
αβ ≈
1 −
Eαβ
kBT
4
=
1
4
+ γ1(1 + δi,1)
B0
8kBT
− γ2(1 + δi,2)
B0
8kBT
+
πJ
16kBT
, (10.193)
Peq
βα ≈
1 −
Eβα
kBT
4
=
1
4
− γ1(1 + δi,1)
B0
8kBT
+ γ2(1 + δi,2)
B0
8kBT
+
πJ
16kBT
, (10.194)
Peq
ββ ≈
1 −
Eββ
kBT
4
=
1
4
− γ1(1 + δi,1)
B0
8kBT
− γ2(1 + δi,2)
B0
8kBT
−
πJ
16kBT
. (10.195)
(10.196)
However, the values of J in typical organic compounds are at least ﬁve orders of magnitude lower than the frequencies measured even
at low-ﬁeld magnets. As a consequence, the contribution of J-coupling can be safely neglected, and the initial density matrix is identical
to that derived for a pair of nuclei interacting through space (Eq. 8.39).
10.10.7 Coherences depicted as double arrows
Algebraic analysis of the corresponding density matrix evolution is straightforward, but somewhat tedious. An alternative graphical analysis
using ”double arrows” was mentioned in Section 10.4 and used in Figure 10.2. Here we discus the ”double arrow” visualization in more
detail.
We have introduced a graphical representation of the product operators (density matrix contributions) in Table 8.2, where contribution
of each coherence is visualized as a colored plot of the magnetic moment distributions. Two examples are shown in the second column of
Table 10.1. The third column of Table 10.1 explains the diﬀerence between the depicted coherences. On one hand, the −I1y coherence
describes transverse polarization of µ1 regardless of the distribution of µ2. Therefore, we observe transverse polarization of µ1 (highlighted
270
cyan arrows of most polarized µ1) in the same direction if we look at a fraction of molecules with µ2 pointing mostly up or mostly down
(highlighted green arrows of µ2). On the other hand, the 2I1xI2z coherence describes correlation of transverse polarization of µ1 with
the longitudinal polarization of µ2. Therefore, we observe transverse polarization of µ1 (cyan arrows) in the opposite directions if we look
at a fraction of molecules with µ2 pointing mostly up or mostly down (green arrows).
In order to depict transverse polarization of three magnetic moments in a single diagram, the graphical representation is further
simpliﬁed in the fourth column. Directions of the orange arrows in the fourth column show transverse polarization of µ1 whereas the style
(dashed or solid) of the arrows describes longitudinal polarization of µ2 (up or down, respectively) in the same fraction of molecules. The
solid and dashed arrows can be viewed as vectors of partial magnetizations, and are thus aﬀected by the radio waves in the same way as
the magnetization vectors. Our graphical analysis can be thus viewed as an extension of the vector model, presented e.g. by Keeler in K4.
As described in Section 10.4, the evolution due to the chemical shift is represented by simultaneous rotation of the arrows (solid and
dashed arrows rotate by the same angle Ω1t or Ω2t in the same direction). The evolution due to the J-coupling is represented by mutual
rotation of the arrows (solid and dashed arrows rotate by the same angle πJt in the opposite direction). Evolution of coherences of µ1
in the presence of chemical shift, J-coupling, or both is depicted in Figures 10.8–10.10, respectively. Figure 10.11 shows evolution of the
coherences described by double arrows for various signs of Ω1 and J, and relates them to the spectra plotted according to the standard
conventions.
10.10. SUPPORTING INFORMATION 271
Table 10.1: Examples of two graphical representations of coherences: as distributions (used in Table 8.2) and as arrows (used in
Figure 10.2). The color coding of distributions is similar to that used in Tables 8.1–8.2. Cyan arrows represent magnetic moments
resonating with the applied radio wave (magnetic moments of 1H in our example) most aligned along the −y direction. Green arrows
represent magnetic moments that do not resonate with the radio frequency (magnetic moments of 13C or 15N in our example) in the same
molecules. The solid and dashed orange arrows presented in the last two columns correspond to partial distributions of proton magnetic
moments, shown in cyan in the third column. The direction of the arrow is given by the average direction of the cyan arrows, the type
(dashed or solid) of the arrow is given by the average direction of the green arrows in the third column (up or down, respectively). The
orientation of axes in the schematic drawings of the magnetic moment distributions is shown below the table.
Coherence depicted as distributions: decomposed distributions depicted decomposed
contributing to ˆρ selected molecules with µ of 1H based on µ of 13C/15N as arrows arrows
in addition to It closest to −y (in-phase) or being closer to +z vs. to −z
closest to x (anti-phase) in the selected molecules
−I1y = − i
2




0 0 −1 0
0 0 0 −1
+1 0 0 0
0 +1 0 0




y
x
2I1xI2z = 1
2




0 0 +1 0
0 0 0 −1
+1 0 0 0
0 −1 0 0




x
y
−y
z
x
272
−1
−1
−1
−1
0
0
0
0
1
1
1
1
0
0
0
0
1
1
1
1
2
2
2
2
3
3
3
3
4
4
4
4
5
5
5
5
6
6
6
6
7
7
7
7
8
8
8
8
9
9
9
9
10
10
10
10
11
11
11
11
12
12
12
12
13
13
13
13
14
14
14
14
15
15
15
15
16
16
16
16
17
17
17
17
18
18
18
18
19
19
19
19
20
20
20
20
21
21
21
21
22
22
22
22
23
23
23
23
24
24
24
24
x
y
+s1cJ
−c1sJ
+s1sJ
+c1sJ
I1x:
I1y:
2I1yI2z:
2I1xI2z:
Figure 10.8: Coherence evolution due to the chemical shift in the absence of J-coupling (J = 0). Several snapshots of evolving
coherences are shown in the circles in the middle. Evolution of the I1x and I1y coherences is plotted above the snapshots. Evolution of
the 2I1xI2z and 2I1yI2z coherences is plotted below the snapshots. The blue bar coincides with the transverse polarization of magnetic
moments (cyan arrow), size of which is preserved in the presence of the chemical shifts and in the absence of J-coupling. The 2I1xI2z
and 2I1yI2z coherences do not evolve as sJ = sin(πJt) = 0.
10.10. SUPPORTING INFORMATION 273
−1
−1
−1
−1
0
0
0
0
1
1
1
1
0
0
0
0
1
1
1
1
2
2
2
2
3
3
3
3
4
4
4
4
5
5
5
5
6
6
6
6
7
7
7
7
8
8
8
8
9
9
9
9
10
10
10
10
11
11
11
11
12
12
12
12
13
13
13
13
14
14
14
14
15
15
15
15
16
16
16
16
17
17
17
17
18
18
18
18
19
19
19
19
20
20
20
20
21
21
21
21
22
22
22
22
23
23
23
23
24
24
24
24
x
y
+s1cJ
−c1cJ
+c1sJ
+s1sJ
2I1yI2z:
2I1xI2z:
I1y:
I1x:
Figure 10.9: Coherence evolution due to the J-coupling for Ω1 = 0. Several snapshots of evolving coherences are shown in the circles
in the middle. Evolution of the I1x and I1y coherences is plotted above the snapshots. Evolution of the 2I1xI2z and 2I1yI2z coherences
is plotted below the snapshots. The −I1y coherence (blue bar) evolves into the 2I1xI2z coherences (red bars). The orientation of the
2I1xI2z coherence is given by the direction of the dashed bar. The direction of the transverse polarization in the absence of the J-coupling
is shown as the cyan arrow. The solid and dashed arrows, used in this text to describe evolution of coherences, are shown in orange. The
blue and red bars are projections of the orange arrows to the directions parallel and perpendicular to the cyan arrow.
274
−1
−1
−1
−1
0
0
0
0
1
1
1
1
0
0
0
0
1
1
1
1
2
2
2
2
3
3
3
3
4
4
4
4
5
5
5
5
6
6
6
6
7
7
7
7
8
8
8
8
9
9
9
9
10
10
10
10
11
11
11
11
12
12
12
12
13
13
13
13
14
14
14
14
15
15
15
15
16
16
16
16
17
17
17
17
18
18
18
18
19
19
19
19
20
20
20
20
21
21
21
21
22
22
22
22
23
23
23
23
24
24
24
24
x
y
I1x:
I1y:
+s1cJ
2I1yI2z:
2I1xI2z: +s1sJ
+c1sJ
−c1cJ
Figure 10.10: Coherence evolution due to the chemical shift and J-coupling. Several snapshots of evolving coherences are shown
in the circles in the middle. Evolution of the I1x and I1y coherences is plotted above the snapshots. Evolution of the 2I1xI2z
and 2I1yI2z coherences is plotted below the snapshots. The orientation of the transverse µ1 polarization uncorrelated with the µ2
longitudinal polarization is given by the direction of the blue bar. The orientations of the transverse µ1 polarizations correlated with the
µ2 polarization in the +z and −z direction are given by the direction of the dashed and solid red bars, respectively. The direction of the
transverse polarization in the absence of the J-coupling is shown as the cyan arrow. The solid and dashed arrows, used in this text to
describe evolution of coherences, are shown in orange. The blue and red bars are projections of the orange arrows to the directions parallel
and perpendicular to the cyan arrow.
10.10. SUPPORTING INFORMATION 275
x
y
x
y
x
Laboratory frame
t > 0
x
y
x
y
x
y
x
y
x
y
x
y
y
x
y
x
y
γ < 0, Ω < 0 γ < 0, Ω > 0 γ > 0, Ω > 0
ωradiot
Ωt
−ωradiot + π
Rotating frame
ω0t
Ωt
0−ωradio+ωradio
ω−ωradio
0
Ω− ++
0
−
0
+ −+
0
−
+ δref
− + δref
− + δref
− + δref
− δ
γ > 0, Ω < 0
0
t = 0
t > 0
−+ωradio
−0+0+
y
ω0t ±πJt
J > 0J < 0
±πJt
x
Figure 10.11: Conventions used in NMR spectroscopy when describing the eﬀect of radio-wave pulses and the evolution of coherences,
and when presenting the spectra. Evolution of coherences symbolized by the solid and dashed orange arrows (introduced in Table 10.1)
for nuclei with diﬀerent signs of γ and Ω is shown in the circles. The ﬁrst row of circles represents the polarization immediately after
applying a very short 90◦ radio-wave pulse (at t = 0). The second and third row show directions of coherences and of B1 at time t the
laboratory and rotating coordinate frame, respectively. The direction of the transverse polarization in the absence of the J-coupling is
shown as the cyan arrow. The z axis is deﬁned by the direction of B0. The oscillating radio-wave magnetic ﬁeld is decomposed into two
counter-rotating components. The purple arrows indicate the direction of the resonant component B1 (+Bradio for γ < 0 and −Bradio for
γ > 0). The absolute value of B0 is supposed to be much greater that the amplitude of B1. Note the convention to add a phase of 180 ◦ to
the direction of B1 (i.e., to revert the direction) for γ > 0. The resulting spectra (after Fourier transformation and applying the necessary
phase correction, which is 90 ◦ in the presented cases), are plotted below the circles. Arrows above the spectra assign individual peaks to
the solid and dashed arrows. Note the convention to plot the frequency axis from the right to the left for nuclei with γ < 0.
276
Lecture 11
Correlated spectroscopy using J-coupling
Literature: INEPT, HSQC, and APT experiments are nicely described in K7.10, K8.7, and
K12.4.4., respectively. INEPT is discussed in detail in L16.3., HSQC in C7.1.1. Decoupling trains are
reviewed in C3.5. COSY is described in detail in L16.1, C6.2.1., and K8.3 (with a detailed discussion
of DQF-COSY in K8.4).
11.1 Through-bond correlation
Correlated spectroscopy greatly extends beneﬁts of NMR. We have described in Section 9.4 how
dipolar coupling allows us to correlate frequencies of nuclei (usually protons) that are close in space.
The distance-dependence of the mentioned NOESY experiment is great advantage as it provides
structural information (interatomic distance). However, the dependence of the signal on the a priori
unknown distances in molecules makes interpretation of NOESY spectra diﬃcult. In many cases, it
is desirable to introduce correlation mediated by an interaction that depends only on a presence of
a covalent bond between the observed nuclei. The one-bond J-coupling (usually between 13
C or 15
N
and attached protons) is an ideal choice. The one-bond coupling constants are almost identical in all
C–H bonds in molecules, and the same applies to N–H, C–C, C=C, etc. It is therefore possible to
design experiments where identical one-bond J-coupling provides correlation, and variable chemical
shift provides resolution.
Design of heteronuclear correlation experiments is facilitated by the possibility to apply radiowave
pulses selectively aﬀecting only one nucleus. Spin echoes can be used to separate the eﬀect of
the J-coupling from that of the chemical shift. In the following section, we ﬁrst introduce a pulse
sequence INEPT based on the simultaneous echo (Sections 11.2–11.3). It can be used as a building
block of multidimensional correlated experiments, but also as a one-dimensional experiment utilizing
the J-coupling to increase sensitivity of measurement of nuclei with low γ. Another application of
the simultaneous echo, known as ATP (attached proton test) and useful for analysis of the CHn
groups, is presented in Section 11.10.1. Then we describe the most popular heteronuclear correlation
experiment HSQC, built of the INEPT modules (Sections 11.5–11.8). Finally, we discuss the use of
J-coupling in homonuclear correlated spectroscopy (Section 11.9).
277
278
11.2 INEPT
INEPT is a heteronuclear NMR experiment based on the simultaneous echo. It diﬀers from the
simple simultaneous echo in two issues:
• The length of the delay τ is set to 1/4|J|
• The echo is followed by two 90◦
radio wave pulses, one applied at the same frequency as the
excitation pulse (the 90◦
pulse preceding the echo) – this one must be phase-shifted by 90 ◦
from the excitation pulse, and the other one applied at the frequency of the other nucleus (13
C
or 15
N in Fig. 11.1).
With τ = 1/4|J|, 2πτ = π/2, cJ = 0,sJ = 1 if J > 0, and sJ = −1 if J < 0. Therefore, the
density matrix at the end of the echo is1
ˆρ(e) = 1
2
It − 1
2
κ1 (2IxSz) − 1
2
κ2Sz
−→ ˆρ(f) = 1
2
It + 1
2
κ1 (2IzSz) − 1
2
κ2Sz after the ﬁrst pulse and
−→ ˆρ(g) = 1
2
It − 1
2
κ1 (2IzSy) + 1
2
κ2Sy after the second pulse.
If the experiment continues by acquisition, the density matrix evolves as
It −→ It −→ It (11.1)
−2IzSy −→



−c2 2IzSy −→
−c2cJ 2IzSy
+c2sJ Sx
+s2 2IxSz −→
+s2cJ 2IzSx
+s2sJ Sy
(11.2)
Sy −→



+c2Sy −→
+c2cJ Sy
−c2sJ 2SxIz
−s2Sx −→
−s2cJ Sx
−s2sJ 2SyIz
(11.3)
Both the ”blue” coherence 2IzSy and the ”green” coherence Sy evolve into ”measurable” product
operators, giving non-zero trace when multiplied by S+. Note that all components of ˆρ(g) commute
with Iz. Therefore, the chemical shift of the ﬁrst nucleus (Hamiltonian Ω1Iz) does not contribute
to the density matrix evolution.
Evaluation of the expected value of magnetization must take into account instrumental set up
used in heteronuclear experiments. Only frequencies of the observed nucleus pass the audio ﬁlters
of the NMR spectrometer (see footnote 13 in Section 7.10.5). Therefore, the detected transverse
magnetization is represented by the operator
ˆM2+ = Nγ2 S+ = Nγ2 (Sx + iSy) . (11.4)
1
The analysis is done for J > 0. If J < 0 (e.g. for one-bond 1
H-15
N coupling), all blue terms have the opposite
sign.
11.2. INEPT 279
H1
N15
C13
or
1
4J
N15
C13
or
1
4J
H1
a b g
y,−y
fe
1
H
N15
C13
or
Figure 11.1: INEPT pulse sequence applied to 1
H and 13
C or 15
N (top) and direct excitation of 13
C or 15
N (bottom).
The narrow and wide rectangles represent 90◦
and 180◦
radio wave pulses, respectively. The label y, −y above the
pulse indicates application of phase cycling to the labeled pulse (irradiation by a radio wave with the phases alternating
between values of 90 ◦
and 270◦
, relative to the ﬁrst pulse in the sequence, in subsequent measurements). Distributions
of magnetic moments corresponding to the density matrix contributions other than It are shown schematically above
the pulse sequences for time instants labeled by the red letters and arrows. For a better visibility, the distributions
are shown in a coordinate frame rotated by 90◦
counterclockwise about z, compared with the orientation used in
Table 8.2.
280
As discussed in Section 10.5, only products of Sx, Sy, Sx, Sy with S+ have non-zero traces:
Tr {Sx(Sx + iSy)} = Tr {Sx(Sx + iSy)} = 1, (11.5)
Tr {Sy(Sx + iSy)} = Tr {Sy(Sx + iSy)} = i, (11.6)
The expected value of M2+ is therefore
M2+ = Tr ˆρ(t) ˆM2+ = Nγ2 Tr {ˆρ(t)(Sx + iSy)} = Nγ2
κ1
2
(c2sJ + is2sJ ) +
κ2
2
(−s2cJ + ic2cJ ) .
The trigonometric relations cos(a±b) = cos a cos b sin a sin b and sin(a±b) = sin a cos b±cos a sin b
allow us to convert the products c2sJ , s2sJ s2cJ , and c2cJ to goniometric functions of (Ω2 −πJ)t and
(Ω2 − πJ)t
M2+ = Nγ2
κ1
4
(− sin((Ω1 − πJ)t) + sin((Ω1 + πJ)t) + i cos((Ω1 − πJ)t) − i cos((Ω1 + πJ)t)) +
Nγ2
κ2
4
(− sin((Ω1 − πJ)t) − sin((Ω1 + πJ)t) + i cos((Ω1 − πJ)t) + i cos((Ω1 + πJ)t))
= Nγ2
κ1
4
i (cos((Ω1 − πJ)t) + i sin((Ω1 − πJ)t) − cos((Ω1 + πJ)t) − i sin((Ω1 + πJ)t)) +
Nγ2
κ2
4
i (cos((Ω1 − πJ)t) + i sin((Ω1 − πJ)t) + cos((Ω1 + πJ)t) + i sin((Ω1 + πJ)t))
=
Nγ1γ2
8kBT
ei π
2 ei(Ω1−πJ)t
− ei(Ω1+πJ)t
+
Nγ2
2
8kBT
ei π
2 ei(Ω1−πJ)t
+ ei(Ω1+πJ)t
, (11.7)
where the unimportant red terms are removed by phase correction. After including relaxation
with the simpliﬁcations introduced in Section 10.5,
M2+ =
Nγ2
8kBT
e−R2
γ1 ei(Ω1−πJ)t
− ei(Ω1+πJ)t
+ γ2 ei(Ω1−πJ)t
+ ei(Ω1+πJ)t
. (11.8)
The real part of the spectrum obtained by the Fourier transformation is
{Y (ω)} =
Nγ2
2
2
B0
8kBT
+
γ1R2
R
2
2 + (ω − Ω2 + πJ)2
−
γ1R2
R
2
2 + (ω − Ω2 − πJ)2
+
Nγ2
2
2
B0
8kBT
+
γ2R2
R
2
2 + (ω − Ω2 + πJ)2
+
γ2R2
R
2
2 + (ω − Ω2 − πJ)2
(11.9)
• The ”blue” coherence 2IzSy gives a signal with opposite phases of the peaks at Ω2 − πJ and
Ω2 + πJ. Accordingly, it is called the anti-phase coherence.
• The ”green” coherence Sy gives a signal with the same phase of the peaks at Ω2 − πJ and
Ω2 + πJ. Accordingly, it is called the in-phase coherence.
11.3. PHASE CYCLING 281
Ω2
2πJ
{Y(ω)}
ω
Ω2
2πJ
{Y(ω)} ω
Figure 11.2: Real (left) and imaginary (right) components of an INEPT spectrum of a 1
H–13
C pair. The blue and
green curves are contributions of the INEPT transfer and direct excitation to the ﬁnal spectrum (red). Note that
the direct contribution makes the ﬁnal peak heights slightly unbalanced. The blue spectrum is obtained if the phase
sampling is applied, direct measurement of 13
C magnetization provides the green spectrum. The scale is the same as
in Figure 10.5.
• More importantly, the ”blue” coherence 2IzSy gives a signal proportional to |γ1| while the
”green” coherence Sy gives a signal proportional to |γ2|. The amplitude of the ”green” signal
corresponds to the amplitude of a regular 1D 15
N spectrum. The ”blue” signal ”inherited”
the amplitude with |γ1| from the excited nucleus, proton. In the case of 1
H and 15
N, |γ1| is
approximately ten times higher than |γ2|. Therefore, the blue signal is an order of magnitude
stronger. This is why this experiment is called Insensitive Nuclei Enhanced by Polarization
Transfer (INEPT).
11.3 Phase cycling
As described in the previous section, the ”blue” and ”green” signals of diﬀerent origins (evolving
from the ”blue” coherence 2IzSy and from the ”green” coherence Sy) are combined in the INEPT
experiment. It results in diﬀerent heights of the Ω2 − πJ and Ω2 + πJ peaks in the INEPT spectrum
(Figure 11.2). The ”blue” and ”green” signals can be separated if we repeat the measurement twice
with the phase of the proton y pulse shifted by 180 ◦
(i.e., with −y). The mentioned pulse converts
the 2IxSz operator in ˆρ(e) to −2IzSz if the relative phase of the radio wave is +90 ◦
(y), but to
+2IzSz if the phase is −90 ◦
(−y): ˆρ(e) = 1
2
It − 1
2
κ1 (2IxSz) − 1
2
κ2Sz −→
ˆρ(f) = 1
2
It ± 1
2
κ1 (2IzSz) − 1
2
κ2Sz −→
ˆρ(g) = 1
2
It
1
2
κ1 (2IzSy) + 1
2
κ2Sy
Such alteration of the phase does not aﬀect the ”green” signal, but changes the sign of the ”blue”
signal. If we subtract the spectra, we obtained a pure ”blue” signal. This trick, repeating acquisition
with diﬀerent phases, is known as phase cycling and is used routinely in NMR spectroscopy to remove
unwanted signals.
282
11.4 Simpliﬁed analysis
When analyzing more advanced NMR experiments, tracking the complete density matrix evolution
may be very demanding. In practice, the analysis is simpliﬁed (i) by working with the already
known eﬀects of the complete building blocks (spin echoes, INEPT) and (ii) by ignoring evolution of
the density matrix contributions that cannot inﬂuence the measured transverse magnetization. The
latter simpliﬁcation is based on the following considerations (presented for a heteronuclear pair of
magnetic moments).
• Only product operators representing uncorrelated transverse polarizations (Ix, Iy, Sx, Sy),
known as in-phase single-quantum coherences, directly contribute to the measurable signal.
Furthermore, only signal oscillating relatively close to the carrier frequency of the radio waves
passes the audio ﬁlters of the spectrometer (see footnote 13 in Section 7.10.5). Therefore, the
operator of the measured quantity represents only the actually detected transverse magnetization
(M1+ in our case). This limits coherences contributing to the signal to Ix, Iy (if nucleus
1 is detected). Only traces of their products with ˆM+ are not zero. The coherences Sx, Sy
can be converted to the ”measurable” operators Ix, Iy by a combination of J-coupling and
90◦
pulses.
• Product operators representing transverse polarizations correlated with longitudinal polarizations,
known as anti-phase single-quantum coherences (2IxSz, 2IySz if nucleus 1 is detected),
do not contribute to the measurable signal (traces of their products with ˆM1+ are equal to zero),
but they can evolve to the ”measurable” in-phase single-quantum coherences if the J-coupling
is present (without application of any radio-wave pulses).
• Conversion of the operators 2IzSx, 2IzSy to the single-quantum coherences of the measured
nucleus 1 requires evolution of the J-coupling and application of a 90◦
pulse (at the precession
frequency of nucleus 1).
• Product operators representing two2
correlated transverse polarizations (2IxSx, 2IySy, 2IxSy,
2IySx), known as multiple-quantum coherences, do not contribute to the measurable signal
(traces of their products with ˆM1+ are equal to zero), and can be converted to the ”measurable”
in-phase single quantum coherences only by applying a 90◦
pulse and by a subsequent action
of the J-coupling.
• Product operators representing longitudinal polarizations (Iz, Sz, 2IzSz), known as populations,
do not contribute to the measurable signal (traces of their products with ˆM1+ are equal
to zero), and can be converted to single quantum coherences only by applying a 90◦
pulse (Iz)
and, in the case of Sz and 2IzSz, by a subsequent action of the J-coupling.
• The product operator It never evolves to a measurable coherence because it commutes with
all Hamiltonians. It can be ignored right from the beginning.
2
In spin systems consisting of more than two coupled magnetic moments, product operators representing more
than two correlated transverse polarizations also belong to this category.
11.5. HSQC 283
Based on the arguments discussed above, all operators other than Ix, Iy, 2IxSz, 2IySz can
be ignored after the last 90 ◦
pulse applied at the frequency of the given nucleus.
11.5 HSQC
Heteronuclear Single-Quantum Correlation (HSQC) spectroscopy is a 2D experiment using J-coupling
to correlate frequencies of two magnetic moments with diﬀerent γ (Figure 11.3A). The experiment
consists of
• excitation pulse, usually applied at the proton frequency
• INEPT module, transferring polarization to the coupled nucleus (usually 15
N or 13
C)
• evolution period of incremented duration t1, introducing signal modulation by frequency of the
other nucleus
• another INEPT module, transferring polarization back to proton
• signal acquisition
We now analyze the evolution of the density matrix during the HSQC experiments using the
simpliﬁed approach described in Section 11.4.
• After a 90◦
pulse at the proton frequency, polarization is transferred to the other nucleus
(usually 15
N or 13
C). The density matrix at the end of the INEPT is
ˆρ(f) = 1
2
It − 1
2
κ1 (2IzSy) + 1
2
κ2Sy
• During an echo with a decoupling 180◦
pulse at the proton frequency (cyan pulse in Figure 11.3,
top), anti-phase single quantum coherences evolve according to the chemical shift
ˆρ(f) −→ ˆρ(g) = 1
2
It + 1
2
κ1 (c212IzSy − s212IzSx) + 1
2
κ2 (c21Sy − s21Sx).
The coeﬃcients c21 and s21, respectively, include the cos(Ω2t1) and sin(Ω2t1) factors, as described
in Section 9.2.
• Two 90◦
pulses convert 2IzSy to −2IySz and −2IzSx to 2IySx. The magenta operator
is a contribution to the density matrix which represents a multiple-quantum coherence, which
can be converted to a ”measurable” in-phase single quantum coherence only by applying a 90◦
pulse (and by a subsequent action of the J-coupling). Since our pulse sequence does not contain
any more 90◦
pulses, we ignore 2IySx. The 90◦
pulse applied at the precession frequency of
13
C or 15
N converts Sy to the longitudinal polarization Sz. The Sx is not aﬀected by the 90◦
pulses applied with the 0◦
(x) phase. As the pulse sequence does not contain any more 90◦
pulses, we can ignore the green terms. Also, we ignore the red term It which never evolves to
a measurable coherence because it commutes with all Hamiltonians. The density matrix can
be written as
ˆρ(h) = −1
2
κ1c212IySz+ unmeasurable contributions.
284
• The last echo allows the J-coupling to evolve but refocuses evolution due to the chemical
shift. If the delays τ = 1/4J, the measurable components of the density matrix evolve to
1
2
κ1 cos(Ω2t1)Ix (rotation ”about” 2IzSz by 90 ◦
and change of the sign by the last 180◦
pulse
applied at the proton frequency):
ˆρ(i) = 1
2
κ1c21Ix+ unmeasurable contributions
• During acquisition, both chemical shift and J-coupling evolve in the experiment depicted in
Figure 11.3A:
1
2
κ1c21Ix −→



+1
2
κ1c21c12 Ix −→
+1
2
κ1c21c12cJ Ix
+1
2
κ1c21c12sJ 2IySz
+1
2
κ1c21s12 Iy −→
+1
2
κ1c21s12cJ Iy
−1
2
κ1c21s12sJ 2IxSz
(11.10)
HSQC experiments are usually two-dimensional. The second dimension is introduced by repeating
the measurement with t1 being incremented. Moreover, each increment is measured twice with a
diﬀerent phase of one of the 90◦
radio-wave pulses applied to 13
C or 15
N (labeled in Figure 11.3 by
writing x/y above the pulse, do not confuse with the label x, y in Figure 11.1 that indicates phase
cycling, i.e. storing a single record obtained by adding or subtracting data acquired with a diﬀerent
phase). In the records acquired with the phase shifted by 90◦
(y), the pulses inﬂuence the density
matrix as follows:
• The density matrix at the end of the ﬁrst INEPT applied with the 90◦ 13
C (or 15
N) pulse shifted
by 90◦
(y) is
ˆρ(f) = 1
2
It + 1
2
κ1 (2IzSx) − 1
2
κ2Sx
• During the echo with a decoupling 180◦
pulse at the proton frequency (cyan pulse in Figure 11.3,
top), anti-phase single quantum coherences evolve according to the chemical shift
ˆρ(f) −→ ˆρ(g) = 1
2
It − 1
2
κ1 (c212IzSx + s212IzSy) − 1
2
κ2 (c21Sx + s21Sy).
• Two 90◦
pulses convert −2IzSx to −2IySz and −2IzSy to −2IySy. The 90◦
pulse applied
at the precession frequency of 13
C or 15
N with a phase shift of 90 ◦
(y) converts −Sx to Sz and
leaves −Sy untouched. As discussed above, only 2IySz evolves to a measurable coherence:
ˆρ(h) = −1
2
κ1s212IySz+ unmeasurable contributions
The density matrix then evolves as described above for the records acquired with the phase 0 ◦
(x), the only diﬀerence is the factor s21 instead of c21:
1
2
κ1s21Ix −→



+1
2
κ1s21c12 Ix −→
+1
2
κ1s21c12cJ Ix
+1
2
κ1s21c12sJ 2IySz
+1
2
κ1s21s12 Iy −→
+1
2
κ1s21s12cJ Iy
−1
2
κ1s21s12sJ 2IxSz
(11.11)
11.6. DECOUPLING TRAINS 285
The subsequent records acquired with the 0 ◦
(x) and 90 ◦
(y) phases of the 90◦ 13
C or 15
N pulse
are stored as real (modulated by c21 = cos(Ω2t1)) and imaginary (modulated by s21 = sin(Ω2t1))
component of a complex signal, respectively, like in the NOESY experiment.
As described in Section 10.5, we continue by calculating the trace of ˆρ(t2) ˆM1+ and including
relaxation (with diﬀerent rates R2,1 and R2,2 in the direct and indirect dimensions,3
respectively).
The result shows that the expected value of M1+ evolves as
M1+ =
Nγ1
2 2
B0
8kBT
e−R2,2t1
e−R2,1t2
cos(Ω2t1) ei(Ω1−πJ)t2
+ ei(Ω1+πJ)t2
(11.12)
for the 0 ◦
(x) phase of the 90◦ 13
C or 15
N pulse, and as
M1+ =
Nγ1
2 2
B0
8kBT
e−R2,2t1
e−R2,1t2
sin(Ω2t1) ei(Ω1−πJ)t2
+ ei(Ω1+πJ)t2
(11.13)
for the 90 ◦
(y) phase of the 90◦ 13
C or 15
N pulse.
The last step is conversion of the data to a two-dimensional spectrum, described already in
Section 9.5.1. We ﬁrst perform the Fourier transformation in t2 (for each t1 and each phase (0◦
and 90◦
) of the 90◦ 13
C or 15
N pulse). Then we take the real parts of the (correctly phased) onedimensional
spectra and combine them in an array containing two 1D spectra per each t1 value,
one with the c21 modulation and the other one with the s21 modulation. We combine c21 and s21
as c21 + is21 = eiΩ2t1
and perform the Fourier transformation in t1. The real part of the obtained
spectrum (real in both dimensions) is
{Y (ω)} =
Nγ1
2 2
B0
8kBT
R
2
2,2
R
2
2,1 + (ω − Ω1)2
R
2
2,1
R
2
2,1 + (ω − Ω2 + πJ)2
+
R
2
2,1
R
2
2,1 + (ω − Ω2 − πJ)2
(11.14)
11.6 Decoupling trains
If we perform the experiments as depicted in Figure 11.3A and analyzed above, we obtain a 2D
spectrum with peaks at the frequency oﬀset Ω2 in the indirect dimension and a doublet at Ω1 ± πJ
in the direct (proton) dimension (Figure 11.4). Note that the splitting by ±πJ was removed by
the cyan decoupling pulse in the indirect dimension. Splitting of peaks in the direct dimension is
undesirable, but the remedy is not simple. We acquire signal in real time and cannot remove the
splitting by a decoupling echo. In principle, we can divide the acquisition time into short fragments
and apply a 180◦
pulse at the frequency of 13
C (or 15
N) in the middle of each such echo (green
3
The relaxation rates diﬀer because single-quantum coherences of 13
C or 15
N evolve during t1, whereas proton
single-quantum coherences evolve during t2. Moreover, the single-quantum coherences oscillate between in-phase and
anti-phase terms during t1 and t2, and the relaxation rates of in-phase and anti-phase single-quantum coherences diﬀer
as described in Section 10.5. The actually observed relaxation rates R2,1 and R2,2 are averages of the in-phase and
anti-phase values, despite the fact that (i) the density matrix is purely anti-phase (consisting of 2IzSx and 2IzSy
operators) at the end of t1 (due to the presence of the cyan decoupling pulse) and that (ii) only the in-phase (Ix and
Iy) coherence contributes to the signal in t2.
286
A
/
4J
1
4J
1
t2
1
4J
1
4J
t1
H1
N15
C13
or
x y
y
ihgf
B
13
1
4J
1
4J
N15
C
2
or
t1
H1
x y/
4J
1
4J
t
1y
C
1
4J
N15
C13
or
t1
H1
2φ
2φ
4J
1
4J
1
t2
1
4J
φ
signal multiplied by: 1,−1,−1,1,1,−1,−1,1
y
GARP
3φ
=y,−y,y,−y,y,−y,y,−yφ1
=x,−x,x,−x,x,−x,x,−xφ1 =x,x,−x,−x,x,x,−x,−x
φ3=x,x,x,x,−x,−x,−x,−x
/ /1φ φ11φ1
Figure 11.3: HSQC experiment. A, basic HSQC pulse sequence. B, general idea of the decoupling in the direct
dimension. C, Standard presentation of the HSQC pulse sequence with decoupling in the direct dimension and phase
cycling. The decoupling pulse applied to proton and to 13
C (or 15
N) are shown in cyan and green, respectively. The
label x/y indicates repeated acquisition with the phase of the given pulse set ﬁrst to 0 ◦
(x) and than to 90◦
(y), in
order to obtain a cosine-modulated and sine-modulated 1D records for each t1 increment. In panel C, the pulses with
cycled phases are labeled φ1, φ2, φ3, and the actual phases during the cycles are listed below the sequence. In order
to add signals with the same signs, the individual signals acquired during the phase cycles are multiplied by the +1
or −1 factors as indicated below the sequence. Other symbols are used as explained in Figure 11.1.
11.7. SIGNAL SUMMATION AND ARRAYING IN 2D SPECTROSCOPY 287
pulses in Figure 11.3B). In practice, imperfections of such a long series of echoes, aﬀecting especially
magnetic moments with large Ω2, are signiﬁcant. However, more sophisticated series of pulses have
much better performance. Typical examples of decoupling pulse sequences are
• WALTZ - a series of 90◦
, 180◦
, and 270◦
pulses with phase of 0 ◦
(x), or 180 ◦
(−x), repeating
in complex patterns
• DIPSI - a similar series of pulses with non-integer rotation angles
• GARP - computer-optimized sequence of pulses with non-integer rotation angles and phases.
In the schematic drawings of pulse sequences, the decoupling (and other) trains of many pulses
are depicted as rectangles with abbreviations of the used sequences (Figure 11.3C).
11.7 Signal summation and arraying in 2D spectroscopy
Phases of pulses during NMR experiments really run in practice alternate in order to suppress
unwanted signals. The unwanted signals may be due to magnetic moments really present in the
sample (of water protons, of 13
C or 15
N nuclei without protons attached) or various artifacts. An
example of an HSQC sequence including such phase cycling is presented in Figure 11.3C. When
phase cycling is applied, the experiments are repeated for each increment of t1 with diﬀerent phases
of some radio-wave pulses, and the acquired signals are combined. Repeating the measurements
of course extends the overall time of the experiments. However, this drawback is not as serious
as it may appear. In many cases, the sensitivity of the measurement requires to sum results of
several measurements anyway, in order to achieve a suﬃcient signal-to-noise ratio. Usually, the
signals recorded with individual settings of the phases are not stored separately but directly added
to the data acquired with the preceding phase setting. Phase of the desired signal acquired with
diﬀerent phases of the pulses may vary. Therefore, the phase of the acquired signal has to be adjusted
before it is added to the sum of the signals recorded in the previous runs. In the example shown in
Figure 11.3C, the phases of signal acquired in the second, third, sixth, and seventh runs are shifted
by 180 ◦
, which corresponds to changing the sign of the signal. Therefore, the signals detected in the
aforementioned runs are multiplied by −1 before they are added to the overall signal.
When the phase cycling is applied in a two-dimensional experiment, as discussed in this section,
we should carefully distinguish diﬀerent purposes of repeating the data acquisition:
• In order to apply phase cycling and to improve signal-to-noise ratio, the signal is acquired
repeatedly with various phases of certain radio-wave pulses. The individual signals are called
transients or scans in the NMR literature. Usually, transients (scans) are not stored separately,
but combined (summed after necessary phase adjustment). Quadrature receivers of standard
NMR spectrometers supply two output signals with phases shifted by 90 ◦
, therefore each
transient represents a complex signal with real and imaginary component.
• In order to introduce the second dimension, the signal is acquired repeatedly with increasing
(or decreasing) length of the delay t1. The individual signals are called increments in the NMR
288
Ω1 Ω1ω2 ω2
2πJ
ω1
Ω2
Figure 11.4: HSQC spectrum of a 1
H–13
C (or 1
H–15
N) pair. The two-dimensional peaks are displayed as contour
plots. Frequency oﬀsets of the proton and 13
C (or 15
N) are Ω1 and Ω2, respectively. The left spectrum was obtained
using the pulse sequence shown in Figure 11.3A, the right spectrum was acquired with the decoupling applied in the
direct dimension (Figure 11.3B,C).
literature. The increments are stored separately as an array of one-dimensional data (data
matrix). Each increment is stored as two signals (sums of transients) with phases shifted by
90 ◦
, providing (after appropriate phase correction) real and imaginary component of the data
in the direct dimension.
• In order to achieve frequency discrimination in the indirect dimension (using the StatesHaberkorn-Ruben
method described in Section 9.5.1), each increment is recorded twice with a
diﬀerent phase of a certain pulse. The individual increments are called cosine modulated and
sine modulated in the NMR literature. The States-Haberkorn-Ruben method of frequency discrimination
is based on changing a pulse phase (like phase cycling), but (unlike phase cycling)
data collected with diﬀerent phases are stored separately. The real components of the cosineand
sine-modulated increments provide the real and imaginary component of the data in the
indirect dimension as described in Section 9.5.1.
11.8 Beneﬁts of HSQC
At the and of the discussion of the HSQC experiment, we summarize the advantages of recording a
2D HSQC spectrum instead of 1D proton and 13
C or 15
N spectra.
• 13
C or 15
N frequency is measured with high sensitivity (higher by |γ1/γ2|5/2
than provided by
the direct detection, cf. Section 7.10.5).
11.9. COSY 289
• Expansion to the second dimension and reducing the number of peaks in spectrum (only 13
C or
15
N-bonded protons and only protonated 13
C or 15
N nuclei are visible) provides high resolution.
• 1
H-13
C and 1
H-15
N correlation is important structural information (it tells us which proton is
attached to which 13
C or 15
N).
11.9 COSY
We started the discussion of experiments based on J-couplings with heteronuclear correlations because
they are easier to analyze. The basic (and very popular) homonuclear experiment is COSY
(COrrelated SpectroscopY). Its pulse sequence is very simple, consisting of only two 90◦
pulses separated
by an incremented delay t1 (which provides the second dimension), but the evolution of the
density matrix is relatively complex. Here, we analyze evolution for a pair of interacting nuclei
(protons). During t2 (signal acquisition), we discuss only the components of the density matrix that
contribute to the measurable signal. The complete analysis is summarized in Table 11.1.
• ˆρ(a) = 1
2
It + 1
2
κ(I1z + I2z)
thermal equilibrium, the matrices are diﬀerent than for the non-interacting spin, but the constant
is the same.
• ˆρ(b) = 1
2
It + 1
2
κ(−I1y − I2y)
90◦
pulse, see the one-pulse experiment
• ˆρ(c) = 1
2
It
+1
2
κ(−c11cJ1I1y + s11cJ1I1x + c11sJ12I1xI2z + s11sJ12I1yI2z)
+1
2
κ(−c21cJ1I2y + s21cJ1I2x + c21sJ12I1zI2x + s21sJ12I1zI2y),
where c11 = cos(Ω1t1), s11 = sin(Ω1t1), c21 = cos(Ω2t1), s21 = sin(Ω2t1), cJ1 = cos(πJt1), and
sJ1 = sin(πJt1) – evolution of the chemical shift and coupling.
• The second 90◦
pulse creates the following coherences
ˆρ(d) = 1
2
It
+1
2
κ(−c11cJ1I1z+ s11cJ1I1x −c11sJ12I1xI2y− s11sJ12I1zI2y )
t1
t2
x y/
cba d
Figure 11.5: COSY pulse sequence. The rectangles represent 90◦
radio wave pulses applied at a frequency suﬃciently
close to the precession frequencies of both interacting magnetic moments.
290
+1
2
κ(−c21cJ1I2z+ s21cJ1I2x −c21sJ12I1yI2x− s21sJ12I1yI2z ).
The red terms contain population operators, not coherences, they do not contribute to the
signal. The green terms contain in-phase single-quantum coherences, only they give non-zero
trace when multiplied with ˆM+ ∝ (I1x +iI1y +I2x +iI2y). The blue terms contain anti-phase
single-quantum coherences, they do not contribute to the signal directly, but they evolve into
in-phase coherences during acquisition due to the J- coupling. The magenta terms contain
multiple-quantum coherences. They do not contribute to the signal, but can be converted to
single-quantum coherences by 90◦
pulses.4
Such pulses are not applied in the discussed pulse
sequence, but are used in some versions of the experiment.
• The terms in black frames evolve with the chemical shift of the ﬁrst nucleus during acquisition:
s11cJ1I1x → s11cJ1c12cJ2I1x + s11cJ1s12cJ2I1y+ unmeasurable anti-phase coherences
−s21sJ12I1yI2z → s21sJ1c12sJ2I1x + s21sJ1s12sJ2I1y+ unmeasurable anti-phase coherences ,
where cn2 = cos(Ωnt2), sn2 = sin(Ωnt2), cJ2 = cos(πJt2), and sJ2 = sin(πJt2). Using the
following trigonometric relations
cnkcJk =
c−
nk + c+
nk
2
snksJk =
c−
nk − c+
nk
2
cnksJk =
−s−
nk + s+
nk
2
snkcJk =
s−
nk + s+
nk
2
, (11.15)
where c±
nk = cos((Ωn ± πJ)tk) and s±
nk = sin((Ωn ± πJ)tk), the terms contributing to the signal
can be written as
κ
8

(s−
11 + s+
11)(c−
12 + c+
12)
[Ω1,Ω1]
+ (c−
21 − c+
21)(−s−
12 + s+
12)
[Ω2,Ω1]

 I1x + κ
8

(s−
11 + s+
11)(s−
12 + s+
12)
[Ω1,Ω1]
+ (c−
21 − c+
21)(c−
12 − c+
12)
[Ω2,Ω1]

 I1y
The ﬁrst and second line show coherences providing the real and imaginary component of the
complex signal acquired in the direct dimension (t2).
• Evaluation of the traces of ˆM+ ˆρ(t2) gives the following modulation of the signal:
(s−
11 + s+
11) ei(Ω1−πJ)t2
+ ei(Ω1+πJ)t2
[Ω1,Ω1]
+ i(c−
21 − c+
21) ei(Ω1−πJ)t2
− ei(Ω1+πJ)t2
[Ω2,Ω1]
.
The imaginary unit in front of the blue term can be written as eiπ/2
. Phase correction of the
whole signal by −π/2 (mathematically equivalent to multiplication by −i results in
−i (s−
11 + s+
11) ei(Ω1−πJ)t2
+ ei(Ω1+πJ)t2
[Ω1,Ω1]
+ (c−
21 − c+
21) ei(Ω1−πJ)t2
− ei(Ω1+πJ)t2
[Ω2,Ω1]
. (11.16)
Note that the phase correction cannot remove the phase shift of π/2 from the whole signal:
either green or blue part is always multiplied by the imaginary unit (equivalent to eiπ/2
). As a
4
We have not analyzed evolution of the multiple quantum coherences so far. To do it, it is suﬃcient (i) to recognize
that multiple quantum coherences commute with 2I1zI2z (therefore they are not inﬂuenced by the weak J-coupling),
and (ii) to analyze ”rotation” of individual constituents of the product operators (e.g. of I1x and I2y) ”about” Inj
individually and calculate the product of the results of the rotation.
11.9. COSY 291
consequence, the blue and green parts of the signal result in peaks of diﬀerent shapes (one of
a convenient absorption shape and the other one of an undesirable dispersion shape).
The imaginary signal in the indirect dimension is obtained by repeating acquisition for each
increment of t1 with a diﬀerent phase of the second 90 ◦
pulse (shifted by 90 ◦
, which corresponds to
the direction y in the rotating coordinate system).
• The second 90◦
pulse with the y phase creates the following coherences
ˆρ(d) = 1
2
It
+1
2
κ(− c11cJ1I1y −s11cJ1I1z− c11sJ12I1zI2x +s11sJ12I1yI2x)
+1
2
κ(− c21cJ1I2y −s21cJ1I2z− c21sJ12I1xI2z +s21sJ12I1xI2y).
• The terms in black frames evolve with the chemical shift of the ﬁrst nucleus during acquisition:
−c11cJ1I1y → c11cJ1s12cJ2I1x − c11cJ1c12cJ2I1y+ unmeasurable anti-phase coherences
−s21sJ12I1xI2z → c21sJ1s12sJ2I1x − c21sJ1c12sJ2I1y+ unmeasurable anti-phase coherences .
The terms contributing to the signal can be written as
(c−
11 + c+
11)(s−
12 + s+
12)
[Ω1,Ω1]
+ (−s−
21 − s+
21)(c−
12 − c+
12)
[Ω2,Ω1]

 I1x −

(c−
11 + c+
11)(c−
12 + c+
12)
[Ω1,Ω1]
+ (−s−
21 + s+
21)(−s−
12 + s+
12)
[Ω2,Ω1]

 I1y.
• Evaluation of the traces of ˆM + ˆρ(t2) gives the following modulation of the signal:
−i(c−
11 + c+
11) ei(Ω1−πJ)t2
+ ei(Ω1+πJ)t2
[Ω1,Ω1]
− (s−
21 − s+
21) ei(Ω1−πJ)t2
− ei(Ω1+πJ)t2
[Ω2,Ω1]
Again, the green part of the signal is shifted by π/2 from the blue one (multiplied by −i), and
this diﬀerence in phases cannot be removed by any phase correction. If we apply the same
phase correction as for the experiment with the x phase of the second pulse, we obtain
− (c−
11 + c+
11) ei(Ω1−πJ)t2
+ ei(Ω1+πJ)t2
[Ω1,Ω1]
+ i(s−
21 − s+
21) ei(Ω1−πJ)t2
− ei(Ω1+πJ)t2
[Ω2,Ω1]
(11.17)
• Now we combine signals obtained with the diﬀerent phases of the second pulse. Fourier transformation
(with respect to the real time course during measurement, labeled t2 here) of the signal
obtained with phase x (Eq. 11.16) after introducing relaxation yields a series of one-dimensional
spectra with the following shape:
− i(s−
11 + s+
11)
R2,1
R
2
2,1 + (Ω1 − πJ − ω2)2
+
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
+
i(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
+
i(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω1,Ω1]
+ (c−
21 − c+
21)
R2,1
R
2
2,1 + (Ω1 − πJ − ω2)2
−
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
+
i(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
−
i(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω2,Ω1]
.
(11.18)
292
Separating the real and imaginary parts provides
+ (s−
11 + s+
11)
(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
+
(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
− i
R2,1
R
2
2,1 + (Ω1 − πJ − ω2)2
+
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω1,Ω1]
+ (c−
21 − c+
21)
R2,1
R
2
2,1 + (Ω1 − πJ − ω2)2
−
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
+ i
(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
−
(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω2,Ω1]
.
(11.19)
In a similar manner, Fourier transformation of the signal obtained with the phase y (Eq. 11.17)
provides a series of one-dimensional spectra with the following shape:
− (c−
11 + c+
11)
R2,1
R
2
2,1 + (Ω1 − πJ − ω2)2
+
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
+
i(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
+
i(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω1,Ω1]
+ i(s−
21 − s+
21)
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
−
R2,1
R
2
2,1 + (Ω1 − πJ + ω2)2
+
i(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
−
i(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω2,Ω1]
.
(11.20)
We wish to use the signal obtained with the phase y as an imaginary component. Therefore,
we factor out the imaginary unit and then separate the real and imaginary parts
− i (c−
11 + c+
11)
(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
+
(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
− i
R2,1
R
2
2,1 + (Ω1 − πJ − ω2)2
+
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω1,Ω1]
+ i (s−
21 − s+
21)
R2,1
R
2
2,1 + (Ω1 − πJ − ω2)2
−
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
+ i
(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
−
(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω2,Ω1]
.
(11.21)
As described in Section 9.5.1, the hypercomplex signal is obtained by (i) discarding the imaginary
parts and (ii) processing the signals recorded with the x and y phases of the second pulse
as real and imaginary components
11.9. COSY 293
+ (s−
11 + s+
11) + i(c−
11 + c+
11)
(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
+
(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω1,Ω1]
+ (c−
21 − c+
21) + i(s−
21 − s+
21)
R2,1
R
2
2,1 + (Ω1 − πJ − ω2)2
−
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω2,Ω1]
.
(11.22)
When we express the obtained complex numbers in the exponential form and write the −i
multiplying the green terms as e−iπ/2
,
+ e−i π
2 ei(Ω1−πJ)t1 + ei(Ω1+πJ)t1
(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
+
(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω1,Ω1]
+ ei(Ω2−πJ)t1 − ei(Ω2+πJ)t1
R2,1
R
2
2,1 + (Ω1 − πJ − ω2)2
−
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω2,Ω1]
,
(11.23)
we see that the green and blue parts diﬀer in phase also in the indirect t1 dimension. After
the second Fourier transformation, we obtain a two-dimensional spectrum with intensity of the
real part proportional to
+ Nγ2 2B0
16kBT
(Ω1 − πJ − ω1)
R
2
2,1 + (Ω1 − πJ − ω1)2
+
(Ω1 + πJ − ω1)
R
2
2,1 + (Ω1 + πJ − ω1)2
(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
+
(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω1,Ω1]
+ Nγ2 2B0
16kBT
R2,2
R
2
2,2 + (Ω2 − πJ − ω1)2
−
R2,2
R
2
2,2 + (Ω2 + πJ − ω1)2
R2,1
R
2
2,1 + (Ω1 − πJ − ω2)2
−
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω2,Ω1]
,
(11.24)
where R2,1 and R2,2 are average relaxation rates of proton 1 and 2, respectively (ignoring the
diﬀerences in relaxation of diﬀerent coherences).
• The green component of the signal evolves with the same chemical shift in both dimensions,
providing diagonal signal (at frequencies [Ω1, Ω1] in the 2D spectrum). The blue (originally
anti-phase) component of the signal also evolves with Ω1 in the direct dimension (t2), but with
Ω2 in the indirect dimension (t1). It provides oﬀ-diagonal signal, a cross-peak at frequencies
294
[Ω2, Ω1] in the 2D spectrum. As the phases of the blue and green components diﬀer by 90 ◦
,
either diagonal peaks or cross-peaks have the undesirable dispersion shape (it is not possible to
phase both diagonal peaks or cross-peaks, they always have phases diﬀering by 90 ◦
, even when
the spectrum is processed correctly following the protocol described in Section 9.5.1). Typically,
the spectrum is phased so that the cross-peaks have a nice absorptive shape corresponding to
Eq. 11.26 (see Figure 11.6) because they carry a useful chemical information: they show which
protons are connected by 2 or 3 covalent bonds.
• The diagonal peaks are not interesting, but their dispersive shape may obscure cross-peaks
close to the diagonal. The problem with the phase can be solved if one more 90 ◦
pulse is
introduced. Such a pulse converts the magenta multiple-quantum coherences to anti-phase
single-quantum coherences, which evolve into the measurable signal. The point is that other
coherences can be removed by phase cycling. The obtained spectrum contains diagonal peaks
and cross-peaks, but (in contrast to the simple two-pulse variant of the COSY experiment)
both diagonal peaks and cross-peaks have the same phase.5
This version of the experiment,
known as double-quantum ﬁltered COSY (DQF-COSY), is analyzed in Section 11.10.2. Its
disadvantage is a lower sensitivity – we lose a half of the signal.
• Also, note that each peak is split into doublets in both dimensions. More complex multiplets
are obtained if more than two nuclei are coupled. The distance of peaks in the multiplets is
given by the interaction constant J. In the case of nuclei connected by three bonds, J depends
on the torsion angle deﬁned by these three bonds. So, COSY spectra can be used to determine
torsion angles in the molecule.
• The terms in cyan frames evolve with the chemical shift of the second nucleus during acquisition
as
s21cJ1I2x → s21cJ1c12cJ2I2x + s21cJ1s12cJ2I2y+ unmeasurable anti-phase coherences
−s11sJ12I1zI2y → s11sJ1c12sJ2I2x + s11sJ1s12sJ2I2y+ unmeasurable anti-phase coherences
and give a similar type of signal for the other nucleus:
+ Nγ2 2B0
16kBT
(Ω2 − πJ − ω1)
R
2
2,2 + (Ω2 + πJ − ω1)2
+
(Ω2 + πJ − ω1)
R
2
2,2 + (Ω2 − πJ − ω1)2
(Ω2 − πJ − ω2)
R
2
2,2 + (Ω2 + πJ − ω2)2
+
(Ω2 + πJ − ω2)
R
2
2,2 + (Ω2 − πJ − ω2)2
[Ω1,Ω1]
+ Nγ2 2B0
16kBT
R2,1
R
2
2,1 + (Ω1 + πJ − ω1)2
−
R2,1
R
2
2,1 + (Ω1 − πJ − ω1)2
R2,2
R
2
2,2 + (Ω2 + πJ − ω2)2
−
R2,2
R
2
2,2 + (Ω2 − πJ − ω2)2
[Ω2,Ω1]
,
(11.25)
This signal represents the other diagonal and oﬀ-diagonal peak in the spectrum.
5
We cannot use phase cycling to remove the green terms resulting in the unwanted diagonal peaks because phase
cycling can distinguish multiple-quantum coherences from single-quantum ones, but it cannot distinguish anti-phase
single quantum coherences from in-phase single quantum coherences.
11.9. COSY 295
Table 11.1: Evolution of the density matrix during the COSY experiment. Modulations of the density matrix
components (omitting the κ/2 factor and the It component) having the origin in I1z and I2z are shown in black
and cyan, respectively. The product operators are color-coded as in the text.
Real in t1:
ˆρ(a) ˆρ(b) ˆρ(c) ˆρ(d) ˆρ(t2) Tr{ˆρ(t2)(I1x + I2x + iI1y + iI2y)}
I1z +1 0 0 −c11cJ1 +c11cJ1 0
I1x 0 +s11cJ1 +s11cJ1 +s11cJ1c12cJ2 + s21sJ1c12sJ2 +s11cJ1c12cJ2 + s21sJ1c12sJ2
I1y −1 −c11cJ1 0 +s11cJ1s12cJ2 + s21sJ1s12sJ2 i(+s11cJ1s12cJ2 + s21sJ1s12sJ2)
2I1yI2z +s11sJ1 −s21sJ1 +s11cJ1c12sJ2 − s21sJ1c12cJ2 0
2I1xI2z +c11sJ1 0 −s11cJ1s12sJ2 + s21sJ1s12cJ2 0
2I1xI2y −c11sJ1 −c11sJ1c12c22 + c21sJ1s12s22 0
2I1xI2x +c11sJ1c12s22 + c21sJ1s12c22 0
2I1yI2y −c21sJ1c12s22 − c11sJ1s12c22 0
2I1yI2x −c21sJ1 −c21sJ1c12c22 + c11sJ1s12s22 0
2I1zI2x +c21sJ1 0 −s21cJ1s22sJ2 + s11sJ1s22cJ2 0
2I1zI2y +s21sJ1 −s11sJ1 +s21cJ1c22sJ2 − s11sJ1c22cJ2 0
I2y −1 −c21cJ1 0 +s21cJ1s22cJ2 + s11sJ1s22sJ2 i(+s21cJ1s22cJ2 + s11sJ1s22sJ2)
I2x 0 +s21cJ1 +s21cJ1 +s21cJ1c22cJ2 + s11sJ1c22sJ2 +s21cJ1c22cJ2 + s11sJ1c22sJ2
I2z +1 0 0 −c21cJ1 +c21cJ1 0
Imaginary in t1:
ˆρ(a) ˆρ(b) ˆρ(c) ˆρ(d) ˆρ(t2)
I1z +1 0 0 −s11cJ1 +s11cJ1 0
I1x 0 +s11cJ1 0 +c11cJ1s12cJ2 + c21sJ1s12sJ2 +c11cJ1s12cJ2 + c21sJ1s12sJ2
I1y −1 −c11cJ1 −c11cJ1 −c11cJ1c12cJ2 − c21sJ1c12sJ2 i(−c11cJ1c12cJ2 − c21sJ1c12sJ2)
2I1yI2z +s11sJ1 0 +c11cJ1s12sJ2 − c21sJ1s12cJ2 0
2I1xI2z +c11sJ1 −c21sJ1 +c11cJ1c12sJ2 − c21sJ1c12cJ2 0
2I1xI2y +s11sJ1 +s11sJ1c12c22 + c21sJ1s12s22 0
2I1xI2x −s11sJ1c12s22 − s21sJ1s12c22 0
2I1yI2y +s21sJ1c12s22 + s11sJ1s12c22 0
2I1yI2x +s21sJ1 +s21sJ1c12c22 + c11sJ1s12s22 0
2I1zI2x +c21sJ1 −c11sJ1 +c21cJ1c22sJ2 − c11sJ1c22cJ2 0
2I1zI2y +s21sJ1 0 +c21cJ1s22sJ2 − c11sJ1s22cJ2 0
I2y −1 −c21cJ1 −c21cJ1 −c21cJ1c22cJ2 − c11sJ1c22sJ2 i(−c21cJ1c22cJ2 − c11sJ1c22sJ2)
I2x 0 +s21cJ1 0 +c21cJ1s22cJ2 + c11sJ1s22sJ2 +c21cJ1s22cJ2 + c11sJ1s22sJ2
I2z +1 0 0 −s21cJ1 +s21cJ1 0
296
ω1
ω2
Ω1
Ω1
Ω2
Ω2
ω1
ω2
Ω1
Ω1
Ω2
Ω2
Ω1
2πJ
Ω2
2πJ
{Y(ω2)}
ω2
Ω1
2πJ
Ω2
2πJ
{Y(ω2)}
ω2
Figure 11.6: COSY spectrum of a 1
H–1
H pair. The two-dimensional peaks are displayed as contour plots, the
positive and negative contours are shown in blue and red, respectively. A one-dimensional slice taken from the 2D
spectrum at the position indicated by the green line is displayed below the 2D plot. Frequency oﬀsets of the protons
are Ω1 and Ω2. The left spectra were obtained by the pulse sequence displayed in Figure 11.5, the right spectra by
the pulse sequence DQF-COSY.
11.9. COSY 297
HOMEWORK
Analyze the COSY experiment (Section 11.9).
298
11.10 SUPPORTING INFORMATION
11.10.1 APT
The attached proton test (APT) is useful for analysis of systems with multiple protons, most often CHn (C, CH, CH2, CH3). The
experiment consists of 13C excitation, simultaneous echo (discussed in Section 10.9), and 13C acquisition with proton decoupling. In the
following analysis, the 13C operators are labeled Sx, Sy, Sz, and relaxation is ignored for the sake of simplicity.
• ˆρ(a) = 1
2n It + κ1
2n
n
k=1
(Ikz) + κ2
2n Sz
The probability density matrix at equilibrium is described in a similar manner as for one or two magnetic moments, the extension
to the multinuclear system is reﬂected by the scaling constant 1/2n, where n is the number of protons attached to 13C.
• ˆρ(b) = 1
2n It + κ1
2n
n
k=1
(Ikz) − κ2
2n Sy
Excitation of 13C is an analogy of cases discussed above.
• Understanding the next step is critical for the analysis. The general conclusions of Section 10.9 apply, but the actual form of
the density matrix must be derived for each system. The general conclusions are: evolution of Ω2 (13C frequency oﬀset) due tho
the 13C chemical shift is refocused, J-coupling evolves for 2τ as cos(2πJτ) and sin(2πJτ), nucleus 1 (proton) is never excited (no
proton 90◦ pulse), therefore only Ikz contributions are present for protons.
• The actual analysis for 13CH2 and 13CH3 groups requires extension of the density matrix to 2n+1 ×2n+1 dimensions. Construction
of the basis matrices for such 4n+1-dimensional operator space involves additional direct products with the matrices It, Ix, Iy,
Iz. Evolution of the 2n+1 × 2n+1 matrices is governed by their commutation rules, three-dimensional subspaces where ”rotations”
of operators take place are deﬁned by these commutation rules (Eqs. 8.29–8.31).
• When the rules are applied, the analysis gives
ˆρ(e) = 1
2n It + κ1
2n
n
k=1
(Ikz) + κ2
2n



n = 0 : Sy
n = 1 : cSy − s2I1zSx
n = 2 : c2Sy − sc(2I1zSx + 2I2zSx) − s24I1zI2zSy
n = 3 : c3Sy − sc2(2I1zSx + 2I2zSx + 2I3zSx)
−s2c(4I1zI2zSy + 4I1zI3zSy + 4I2zI3zSy)
+s38I1zI2zI3zSx
where s = sin(2πJτ) and c = cos(2πJτ).
• Since decoupling is applied during acquisition, only the Sy coherences give a measurable signal. Note that the fact that the proton
decoupling is used tells us in advance that the terms containing Ikz need not be analyzed. Therefore the knowledge of exact
commutation rules is not necessary, the only important conclusion is that the observable contributions to the density matrix are
modulated by cosn(2πJτ) for CHn. During acquisition, these terms evolve under the inﬂuence of chemical shift, exactly like in a
one-pulse experiment. If τ is set to τ = 2J, then c = cos π = −1. Therefore, signals of C and CH2 are positive and signals of CH
and CH3 are negative ⇒ useful chemical information.
11.10.2 Double-quantum ﬁltered COSY
The double-quantum ﬁltered variant of the COSY experiment (DQF-COSY) provides spectra with the same phases of diagonal peaks
and cross peaks. The modiﬁcation of the experiment consists of (i) adding a third 90◦ pulse and (ii) phase cycle of the ﬁrst two pulses
(Figure 11.7). In DQF-COSY, the initial density matrix
ˆρ(a) = 1
2
(It + κI1z + κI2z)
evolves as shown in Table 11.2. The experiment is repeated for times with diﬀerent phases of the radio waves (see values φ1 and φ2
in Table 11.2). As the consecutive measured records are subtracted before storing the data (indicated by the multiplying factor m in
Table 11.2), contribution of all coherences are canceled except for the multiple-quantum terms 2I1xI1y and 2I1yI1x. It is therefore
suﬃcient to analyze only the following component of ˆρ(d):
ˆρ(d) = κ
2
( 1
2
(c11sJ1+c21sJ1)2I1xI1y + 1
2
(c11sJ1+c21sJ1)2I1yI1x)
11.10. SUPPORTING INFORMATION 299
t2
t1
2φ1φ
cba d e
Figure 11.7: DQF-COSY pulse sequence. The rectangles represent 90◦ radio wave pulses applied at a frequency suﬃciently close
to the precession frequencies of both interacting magnetic moments. The symbols φ1 and φ2 represent a phase cycle x, y, −x, −y. The
hypercomplex spectrum is obtained by repeating the measurement for each t1 increment with φ2 advanced by 90 ◦ (see Table 11.2).
Table 11.2: Evolution of the density matrix during DQF-COSY. Modulations of the density matrix components (omitting the κ/2
factor and the It component) having the origin in I1z and I2z are shown in black and cyan, respectively. The lines labeled φ1 and φ2
show phase cycles of the ﬁrst two radio-wave pulses (cf. Figure 11.7). The multiplier m indicates whether the data recorded with the
given phases are stored as positive (+) or negative (−) numbers. The modulations after the second pulse averaged by the phase cycle are
presented in the last column labeled 1
4
. The product operators are color-coded as in Section 11.9.
Real in t1:
ˆρ(a) ˆρ(b) ˆρ(c) ˆρ(d) mˆρ(d8)
φ1: +x +y −x −y +x +y −x −y +x +y −x −y +x +y −x −y
φ2: +x +y −x −y +x +y −x −y 1
4
m: − + − +
I1z +1 0 0 0 0 0 0 0 0 −c11cJ1 −c11cJ1 −c11cJ1 −c11cJ1 +c11cJ1 −c11cJ1 +c11cJ1 −c11cJ1 0
I1x 0 1 0 −1 +s11cJ1 +c11cJ1 −s11cJ1 −c11cJ1 +s11cJ1 0 −s11cJ1 0 −s11cJ1 0 +s11cJ1 0 0
I1y −1 0 1 0 −c11cJ1 +s11cJ1 +c11cJ1 −s11cJ1 0 +s11cJ1 0 −s11cJ1 0 +s11cJ1 0 −s11cJ1 0
2I1yI2z +s11sJ1 +c11sJ1 −s11sJ1 −c11sJ1 −s21sJ1 0 +s21sJ1 0 +s21sJ1 0 −s21sJ1 0 0
2I1xI2z +c11sJ1 −s11sJ1 −c11sJ1 +s11sJ1 0 +s21sJ1 0 −s21sJ1 0 +s21sJ1 0 −s21sJ1
2I1xI2y −c11sJ1 +c21sJ1 −c11sJ1 +c21sJ1 +c11sJ1 +c21sJ1 +c11sJ1 +c21sJ1
+c11sJ1+c21sJ1
2
2I1yI2x −c21sJ1 +c11sJ1 −c21sJ1 +c11sJ1 +c21sJ1 +c11sJ1 +c21sJ1 +c11sJ1
+c21sJ1+c11sJ1
2
2I1zI2x +c21sJ1 −s21sJ1 −c21sJ1 +s21sJ1 0 +s11sJ1 0 −s11sJ1 0 +s11sJ1 0 −s11sJ1 0
2I1zI2y +s21sJ1 +c21sJ1 −s21sJ1 −c21sJ1 −s11sJ1 0 +s11sJ1 0 +s11sJ1 0 −s11sJ1 0 0
I2y −1 0 1 0 −c21cJ1 +s21cJ1 +c21cJ1 −s21cJ1 0 +s11cJ1 0 −s11cJ1 0 +s11cJ1 0 −s11cJ1 0
I2x 0 1 0 −1 +s21cJ1 +c21cJ1 −s21cJ1 −c21cJ1 +s21cJ1 0 −s21cJ1 0 −s21cJ1 0 +s21cJ1 0 0
I2z +1 0 0 0 0 0 0 0 0 −c21cJ1 −c21cJ1 −c21cJ1 −c21cJ1 +c21cJ1 −c21cJ1 +c21cJ1 −c21cJ1 0
Imaginary in t1:
ˆρ(a) ˆρ(b) ˆρ(c) ˆρ(d) mˆρ(d)
φ1: +x +y −x −y +x +y −x −y +x +y −x −y +x +y −x −y
φ2: +y −x −y +x +y −x −y +x 1
4
m: − + − +
I1z +1 0 0 0 0 0 0 0 0 −s11cJ1 −s11cJ1 −s11cJ1 −s11cJ1 +s11cJ1 −s11cJ1 +s11cJ1 −s11cJ1 0
I1x 0 1 0 −1 +s11cJ1 +c11cJ1 −s11cJ1 −c11cJ1 0 +c11cJ1 0 −c11cJ1 0 +c11cJ1 0 −c11cJ1 0
I1y −1 0 1 0 −c11cJ1 +s11cJ1 +c11cJ1 −s11cJ1 −c11cJ1 0 +c11cJ1 0 +c11cJ1 0 −c11cJ1 0 0
2I1yI2z +s11sJ1 +c11sJ1 −s11sJ1 −c11sJ1 0 −c21sJ1 0 +c21sJ1 0 −c21sJ1 0 +c21sJ1
2I1xI2z +c11sJ1 −s11sJ1 −c11sJ1 +s11sJ1 −c21sJ1 0 +c21sJ1 0 +c21sJ1 0 −c21sJ1 0 0
2I1xI2y +s11sJ1 −s21sJ1 +s11sJ1 −s21sJ1 −s11sJ1 −s21sJ1 −s11sJ1 −s21sJ1
−s11sJ1−s21sJ1
2
2I1yI2x +s21sJ1 −s11sJ1 +s21sJ1 −s11sJ1 −s21sJ1 −s11sJ1 −s21sJ1 −s11sJ1
−s21sJ1−s11sJ1
2
2I1zI2x +c21sJ1 −s21sJ1 −c21sJ1 +s21sJ1 −c11sJ1 0 +c11sJ1 0 +c11sJ1 0 −c11sJ1 0 0
2I1zI2y +s21sJ1 +c21sJ1 −s21sJ1 −c21sJ1 0 −c11sJ1 0 +c11sJ1 0 −c11sJ1 0 +c11sJ1 0
I2y −1 0 1 0 −c21cJ1 +s21cJ1 +c21cJ1 −s21cJ1 −c21cJ1 0 +c21cJ1 0 +c21cJ1 0 −c21cJ1 0 0
I2x 0 1 0 −1 +s21cJ1 +c21cJ1 −s21cJ1 −c21cJ1 0 +c11cJ1 0 −c11cJ1 0 +c11cJ1 0 −c11cJ1 0
I2z +1 0 0 0 0 0 0 0 0 −s21cJ1 −s21cJ1 −s21cJ1 −s21cJ1 +s21cJ1 −s21cJ1 +s21cJ1 −s21cJ1 0
300
+
Nγ2 2B0
16kBT
(Ω1 − πJ − ω1)
R
2
2,1 + (Ω1 − πJ − ω1)2
+
(Ω1 + πJ − ω1)
R
2
2,1 + (Ω1 + πJ − ω1)2
(Ω1 − πJ − ω2)
R
2
2,1 + (Ω1 − πJ − ω2)2
+
(Ω1 + πJ − ω2)
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω1,Ω1]
+
Nγ2 2B0
16kBT
R2,2
R
2
2,2 + (Ω2 − πJ − ω1)2
−
R2,2
R
2
2,2 + (Ω2 + πJ − ω1)2
R2,1
R
2
2,1 + (Ω1 − πJ − ω2)2
−
R2,1
R
2
2,1 + (Ω1 + πJ − ω2)2
[Ω2,Ω1]
,
(11.26)
It is converted by the third 90◦ pulse to
ˆρ(e) = κ
2
(1
2
(c11sJ1+c21sJ1)2I1xI1z + 1
2
(c11sJ1+c21sJ1)2I1zI1x),
which evolves during t2 as
ˆρ(t2) = κ
2
( 1
2
(c11sJ1+c21sJ1)c12sJ2I1y − 1
2
(c11sJ1+c21sJ1)s12sJ2I1x + 1
2
(c11sJ1+c21sJ1)c22sJ2I2y − 1
2
(c11sJ1+c21sJ1)s22sJ2I2x)
plus unmeasurable anti-quantum coherences.
Considering orthogonality of the matrices,
Tr{ˆρ(t2)(I1x + iI1y + I2x + iI2y)} =
κ
2
(i
1
2
(c11sJ1+c21sJ1)c12sJ2 −
1
2
(c11sJ1+c21sJ1)s12sJ2 + i
1
2
(c11sJ1+c21sJ1)c22sJ2 −
1
2
(c11sJ1+c21sJ1)s22sJ2). (11.27)
Using the trigonometric relations (Eq. 11.15), the averaged signal (i.e., the signal recorded for the full phase cycle, divided by four) is
proportional to
Tr{ˆρ(t2)(I1x + iI1y + I2x + iI2y)}
=
κ
4
(i(c11sJ1+c21sJ1)c12sJ2 − (c11sJ1+c21sJ1)s12sJ2 + i(c11sJ1+c21sJ1)c22sJ2 − (c11sJ1+c21sJ1)s22sJ2)
= i
κ
4
−s−
11 + s+
11
2
−s−
12 + s+
12
2
+
−s−
21 + s+
21
2
−s−
12 + s+
12
2
+
−s−
11 + s+
11
2
−s−
22 + s+
22
2
+
−s−
21 + s+
21
2
−s−
22 + s+
22
2
−i
−s−
11 + s+
11
2
c−
12 − c+
12
2
+
−s−
21 + s+
21
2
c−
12 − c+
12
2
+
−s−
11 + s+
11
2
c−
22 − c+
22
2
+
−s−
21 + s+
21
2
c−
22 − c+
22
2
(11.28)
In order to obtain a hypercomplex two-dimensional spectrum, the measurements is repeated with φ2 advanced by 90 ◦ for each t1
increment. The components of ˆρ(d) contributing to the signal, = −κ
2
( 1
2
(s11sJ1+s21sJ1)2I1xI1y + (s11sJ1+s21sJ1)2I1yI1x)
are converted by the third 90◦ pulse to
ˆρ(e) = −κ
2
( 1
2
(s11sJ1+s21sJ1)2I1xI1z + 1
2
(s11sJ1+s21sJ1)2I1zI1x),
which evolves during t2 as
ˆρ(t2) = −κ
2
( 1
2
(s11sJ1+s21sJ1)c12sJ2I1y − 1
2
(s11sJ1+s21sJ1)s12sJ2I1x + 1
2
(s11sJ1+s21sJ1)c22sJ2I2y − 1
2
(s11sJ1+s21sJ1)s22sJ2I2x)
plus unmeasurable anti-quantum coherences.
Considering orthogonality of the matrices,
Tr{ˆρ(t2)(I1x + iI1y + I2x + iI2y)} =
−
κ
2
(i
1
2
(s11sJ1+s21sJ1)c12sJ2 −
1
2
(s11sJ1+s21sJ1)s12sJ2 + i
1
2
(s11sJ1+s21sJ1)c22sJ2 −
1
2
(s11sJ1+s21sJ1)s22sJ2). (11.29)
Including the factor of two and using the trigonometric relations (Eq. 11.15),
Tr{ˆρ(t2)(I1x + iI1y + I2x + iI2y)}
= −
κ
4
(i(s11sJ1+s21sJ1)c12sJ2 − (s11sJ1+s21sJ1)s12sJ2 + i(s11sJ1+s21sJ1)c22sJ2 − (s11sJ1+s21sJ1)s22sJ2)
= −i
κ
4
c−
11 − c+
11
2
−s−
12 + s+
12
2
+
c−
21 − c+
21
2
−s−
12 + s+
12
2
+
c−
11 − c+
11
2
−s−
22 + s+
22
2
+
c−
21 − c+
21
2
−s−
22 + s+
22
2
−i
c−
11 − c+
11
2
c−
12 − c+
12
2
+
c−
21 − c+
21
2
c−
12 − c+
12
2
+
c−
11 − c+
11
2
c−
22 − c+
22
2
+
c−
21 − c+
21
2
c−
22 − c+
22
2
(11.30)
11.10. SUPPORTING INFORMATION 301
Introducing relaxation, applying phase correction, and performing Fourier transformation in the direct dimension, discaring the
imaginary component (see Section 9.5.1), multiplying Eq. 11.30 by ”i” and combining it with Eq. 11.28, introducing relaxation and
performing Fourier transformation in the indirect dimension provides a signal with the real part proportional to
+
Nγ2 2B0
32kBT
R2,1
R
2
2,1 + (ω1 − Ω1 + πJ)2
−
R2,1
R
2
2,1 + (ω1 − Ω1 − πJ)2
R2,1
R
2
2,1 + (ω2 − Ω1 + πJ)2
−
R2,1
R
2
2,1 + (ω2 − Ω1 − πJ)2
[Ω1,Ω1]
+
Nγ2 2B0
32kBT
R2,2
R
2
2,2 + (ω1 − Ω2 + πJ)2
−
R2,2
R
2
2,2 + (ω1 − Ω2 − πJ)2
R2,1
R
2
2,1 + (ω2 − Ω1 + πJ)2
−
R2,1
R
2
2,1 + (ω2 − Ω1 − πJ)2
[Ω2,Ω1]
+
Nγ2 2B0
32kBT
R2,2
R
2
2,2 + (ω1 − Ω2 + πJ)2
−
R2,2
R
2
2,2 + (ω1 − Ω2 − πJ)2
R2,2
R
2
2,2 + (ω2 − Ω2 + πJ)2
−
R2,2
R
2
2,2 + (ω2 − Ω2 − πJ)2
[Ω1,Ω1]
+
Nγ2 2B0
32kBT
R2,1
R
2
2,1 + (ω1 − Ω1 + πJ)2
−
R2,1
R
2
2,1 + (ω1 − Ω1 − πJ)2
R2,2
R
2
2,2 + (ω2 − Ω2 + πJ)2
−
R2,2
R
2
2,2 + (ω2 − Ω2 − πJ)2
[Ω2,Ω1]
.
(11.31)
The terms with the same indices of Ω represent diagonal peaks with a Lorentzian absorption shape at [Ω1, Ω1] and [Ω2, Ω2] and the
terms with diﬀerent indices represent cross-peaks with a Lorentzian absorption shape at [Ω2, Ω1] and [Ω1, Ω2]. Comparison with Eqs. 11.26
and 11.25 shows that (i) a phase shift between diagonal peaks and cross-peaks is present only in standard COSY but not in DQF-COSY,
and (ii) the DQF-COSY signal intensity is half of the value obtained in standard COSY. The spectrum is plotted in Figure 11.6. Note
that diagonal peaks and cross-peaks have the same phase (form anti-phase doublets).
302
Lecture 12
Strong coupling, TOCSY
Literature: Strong coupling for a pair of nuclei is discussed in K12.1, L14.1-L14.3, C2.5.2, and
analyzed in detail in LA.8. The idea of the magnetic equivalence is presented in K12.2, L14.4 (for two
nuclei), L17.5 (in larger molecules, with some details discussed in LA.9). The TOCSY experiment
discussed in Section 12.3 (mixing the Iny coherences) is described in L18.14, another variant (mixing
the Inz coherences) is presented in K8.11, C4.2.1.2, and C6.5.
12.1 Strong J-coupling
We have seen in Section 10.2 that secular approximation substantially simpliﬁes Hamiltonian of the
J-coupling if γ and/or chemical shifts diﬀer (weak coupling). However, the description of the system
of interacting nuclei changes dramatically if γ1 = γ2 and chemical shifts are similar (strong coupling).
We now analyze how a density matrix describing a strongly coupled pair of magnetic moments evolves
in a one-pulse 1D NMR experiment.
As usually, the density matrix at the beginning of the experiment is given by the thermal equilibrium.
As mentioned in Section 10.10.6, the eﬀect of the J-coupling on populations is negligible.
Therefore, the initial form of the density matrix and its form after the 90◦
excitation pulse are the
same as in the case of a weak coupling:
ˆρ(b) =
1
2
It −
κ
2
I1y −
κ
2
I2y. (12.1)
In order to describe the evolution of ˆρ, we need to know the Hamiltonian. For a pair of nuclei,
the Hamiltonian (expressed in e rotating coordinate frame) is given by Eq. 10.3. In the presence of
(very similar) chemical shifts
H = +Ω1I1z + Ω2I2z + πJ (2I1zI2z + 2I1xI2x + 2I1yI2y) . (12.2)
In this Hamiltonian, I1z and I2z do not commute with 2I1xI2x and 2I1yI2y. Therefore, we
cannot analyze the evolution of the density matrix by analyzing eﬀects of individual components of
the Hamiltonian separately and in any order, as we did in the case of weak the coupling Hamiltonian
Ω1I1z + Ω2I2z + πJ · 2I1zI2z consisting of three mutually commuting components.
If we use matrices listed in Tables 8.1 and 8.2, the matrix representation of the Hamiltonian is
303
304
H =




Ω1+Ω2
2
+ π
2
J 0 0 0
0 Ω1−Ω2
2
− π
2
J πJ 0
0 πJ −Ω1−Ω2
2
− π
2
J 0
0 0 0 −Ω1+Ω2
2
+ π
2
J



 . (12.3)
Obviously, the matrix is not diagonal. In order to ﬁnd eigenvalues of the Hamiltonian, corresponding
to frequencies observed in the spectra, we have to ﬁnd a new basis where the Hamiltonian
is represented by a diagonal matrix. This is done in Section 12.4.1. The diagonalized matrix H can
be written as a linear combination of matrices listed in Table 8.1
H = Ω1I1z + Ω2I2z + πJ · 2I1zI2z, (12.4)
where
Ω1 =
1
2
Ω1 + Ω2 + (Ω1 − Ω2)2 + 4π2J2 (12.5)
Ω2 =
1
2
Ω1 + Ω2 − (Ω1 − Ω2)2 + 4π2J2 . (12.6)
We see that H consists of the same product operators as the Hamiltonian describing a weak
coupling, only the frequencies diﬀer. The density matrix ˆρ(b) and the operator of the measured
quantity ˆM+ should be also expressed in the basis found in Section 12.4.1. The transformed density
matrix ˆρ consists of the same product operators as the density matrix in the original basis, they are
just combined with diﬀerent coeﬃcients. We can thus repeat the analysis presented for a weak Jcoupling
in Section 10.4 using the same rotations in the operators space as presented in Figure 10.3.
The analysis of a strongly J-coupled system diﬀers only in three issues: (i) we start to rotate from
a diﬀerent combination of product operators, (ii) the angles of rotations diﬀer, being given by the
frequencies Ω1, Ω2 instead of the frequency oﬀsets Ω1, Ω2, and (iii) we have to calculate a trace of
the density matrix multiplied by the transformed operator of transverse magnetization, ˆM+. The
analysis is presented in Section 12.4.2. The Fourier transform of the result (Eq. 12.52) is
{Y (ω)} = 1 −
2πJ
(Ω1 − Ω2)2 + 4π2J2
Nγ2 2
B0
8kBT
R2
R
2
2 + (Ω2 − πJ − ω)2
+ 1 +
2πJ
(Ω1 − Ω2)2 + 4π2J2
Nγ2 2
B0
8kBT
R2
R
2
2 + (Ω2 + πJ − ω)2
+ 1 +
2πJ
(Ω1 − Ω2)2 + 4π2J2
Nγ2 2
B0
8kBT
R2
R
2
2 + (Ω1 − πJ − ω)2
+ 1 −
2πJ
(Ω1 − Ω2)2 + 4π2J2
Nγ2 2
B0
8kBT
R2
R
2
2 + (Ω1 + πJ − ω)2
12.2. MAGNETIC EQUIVALENCE 305
{Y (ω)} = i 1 −
2πJ
(Ω1 − Ω2)2 + 4π2J2
Nγ2 2
B0
8kBT
Ω2 − πJ − ω
R
2
2 + (Ω2 − πJ − ω)2
+i 1 +
2πJ
(Ω1 − Ω2)2 + 4π2J2
Nγ2 2
B0
8kBT
Ω2 + πJ − ω
R
2
2 + (Ω2 + πJ − ω)2
+i 1 +
2πJ
(Ω1 − Ω2)2 + 4π2J2
Nγ2 2
B0
8kBT
Ω1 − πJ − ω
R
2
2 + (Ω1 − πJ − ω)2
+i 1 −
2πJ
(Ω1 − Ω2)2 + 4π2J2
Nγ2 2
B0
8kBT
Ω1 + πJ − ω
R
2
2 + (Ω1 + πJ − ω)2
. (12.7)
Spectra for three diﬀerent values of |Ω1 − Ω2| are plotted in Figure 12.1. The following features
distinguish spectra of strongly coupled nuclear magnetic moments from those of weakly coupled pairs:
• The centers of doublets of peaks of individual nuclei are shifted from the precession frequencies
of the nuclei Ω1 and Ω2 by a factor of ± Ω1 − Ω2 − (Ω1 − Ω2)2 + 4π2J2 /2.
• The intensities of the inner peaks of the doublet of doublets are increased and the intensities
of the outer peaks are decreased by a factor of 2πJ/ (Ω1 − Ω2)2 + 4π2J2.
The square root (Ω1 − Ω2)2 + 4π2J2 speciﬁes the limit between weak and strong J-coupling.
If |Ω1 − Ω2| 2π|J|, the factors modifying the peak intensities are negligible and the J-coupling
is considered weak. The other limit, |Ω1 − Ω2| → 0, deserves a special attention and is discussed in
more details in the next section.
12.2 Magnetic equivalence
If two interacting nuclear magnetic moments have the same precession frequencies (due to a molecular
symmetry1
or accidentally), and if they are not distinguished by diﬀerent couplings to other nuclei,
they are magnetically equivalent.
Following the trends in Figure 12.1 suggests that only one peak appears in a spectrum of a pair
of magnetically equivalent nuclei. This explains why we do not observe e.g. splitting due to the
relatively large J-coupling of protons in water (|2
J| ≈ 7 Hz).
From the theoretical point of view, a pair of magnetically equivalent nuclei represents a fundamentally
diﬀerent system than a pair of weakly coupled nuclei (even for identical J constant). The
eigenstates of the Hamiltonian of the magnetically equivalent nuclei in B0 are not direct products of
the |α and |β eigenstates (as we described in Section 8.9.3). The pair of magnetically equivalent
1
Nuclei can be inequivalent even if the whole molecule is symmetric (i.e., achiral). Existence of a plane of symmetry
is not suﬃcient, the plane must bisect the particular pair of nuclei. Otherwise, the nuclei are diastereotopic and
magnetically inequivalent.
306
{Y(ω)}
ω
Ω2
2πJ
Ω1
2πJ
{Y(ω)} ω
Ω2
2πJ
Ω1
2πJ
{Y(ω)}
ω
Ω2
2πJ
Ω1
2πJ
{Y(ω)}
ω
Ω2
2πJ
Ω1
2πJ
{Y(ω)}
ω
Ω2
2πJ
Ω1
2πJ
{Y(ω)}
ω
Ω2
2πJ
Ω1
2πJ
Figure 12.1: One-dimensional spectra of strongly J-coupled 1
H–1
H pairs. The spectra are plotted for Ω1 −Ω2 = 4πJ
(top), Ω1 − Ω2 = 2πJ (middle), and Ω1 − Ω2 = 0.8πJ (bottom).
12.3. TOCSY 307
Table 12.1: Eigenvalues of selected operators for a pair of magnetically equivalent nuclei. The operators ˆI2
1 , ˆI2
1 , ˆI 2
,
and ˆIz are deﬁned in Section 10.10.2, H = (ω0 + πJ)I1z + (ω0 − πJ)I2z + πJ · 2I1zI2z.
Eigenfunction ˆI2
1
ˆI2
2
ˆI 2 ˆIz H
|α ⊗ |α 3 2
/4 3 2
/4 2 2
+ +ω0 + π
2
J
1√
2
|α ⊗ |β + 1√
2
|β ⊗ |α 3 2
/4 3 2
/4 2 2
0 +π
2
J
1√
2
|α ⊗ |β − 1√
2
|β ⊗ |α 3 2
/4 3 2
/4 0 0 −3π
2
J
|β ⊗ |β 3 2
/4 3 2
/4 2 2
− −ω0 + π
2
J
nuclei is similar to a pair of electrons discussed in Section 10.10.2. The eigenfunctions and eigenvalues
for important operators are listed in Table 12.1.
The eigenfunctions help us to understand the diﬀerence between quantum states of non-interacting
or weakly J-coupled pairs on one hand, and magnetically equivalent pairs on the other hand. We
have discussed in detail that the stationary states |α ⊗|α , |α ⊗|β , |β ⊗|α , |β ⊗|β are important
in single pairs of nuclei, but are rarely present in large macroscopic ensembles. Now we see that in
the case of magnetically equivalent nuclei, |α ⊗ |β and |β ⊗ |α do not even describe stationary
states of a single pair. Instead, the stationary states are their combinations.
The eigenvalues of the operator representing square of the total angular momentum ˆI 2
tells us
that three eigenstates have the same size of the total angular momentum (
√
2 ) and one does not have
any angular momentum (and therefore any magnetic moment). The energy diﬀerences (eigenvalues
of H multiplied by ) between the three ”magnetic states” are the same in isotropic liquids (but
they diﬀer if the dipole-dipole coupling is not averaged to zero), which explains why we see only one
frequency in the spectrum. The ”non-magnetic” state does not have any magnetic moment and thus
does not contribute to observable magnetization.
The analysis is more demanding if a magnetically equivalent pair is a part of a larger molecule.
Nevertheless, it can be shown that J-couplings between magnetically equivalent nuclei in larger
molecules do not aﬀect the NMR spectra (Sections 12.4.3 and 12.4.4).
12.3 TOCSY
At the ﬁrst glance, molecules whose nuclei have very similar chemical shifts (by accident or as a
result of molecular symmetry), and are therefore very strongly J-coupled, seem to represent a special
case. However, tricks discussed in the previous lectures allow us to exploit advantages of magnetic
equivalence even if the chemical shifts are very diﬀerent. We have learnt that we can use a spin echo
to suppress the eﬀect of the chemical shift evolution, which is exactly what we need: no chemical
shift evolution corresponds to zero diﬀerence in frequency oﬀset. If we apply the simultaneous echo
(actually, the only echo applicable to homonuclear pairs) that keeps the J-coupling evolution but
refocuses evolution of the chemical shift, the state of the system of nuclei at the end of the echo
is the same as a state of a system of nuclei with identical chemical shifts. Note, however, that a
single application of a spin echo is not suﬃcient. Our goal is to make the strong coupling to act
308
H1
t1
x y/ t2y yy y y y y yy y y y
a b c d
Figure 12.2: TOCSY pulse sequence. The narrow black and wide cyan rectangles represent 90◦
and 180◦
radio wave
pulses applied at a frequency suﬃciently close to the precession frequencies of all interacting magnetic moments.
continuously for a certain period of time, comparable to 1/(2J), not just in one moment. Therefore,
we have to apply a series of radio-frequency pulses to keep the strong coupling active for a whole
mixing period. In principle, a series of very short coupling echoes with very short 180 ◦
pulses
should work (Figure 12.2). However, specially designed sequences of pulses with much weaker oﬀset
eﬀects are used in practice.2
Two-dimensional experiment utilizing a mixing mimicking the strong
coupling is known as Totally Correlated Spectroscopy (TOCSY). There are numerous variants of the
experiment, here we present only the simplest version (Figure 12.2) illustrating the basic idea.
In order to describe the major advantage of the TOCSY experiment, we analyze a simple system
of three nuclei (e.g. three protons) where nuclei 1 and 2 are coupled, nuclei 2 and 3 are also coupled,
but there is no coupling between nuclei 1 and 3 (a more general analysis and matrix representations
of the product operators for ensembles presented in Sections 12.4.5–12.4.8). Let us assume that both
coupling constants are identical (J12 = J23 = J). Before the TOCSY mixing period, the density
matrix of our system evolves like in the NOESY or COSY experiment. The evolution starts from
the equilibrium density matrix
ˆρ(a) = 1
4
(It + κI1z + κI2z + κI3z),
derived in Section 12.4.6 for a general case of N magnetic moments. The starting density matrix is
converted to
ˆρ(b) = 1
4
(It − κI1y − κI2y − κI3y)
by a 90◦
excitation pulse and evolves during the incremented evolution period t1. For the sake of
simplicity, we pay attention to the fate of the coherences modulated by the chemical shift of nucleus 1:
ˆρ(c) = −κ
4
cos(Ω1t1) cos(πJt1)I1y + . . .
Let us assume that the TOCSY pulse train is applied with the 90◦
or −90◦
(y or −y) phases of
the radio waves. As a consequence, the pulses keep the I1y, I2y, I3y components of the density
matrix intact and rotate other coherences ”about” the Iny ”axis”. Because the trains contain many
(hundreds) of pulses, the imperfections of the pulses and stochastic molecular motions randomize
the direction of the polarizations in the xz plane (an eﬀect similar to the loss of coherence in the xy
plane during evolution in the B0 ﬁeld). Therefore, we assume that only the I1y, I2y, I3y coherences,
”locked” in the y direction of the rotating frame, survive the TOCSY mixing pulse train.3
The Hamiltonian describing the evolution of our simple system during the TOCSY mixing period
is
2
Technically, our task is very similar to decoupling during acquisition, shown in Section 11.6.
3
If coherences other than Iny are not destroyed completely, their contribution can be removed by phase cycling.
12.3. TOCSY 309
HTOCSY = πJ (2I1zI2z + 2I1xI2x + 2I1yI2y + 2I2zI3z + 2I2xI3x + 2I2yI3y) . (12.8)
Note that the Hamiltonian is fully symmetric in our coordinate system. In our version of the
TOCSY experiment, we decided to preserve only the Iny coherences by the choice of the phase of
the applied pulses. However, the Hamiltonian itself acts on the Inx, Iny, and Inz in a completely
identical way.4
Therefore, the eﬀect of the Hamiltonian is called isotropic mixing (working equally
in all directions).
All components of the Hamiltonian in Eq. 12.8 commute (because the echo removed the chemical
shift components) and it is possible to inspect their eﬀects separately. Such analysis is straightforward
for two interacting nuclei, but gets complicated for three or more nuclei. Nevertheless, a useful insight
can be gained from the inspection of commutation relations of the HTOCSY Hamiltonian, derived in
Section 12.4.7. First, HTOCSY does not commute with I1y. It tells us that −κ
4
cos(Ω1t1) cos(πJt1)I1y
partially evolves to other coherences (or populations) during the TOCSY mixing (Eq. 12.69). Second,
HTOCSY does not commute with I1y + I2y either (Eq. 12.72). We see that the lost portion
of −κ
4
cos(Ω1t1) cos(πJt1)I1y is not completely converted to −κ
4
cos(Ω1t1) cos(πJt1)I2y. Finally,
HTOCSY does commute5
with I1y + I2y + I3y (Eq. 12.73). If I1y + I2y + I3y does not change
and I1y is not completely converted to I2y, the missing portion of I1y must be compensated by
formation of −κ
4
cos(Ω1t1) cos(πJt1)I3y. The fraction of the density matrix converted to I2y and I3y
depends on the length of the TOCSY pulse train (mixing time), on actual values of the J constants
(they are not identical in a real case), on relaxation, and on the evolution during the pulses (their
duration is not negligible compared to the lengths of individual echoes in the train if the goal is to
have the echoes as short as possible). In our analysis, we describe the fraction that stays in I1y by a
factor a11, the eﬃciency of the transfer from nucleus 1 to nucleus 2 by a factor a12, and the eﬃciency
of the transfer from nucleus 1 to nucleus 3 by a factor a13.
A detailed analysis of the evolution of the density matrix (the procedure, presented in Section
12.4.8, is very similar to those described in previous lectures for other 2D experiments) shows
that the coherence I1y provides the following components of the signal (cf. Eq. 12.82)
Nγ
κ
16
a11e−R2t1
(cos((Ω1 − πJ)t1) + cos((Ω1 + πJ)t1)) e−R2t2
e−i(Ω1−πJ)t2
+ e−i(Ω1+πJ)t2
+ Nγ
κ
32
a12e−R2t1
(cos((Ω1 − πJ)t1) + cos((Ω1 + πJ)t1)) e−R2t2
e−i(Ω2−2πJ)t2
+ 2e−iΩ2t2
+ e−i(Ω2+2πJ)t2
+ Nγ
κ
16
a13e−R2t1
(cos((Ω1 − πJ)t1) + cos((Ω1 + πJ)t1)) e−R2t2
e−i(Ω3−πJ)t2
+ e−i(Ω3+πJ)t2
. (12.9)
After processing the signal as described in Section 9.5.1, these components result in one set of
diagonal peaks (at the frequencies close to [Ω1, Ω1]) and in two sets of cross-peaks (Figure 12.3),
including peaks close to the frequencies of protons that are not directly J-coupled ([Ω1, Ω3]). This
is a fundamental diﬀerence between COSY and TOCSY spectra. Appearance of cross-peaks in the
4
We could select Inx coherences equally well by applying pulses with a phase of 0 ◦
(x). The Inz can be selected
by applying additional 90◦
before and after the TOCSY pulse train (this approach is described in K8.11 and C6.5).
In practice, the pulse trains are optimized for the given purpose.
5
In general, HTOCSY commutes with the operators of all three components Ij of the total angular momentum,
where j ∈ {x, y, z} and Ij is a sum of Inj for all nuclei n.
310
Ω1
Ω2
Ω3
Ω3Ω1 Ω2
ω1
ω2 ω2
Ω2
ω1
Ω1
Ω2
Ω3
Ω3Ω1
Figure 12.3: DQF-COSY (left) and TOCSY (right) spectra of a molecule with three protons with the J-coupling
constants |J12| > |J23| and J13 = 0. Note the presence of a cross-peak correlating the not coupled protons 1 and 3 in
the TOCSY, but not in the DQF-COSY spectrum.
COSY spectra requires a direct J-coupling, whereas cross-peaks in the TOCSY spectra correlate
all peaks of a spin-system (a network of nuclei connected by J-coupling), even if the coupling of a
particular pair is negligible (Figure 12.3). Structural information in COSY and TOCSY spectra is
complementary. The TOCSY experiment describes the complete spin systems in a single spectrum,
COSY spectra distinguish directly J-coupled nuclei (usually vicinal and geminal protons).
HOMEWORK
Analyze the TOCSY experiment.
12.4. SUPPORTING INFORMATION 311
12.4 SUPPORTING INFORMATION
12.4.1 Diagonalization of the J-coupling Hamiltonian matrix
The matrix representation of the Hamiltonian describing chemical shift and strong J-coupling, written in the basis constructed from the
α and β states of the interacting nuclei (i.e., |αα , |βα , |αβ , |ββ ), is
H =
π
2




Σ + J 0 0 0
0 ∆ − J 2J 0
0 2J −∆ − J 0
0 0 0 −Σ + J



 , (12.10)
where Σ = (ω0,1 + ω0,2)/π and ∆ = (ω0,1 − ω0,2)/π. We are looking for a new, diagonal matrix representation of our Hamiltonian
H . A similar task is solved in Section 10.10.2, the matrix in Eq. 12.10 just have more complicated elements. From the mathematical
point of view, diagonalization of our Hamiltonian can be described using a transformation matrix ˆT:
H = ˆT−1
H ˆT. (12.11)
Multiplying by ˆT from left gives
ˆTH = H ˆT. (12.12)
The desired eigenvalues ωk and eigenvectors |ψk can be obtained by comparing the eigenvalue equation
H |ψk = ωk|ψk (12.13)
with the left-hand side of Eq. 12.12




T11 T12 T13 T14
T21 T22 T23 T24
T31 T32 T33 T34
T41 T42 T43 T44








ω1 0 0 0
0 ω2 0 0
0 0 ω3 0
0 0 0 ω4



 =




ω1T11 ω2T12 ω3T13 ω4T14
ω1T21 ω2T22 ω3T23 ω4T24
ω1T31 ω2T32 ω3T33 ω4T34
ω1T41 ω2T42 ω3T43 ω4T44



 . (12.14)
The eigenvalue equation can be written as a set of four equations for k = 1, 2, 3, 4
H |ψk =
π
2




Σ + J 0 0 0
0 ∆ − J 2J 0
0 2J −∆ − J 0
0 0 0 −Σ + J








T1k
T2k
T3k
T4k



 =
π
2




(Σ + J)T1k
(∆ − J)T2k + 2JT3k
2JT2k − (∆ + J)T3k
(−Σ + J)T4k



 = ωk




T1k
T2k
T3k
T4k



 = ωk|ψk . (12.15)
The ﬁrst row of the middle equality allows us to identify
ω1 =
π
2
(Σ + J) =
ω0,1 + ω0,2
2
+
π
2
J (12.16)
if we set T21 = T31 = T41 = 0, i.e.,
|ψ1 =




T11
0
0
0



 . (12.17)
Similarly,
ω4 =
π
2
(−Σ + J) = −
ω0,1 + ω0,2
2
+
π
2
J (12.18)
for
|ψ4 =




0
0
0
T44



 . (12.19)
The ω2 and ω3 values can be calculated from the equations
312
2ωkT2k = π(∆ − J)T2k + 2πJT3k (12.20)
2ωkT3k = 2πJT2k − π(∆ + J)T3k, (12.21)
(setting T12 = T42 = T13 = T43 = 0).
T3k can be expressed from the ﬁrst equation
T3k =
2ωk + π(J − ∆)
2πJ
T2k (12.22)
and inserted into the second equation
(2ωk + π(J + ∆))(2ωk + π(J − ∆))T2k = (2πJ)2
T2k, (12.23)
directly giving
ωk = −
π
2
J ± 4J2 + ∆2 . (12.24)
Choosing
ω2 = −
π
2
J − 4J2 + ∆2 =
(ω0,1 − ω0,2)2 + 4π2J2
2
−
π
2
J (12.25)
and
ω3 = −
π
2
J + 4J2 + ∆2 = −
(ω0,1 − ω0,2)2 + 4π2J2
2
−
π
2
J. (12.26)
completely deﬁnes the diagonalized Hamiltonian
H =
π
2




Σ + J 0 0 0
0
√
∆2 + 4J2 − J 0 0
0 0 −
√
∆2 + 4J2 − J 0
0 0 0 −Σ + J



 =
ω0,1
2




1 0 0 0
0 1 0 0
0 0 −1 0
0 0 0 −1



 +
ω0,1
2




1 0 0 0
0 −1 0 0
0 0 1 0
0 0 0 −1



 + πJ




1 0 0 0
0 −1 0 0
0 0 −1 0
0 0 0 1




= ω0,1I1z + ω0,2I2z + πJ2I1zI2z, (12.27)
where
ω0,1 =
π
2
(Σ + ∆2 + 4J2) =
1
2
ω0,1 + ω0,2 + (ω0,1 − ω0,2)2 + 4π2J2 (12.28)
ω0,2 =
π
2
(Σ − ∆2 + 4J2) =
1
2
ω0,1 + ω0,2 − (ω0,1 − ω0,2)2 + 4π2J2 . (12.29)
The new basis is given by Eqs. 12.20, 12.21, and the normalization condition
ψk|ψk = 1 ⇒
4
j=1
T2
jk = 1. (12.30)
The normalization conditions immediately deﬁnes T11 = T44 = 1. Substituting ω2 into Eqs. 12.20 and 12.21, respectively, gives
T32
T22
=
√
4J2 + ∆2 − ∆
2J
(12.31)
T22
T32
=
√
4J2 + ∆2 + ∆
2J
. (12.32)
Consequently,
T2
32
T2
22
=
√
4J2 + ∆2 − ∆
√
4J2 + ∆2 + ∆
(12.33)
and applying the normalization condition T2
32 = 1 − T2
22
12.4. SUPPORTING INFORMATION 313
1 + T2
22
T2
22
=
√
4J2 + ∆2 − ∆
√
4J2 + ∆2 + ∆
(12.34)
deﬁnes
T2
22 =
1
1 −
√
4J2+∆2−∆
√
4J2+∆2+∆
=
√
4J2 + ∆2 + ∆
2
√
4J2 + ∆2
(12.35)
and
T2
32 = 1 − T2
22 =
√
4J2 + ∆2 − ∆
2
√
4J2 + ∆2
. (12.36)
Similarly, T2
23 and T2
33 can be calculated by substituting ω3 into Eqs. 12.20 and 12.21:
T2
23 =
√
4J2 + ∆2 − ∆
2
√
4J2 + ∆2
(12.37)
T2
33 =
√
4J2 + ∆2 + ∆
2
√
4J2 + ∆2
. (12.38)
If we use
T22 = T33 =
1
2
+
∆
2
√
4J2 + ∆2
≡ cξ, T23 =
1
2
−
∆
2
√
4J2 + ∆2
≡ sξ, T32 = −
1
2
+
∆
2
√
4J2 + ∆2
≡ −sξ, (12.39)
we obtain a transformation matrix
ˆT =




1 0 0 0
0 cξ −sξ 0
0 sξ cξ 0
0 0 0 1



 , (12.40)
which is its own inverse ( ˆT−1 = ˆT ⇒ ˆT−1 ˆT = ˆT ˆT = ˆ1). Later, we also use the following relations between cξ and sξ:
c2
ξ + s2
ξ =
1
2
+
∆
2
√
4J2 + ∆2
+
1
2
−
∆
2
√
4J2 + ∆2
= 1 (12.41)
2cξsξ = 2
1
4
−
∆2
4(4J2 + ∆2)
=
4J2 + ∆2 + ∆2
4J2 + ∆2
=
2J
√
4J2 + ∆2
. (12.42)
Finally, the new basis consists of the following eigenvectors
|ψ1 =




1
0
0
0



 , |ψ2 =








0
1
2
+ ∆
2
√
4J2+∆2
1
2
− ∆
2
√
4J2+∆2
0








≡




0
cξ
sξ
0



 , |ψ3 =








0
− 1
2
− ∆
2
√
4J2+∆2
1
2
+ ∆
2
√
4J2+∆2
0








≡




0
−sξ
cξ
0



 , |ψ4 =




0
0
0
1



 . (12.43)
We can also use the transformation matrix to express the density matrix (ˆρ = ˆT ˆρ ˆT) and the operator of the measured quantity
(ˆˆM+ = ˆT ˆM+
ˆT) in the new basis (cf. Eq. 12.11). In particular, we are interested in the transformed operators I1y +I2y and I1+ +I2+ =
I1x + I2x + i(I1y + I2y):
I1y + I2y = ˆT(I1y + I2y) ˆT =




1 0 0 0
0 cξ −sξ 0
0 sξ cξ 0
0 0 0 1




i
2




0 −1 −1 0
1 0 0 −1
1 0 0 −1
0 1 1 0








1 0 0 0
0 cξ −sξ 0
0 sξ cξ 0
0 0 0 1



 =
i
2




0 −(cξ + sξ) −(cξ − sξ) 0
cξ + sξ 0 0 −(cξ + sξ)
cξ − sξ 0 0 −(cξ − sξ)
0 cξ + sξ cξ − sξ 0




= cξ
i
2




0 −1 −1 0
+1 0 0 −1
+1 0 0 −1
0 +1 +1 0



 + sξ
i
2




0 −1 +1 0
+1 0 0 −1
−1 0 0 +1
0 +1 −1 0



 = cξ(I1y + I2y) + sξ(2I1zI2y − 2I1yI2z), (12.44)
314
I1x + I2x = ˆT(I1x + I2x) ˆT =




1 0 0 0
0 cξ −sξ 0
0 sξ cξ 0
0 0 0 1




1
2




0 1 1 0
1 0 0 1
1 0 0 1
0 1 1 0








1 0 0 0
0 cξ −sξ 0
0 sξ cξ 0
0 0 0 1



 =
1
2




0 cξ + sξ cξ − sξ 0
cξ + sξ 0 0 cξ + sξ
cξ − sξ 0 0 cξ − sξ
0 cξ + sξ cξ − sξ 0




= cξ
1
2




0 +1 +1 0
+1 0 0 +1
+1 0 0 +1
0 +1 +1 0



 + sξ
1
2




0 +1 −1 0
+1 0 0 +1
−1 0 0 −1
0 +1 −1 0



 = cξ(I1x + I2x) + sξ(2I1zI2x − 2I1xI2z), (12.45)
I1+ + I2+ =




0 cξ + sξ cξ − sξ 0
0 0 0 cξ + sξ
0 0 0 cξ − sξ
0 0 0 0



 = cξ(I1x + I2x + iI1y + iI2y) + sξ(2I1zI2x − 2I1xI2z + i2I1zI2y − i2I1yI2z) (12.46)
12.4.2 Strong J-coupling and density matrix evolution
When the density matrix at the beginning of the evolution is written in the new basis (where the Hamiltonian matrix is diagonal), it
consists of multiple contributions. We analyze its evolution separately for the operators contributing to the signal of individual nuclei, and
write the progress of the analysis in a table. The density matrix can be divided as
ˆρ = −
κ
2
I1y −
κ
2
I2y = ˆρ1 + ˆρ2 (12.47)
Starting with ˆρ1,
Contribution ˆρ1(b)
ω0,1I1z
−→
πJ·2I1zI2z
−→
Tr{ˆρ1(t)I1+}
I1y
2I1yI2z
+κ
2
cξ
−κ
2
sξ
+κ
2
cξc1
−κ
2
sξc1
+κ
2
cξc1cJ + κ
2
sξs1sJ
−κ
2
sξc1cJ − κ
2
cξs1sJ
+i κ
2
c2
ξc1cJ + i κ
2
cξsξs1sJ
+i κ
2
s2
ξc1cJ + i κ
2
cξsξs1sJ
= i κ
2
c1cJ + 2J√
4J2+∆2
s1sJ
I1x
2I1xI2z
0
0
−κ
2
cξs1
+κ
2
sξs1
−κ
2
cξs1cJ + κ
2
sξc1sJ
+κ
2
sξs1cJ − κ
2
cξc1sJ
−κ
2
c2
ξs1cJ + κ
2
cξsξc1sJ
−κ
2
s2
ξs1cJ + κ
2
cξsξc1sJ
= −κ
2
s1cJ − 2J√
4J2+∆2
c1sJ
Using the following trigonometric relations
c1cJ =
c −
1 + c +
1
2
s1sJ =
c −
1 − c +
1
2
c1sJ =
−s −
1 + s +
1
2
s1cJ =
s −
1 + s +
1
2
, (12.48)
where c ±
1 = cos((ω0,1 − ωrot ± πJ)t) = cos((Ω1 ± πJ)t) and s ±
1 = sin((ω0,1 − ωrot ± πJ)t) = sin((Ω1 ± πJ)t) (ωrot = −ωradio),
Tr{ˆρ1(t)I1+} = i
κ
2
c1cJ +
2J
√
4J2 + ∆2
s1sJ −
κ
2
s1cJ −
2J
√
4J2 + ∆2
c1sJ
= i
κ
2
c −
1 + c +
1
2
+
2J
√
4J2 + ∆2
c −
1 − c +
1
2
−
κ
2
s −
1 + s +
1
2
+
2J
√
4J2 + ∆2
s −
1 − s +
1
2
= i
κ
4
1 +
2J
√
4J2 + ∆2
c −
1 + 1 −
2J
√
4J2 + ∆2
c +
1 + i 1 +
2J
√
4J2 + ∆2
s −
1 + 1 −
2J
√
4J2 + ∆2
s +
1
=
κ
4
ei π
2 1 +
2J
√
4J2 + ∆2
ei(Ω1−πJ)t
+ 1 −
2J
√
4J2 + ∆2
ei(Ω1+πJ)t
. (12.49)
We now repeat the analysis for nucleus 2.
Contribution ˆρ2(b)
ω0,2I2z
−→
πJ·2I1zI2z
−→
Tr{ˆρ2(t)I2+}
I2y
2I1zI2y
+κ
2
cξ
+κ
2
sξ
+κ
2
cξc2
+κ
2
sξc2
+κ
2
cξc2cJ − κ
2
sξs2sJ
+κ
2
sξc2cJ − κ
2
cξs2sJ
+i κ
2
c2
ξc2cJ − i κ
2
cξsξs2sJ
+i κ
2
s2
ξc2cJ − i κ
2
cξsξs2sJ
= i κ
2
c2cJ − 2J√
4J2+∆2
s2sJ
I2x
2I1zI2x
0
0
−κ
2
cξs2
−κ
2
sξs2
−κ
2
cξs2cJ − κ
2
sξc2sJ
−κ
2
sξs2cJ − κ
2
cξc2sJ
−κ
2
c2
ξs2cJ − κ
2
cξsξc2sJ
−κ
2
s2
ξs2cJ − κ
2
cξsξc2sJ
= −κ
2
s2cJ + 2J√
4J2+∆2
c2sJ
Using the following trigonometric relations
c2cJ =
c −
2 + c +
2
2
s2sJ =
c −
2 − c +
2
2
c2sJ =
−s −
2 + s +
2
2
s2cJ =
s −
2 + s +
2
2
, (12.50)
12.4. SUPPORTING INFORMATION 315
where c ±
2 = cos((ω0,2 − ωrot ± πJ)t) = cos((Ω2 ± πJ)t) and s ±
2 = sin((ω0,2 − ωrot ± πJ)t) = sin((Ω2 ± πJ)t),
Tr{ˆρ2(t)I2+} = i
κ
2
c2cJ −
2J
√
4J2 + ∆2
s2sJ −
κ
2
s2cJ +
2J
√
4J2 + ∆2
c2sJ
= i
κ
2
c −
2 + c +
2
2
−
2J
√
4J2 + ∆2
c −
2 − c +
2
2
−
κ
2
s −
2 + s +
2
2
−
2J
√
4J2 + ∆2
s −
2 − s +
2
2
= i
κ
4
1 −
2J
√
4J2 + ∆2
c −
2 + 1 +
2J
√
4J2 + ∆2
c +
2 + i 1 −
2J
√
4J2 + ∆2
s −
2 + 1 +
2J
√
4J2 + ∆2
s +
2
=
κ
4
ei π
2 1 −
2J
√
4J2 + ∆2
ei(Ω2−πJ)t
+ 1 +
2J
√
4J2 + ∆2
ei(Ω2+πJ)t
. (12.51)
Combining results presented in Eqs. 12.51 and 12.49, applying phase correction, and including relaxation, we obtain the following
description of the evolution of the signal:
M+ =
Nγ2 2B0
8kBT
e−R2t
1 − 2cξsξ ei(Ω1−πJ)t
+ 1 + 2cξsξ ei(Ω1+πJ)t
+ 1 + 2cξsξ ei(Ω2−πJ)t
+ 1 − 2cξsξ ei(Ω2+πJ)t
, (12.52)
where 2cξsξ = 2J/
√
4J2 + ∆2.
12.4.3 HJ and operators of components of total I commute
We show that the operator of each component of the total angular momentum (e.g., ˆIx ∝ Ix = I1x + I2x + I3x + . . . ) commutes
with the strong coupling Hamiltonian HJ for any number of nuclei in the coupled system and for any values of the J constants. For
j = x, k = y, l = z or for any cyclic permutation (j = y, k = z, l = x or j = z, k = x, l = y),
[Ij, HJ ] =
n n =n
2πJnn [Inj, (InjIn j + InkIn k + InlIn l)]
=
n n =n
2πJnn ([Inj, Ink]In k + [Inj, Inl]In l) =
n n =n
2πJnn ([Inj, Ink]In k − [Inl, Inj]In l)
=
n n =n
2iπJnn (InlIn k − InkIn l) = 0
, (12.53)
where n and n are two diﬀerent nuclei. The commutator is equal to zero because for any pair of nuclei p and q, the term
2iπJnn (InlIn k − InkIn l) appears twice in the sum, with the opposite sign: once for n = p and n = q as 2iπJpq(IplIqk − IpkIql),
and once for n = q and n = p as 2iπJpq(IqlIpk − IqkIpl).
12.4.4 J-coupling of magnetically equivalent nuclei
In general, the free evolution of multiple spin-1/2 magnetic moments is governed by the Hamiltonian
H =
n
ω0,nInz + πJnn
n n
(2InxIn x + 2InyIn y + 2InzIn z) =
n
ω0,nInz + HJ (12.54)
If the nuclei are magnetically equivalent,
H = ω0
n
Inz + HJ = ω0Iz + HJ , (12.55)
where Iz and HJ commute, as shown in Section 12.4.3. Therefore, the eﬀect of chemical shift and J-coupling can be analyzed
separately. Note that HJ commutes also with ˆM+, which is proportional to Ix + iIy.
In order to analyze the eﬀect of the J-coupling on the spectrum, we evaluate M+ as
M+ = Tr{ˆρ ˆM+} =
j k
ρjkM+jk, (12.56)
316
where we expressed the trace explicitly in terms of the elements of the matrices ˆρ and ˆM+. If the system evolves due to the J-coupling,
M+ should change, i.e., the time derivative of M+ should diﬀer from zero.
d M+
dt
=
j k
dρjkM+jk
dt
= Tr
dˆρ
dt
ˆM+ . (12.57)
According to the Liouville-von Neumann equation,
dˆρ
dt
= i[ˆρ, HJ ] ⇒
d M+
dt
= iTr [ˆρ, HJ ] ˆM+ = iTr ˆρHJ
ˆM+ − iTr HJ ˆρ ˆM+ . (12.58)
Because HJ commutes with ˆM+ (and therefore HJ
ˆM+ = ˆM+HJ ), and because Tr{ ˆA ˆB} = Tr{ ˆB ˆA},
d M+
dt
= iTr ˆρ ˆM+HJ − iTr HJ ˆρ ˆM+ = iTr (ˆρ ˆM+)HJ − iTr HJ (ˆρ ˆM+) = iTr (ˆρ ˆM+)HJ − iTr (ˆρ ˆM+)HJ = 0. (12.59)
We see that M+ does not change due to the J-coupling regardless of the actual form of ˆρ. This proves that J-coupling between
magnetically equivalent nuclei does not have any eﬀect on the spectrum (is invisible).
12.4.5 Product operators of three and more coupled magnetic moments
Features of the TOCSY experiments are fully manifested only in networks consisting of three or more interacting magnetic moments.
Therefore, we need to ﬁnd product operators for systems of more than two nuclei. We start by the normalized bases of density matrices
of isolated magnetic moments, consisting of four operators
√
2It,
√
2Ix,
√
2Iy,
√
2Iz. They can be also written as
1
√
2
2I
(1)
j , (12.60)
where j ∈ {t, x, y, z} and the superscript (1), not written in practice, emphasizes that we describe a single magnetic moment. Note
that all operators 2I
(1)
j are represented my matrices with elements equal to 0, ±1, or ±i.
We continue by the procedure introduced in Section 8.3. Eqs. 8.11–8.26 describe construction of an orthonormal basis consisting of
direct products of operators
√
2Ij, where j ∈ {t, x, y, z}. A general form of Eqs. 8.11–8.26 can be written as
2I1jI
(12)
2k =
√
2I
(1)
j ⊗
√
2I
(2)
k =
1
2
2I
(1)
j ⊗ 2I
(2)
k , (12.61)
where j, k ∈ {t, x, y, z}. The unit matrices ˆ1 = 2I
(n)
t are usually not written explicitly.
In a similar manner, 64 operators constituting an orthonormal bases of density matrices of three coupled magnetic moments can be
obtained by calculating
2
√
2I1jI2kI3l
(123)
=
√
2I
(1)
j ⊗
√
2I
(2)
k ⊗
√
2I
(3)
l =
1
√
8
2I
(1)
j ⊗ 2I
(2)
k ⊗ 2I
(3)
l , (12.62)
where j, k, l ∈ {t, x, y, z}.
In general, 22N operators constituting an orthonormal bases of density matrices of N coupled magnetic moments are obtained by
calculating
√
2N
2
I1j · · · INk
(1···N)
=
√
2I
(1)
j ⊗ · · · ⊗
√
2I
(N)
k =
1
√
2N
2I
(1)
j ⊗ · · · ⊗ 2I
(N)
k . (12.63)
Note that the operators of angular momentum components divided by , Inj, are orthonormal only for systems of two magnetic
moments (together with It = 1
2
ˆ1 and product operators 2InjIn k). In general, product operators of N coupled magnetic moments
2N
2
I1j · · · INk
(1···N)
=
1
2
2I
(1)
j ⊗ · · · ⊗ 2I
(N)
k , (12.64)
including Inj, must be multiplied by 2/
√
2N to be normalized. For example, single-quantum coherences of a set of three magnetic
moments are normalized by dividing the product operators I1x, 2I1xI2z, 4I1xI2zI3z, etc., by
√
2.
In conclusion, we have to distinguish whether we use the product operator to represent a physical quantity, or to serve as a mathematical
object, i.e., a basis matrix. In the former case, we should use the operator without normalization: e.g., Inz multiplied by to represent
angular momentum, multiplied by γ to represent magnetic moment, multiplied by −γB0 to represent energy in the magnetic ﬁeld B0,
etc. In the latter case, normalization is useful e.g. when calculating traces of products of matrices.
12.4. SUPPORTING INFORMATION 317
12.4.6 Three magnetic moments in thermal equilibrium
Derivation of the density matrix describing an ensemble of pairs of nuclear magnetic moments presented in Section 8.9.8 can be easily
extended to a sets of N nuclei. As the inﬂuence of direct or indirect interaction on the distribution of magnetic moments is negligible in
liquid samples, and the impact of the chemical shift is also very small, it is suﬃcient to consider the Hamiltonian reﬂecting the eﬀect of
the external magnetic ﬁeld B0:
ˆH = −
N
n=1
γnB0
ˆInz = −
N
n=1
γnB0 Inz. (12.65)
The density matrix describing the mixed state in the equilibrium consists of populations
Peq
j =
e−Ej /kBT
2N
k=1
e−Ek/kBT
≈
1 −
Ej
kBT
2N
, (12.66)
where Ej is the eigenvalue of the Hamiltonian corresponding the population Pj. Consequently, the equilibrium density matrix is
ˆρeq
=
1
2N
ˆ1 +
N
n=1
γnB0
kBT
Inz =
1
2N−1
It +
N
n=1
κnInz , (12.67)
where
κn =
γnB0
2kBT
. (12.68)
12.4.7 Commutation relations of the TOCSY mixing Hamiltonian
The commutators of the Ijy operators with the HTOCSY Hamiltonian for a set of three protons with J12 = J23 > 0 and J13 = 0 are given
by
[I1y, HTOCSY] = πJ[I1y, 2I1xI2x+2I1yI2y+2I1zI2z] = 2πJ[I1y, I1x]I2x+2πJ[I1y, I1z]I2z = −2iπJ(I1zI2x−I1xI2z), (12.69)
[I2y, HTOCSY] = πJ[I2y, 2I1xI2x + 2I1yI2y + 2I1zI2z + 2I2xI3x + 2I2yI3y + 2I2zI3z]
= 2πJI1x[I2y, I2x] + 2πJI1z[I2y, I2z] + 2πJ[I2y, I2x]I3x + 2πJ[I2y, I2z]I3z
= −2iπJ(I1xI2z − I1zI2x) − 2iπJ(I2xI3z − I2zI3x), (12.70)
[I3y, HTOCSY] = πJ[I3y, 2I2xI3x+2I2yI3y+2I2zI3z] = 2πJI2x[I3y, I3x]+2πJI2z[I3y, I3z] = −2iπJ(I2xI3z−I2zI3x). (12.71)
A sum of the ﬁrst two commutators (Eqs. 12.69 and 12.70) shows that
[I1y + I2y, HTOCSY] = 2iπJ(I2xI3z − I2zI3x) (12.72)
and a sum of all three commutators (Eqs. 12.69–12.71) shows that
[I1y + I2y + I3y, HTOCSY] = 0 (12.73)
in agreement with Eq. 12.53.
318
12.4.8 Density matrix evolution in the TOCSY experiment
As discussed in Section 12.3, the TOCSY pulse sequence starts by a 90◦ excitation pulse that converts ˆρ(a) = 1
4
(It +κI1z +κI2z +κI3z) =
1
4
It + κ
4
j
Ijz
to
ˆρ(b) = 1
4
(It − κI1y − κI2y − κI3y) = 1
4
It − κ
4
j
Ijy,
which evolves during the incremented evolution period t1. An example of a set of nuclei interacting via couplings described by constants
J12 = J23 = J and J13 = 0 is presented in Section 12.3, here we analyze a general case that evolves (considering only Ijy coherences that
survive the TOCSY mixing) as
ˆρ(c) = −κ
4
j
Cj1Ijy,
where
C11 = cos(Ω1t1) cos(πJ12t1) cos(πJ13t1) =
1
2
cos(Ω1t1)(cos(πJ12t1 − πJ13t1) − cos(πJ12t1 + πJ13t1))
=
1
4
(cos((Ω1 − πJ12 − πJ13)t1) + cos((Ω1 − πJ12 + πJ13)t1) + cos((Ω1 + πJ12 − πJ13)t1) + cos((Ω1 + πJ12 + πJ13)t1))
C21 = cos(Ω2t1) cos(πJ12t1) cos(πJ23t1) =
1
2
cos(Ω2t1)(cos(πJ12t1 − πJ23t1) − cos(πJ12t1 + πJ23t1))
=
1
4
(cos((Ω2 − πJ12 − πJ23)t1) + cos((Ω2 − πJ12 + πJ23)t1) + cos((Ω2 + πJ12 − πJ23)t1) + cos((Ω2 + πJ12 + πJ23)t1))
C31 = cos(Ω3t1) cos(πJ13t1) cos(πJ23t1) =
1
2
cos(Ω3t1)(cos(πJ13t1 − πJ23t1) − cos(πJ13t1 + πJ23t1))
=
1
4
(cos((Ω3 − πJ13 − πJ23)t1) + cos((Ω3 − πJ13 + πJ23)t1) + cos((Ω3 + πJ13 − πJ23)t1) + cos((Ω3 + πJ13 + πJ23)t1)).
(12.74)
The −κ
4
Cj1Ijy components of the density matrix, converted to ˆρ(d) = −κ
4
j k
ajkCj1Iky during the TOCSY mixing period (see
Section 12.3), further evolve during t2 to
ˆρ(t2) = −κ
4
j k
ajkCj1(Ck2Iky − Sk2Ikx), where
C12 = cos(Ω1t2) cos(πJ12t2) cos(πJ13t2) =
1
2
cos(Ω1t2)(cos(πJ12t2 − πJ13t2) − cos(πJ12t2 + πJ13t2))
=
1
4
(cos((Ω1 − πJ12 − πJ13)t2) + cos((Ω1 − πJ12 + πJ13)t2) + cos((Ω1 + πJ12 − πJ13)t2) + cos((Ω1 + πJ12 + πJ13)t2))
C22 = cos(Ω2t2) cos(πJ12t2) cos(πJ23t2) =
1
2
cos(Ω2t2)(cos(πJ12t2 − πJ23t2) − cos(πJ12t2 + πJ23t2))
=
1
4
(cos((Ω2 − πJ12 − πJ23)t2) + cos((Ω2 − πJ12 + πJ23)t2) + cos((Ω2 + πJ12 − πJ23)t2) + cos((Ω2 + πJ12 + πJ23)t2))
C32 = cos(Ω3t2) cos(πJ13t2) cos(πJ23t2) =
1
2
cos(Ω3t2)(cos(πJ13t2 − πJ23t2) − cos(πJ13t2 + πJ23t2))
=
1
4
(cos((Ω3 − πJ13 − πJ23)t2) + cos((Ω3 − πJ13 + πJ23)t2) + cos((Ω3 + πJ13 − πJ23)t2) + cos((Ω3 + πJ13 + πJ23)t2)).
(12.75)
and
S12 = sin(Ω1t2) cos(πJ12t2) cos(πJ13t2) =
1
2
sin(Ω1t2)(cos(πJ12t2 − πJ13t2) − cos(πJ12t2 + πJ13t2))
=
1
4
(sin((Ω1 − πJ12 − πJ13)t2) + sin((Ω1 − πJ12 + πJ13)t2) + sin((Ω1 + πJ12 − πJ13)t2) + sin((Ω1 + πJ12 + πJ13)t2))
S22 = sin(Ω2t2) cos(πJ12t2) cos(πJ23t2) =
1
2
sin(Ω2t2)(cos(πJ12t2 − πJ23t2) − cos(πJ12t2 + πJ23t2))
=
1
4
(sin((Ω2 − πJ12 − πJ23)t2) + sin((Ω2 − πJ12 + πJ23)t2) + sin((Ω2 + πJ12 − πJ23)t2) + sin((Ω2 + πJ12 + πJ23)t2))
S32 = sin(Ω3t2) cos(πJ13t2) cos(πJ23t2) =
1
2
sin(Ω3t2)(cos(πJ13t2 − πJ23t2) − cos(πJ13t2 + πJ23t2))
=
1
4
(sin((Ω3 − πJ13 − πJ23)t2) + sin((Ω3 − πJ13 + πJ23)t2) + sin((Ω3 + πJ13 − πJ23)t2) + sin((Ω3 + πJ13 + πJ23)t2)).
(12.76)
12.4. SUPPORTING INFORMATION 319
Considering the orthogonality of the matrices and the normalization used in our analysis,6 the nonzero traces are
Tr{InxIn+} = 2, Tr{InyIn+} = 2i. (12.77)
Tr{ˆρ(t2) ˆM+} = −Nγ
κ
2 j k
ajkCj1(iCk2 − Sk2) = −iNγ
κ
2 j k
ajkCj1(Ck2 + iSk2) = −iNγ
κ
2 j k
ajkCj1Ek2, (12.78)
where
E12 = C12 + iS12 =
1
4
ei(Ω1−πJ12−πJ13)t2 + ei(Ω1−πJ12+πJ13)t2 + ei(Ω1+πJ12−πJ13)t2 + ei(Ω1+πJ12+πJ13)t2
E22 = C22 + iS22 =
1
4
ei(Ω2−πJ12−πJ23)t2 + ei(Ω2−πJ12+πJ23)t2 + ei(Ω2+πJ12−πJ23)t2 + ei(Ω2+πJ12+πJ23)t2
E32 = C32 + iS32 =
1
4
ei(Ω3−πJ13−πJ23)t2 + ei(Ω3−πJ13+πJ23)t2 + ei(Ω3+πJ13−πJ23)t2 + ei(Ω3+πJ13+πJ23)t2 .
(12.79)
As the previously discussed two-dimensional experiments, TOCSY is also applied so that a hypercomplex 2D spectrum is obtained.
Therefore, acquisition is repeated for each t1 increment with the phase of the radio wave shifted by 90 ◦ (y) during the 90◦ pulse. The
original density matrix
ˆρ(a) = 1
4
(It + κI1z + κI2z + κI3z) = 1
4
It + κ
4
j
Ijz
is then converted to
ˆρ(b) = 1
4
(It + κI1x + κI2x + κI3x) = 1
4
It + κ
4
j
Ijx,
which evolves during t1 to the Ijy components, selected during the TOCSY mixing, with the following modulation: ˆρ(c) = κ
4
j
Sj1Ijy,
where
S12 = sin(Ω1t2) cos(πJ12t2) cos(πJ13t2) =
1
2
cos(Ω1t2)(cos(πJ12t2 − πJ13t2) − cos(πJ12t2 + πJ13t2))
=
1
4
(sin((Ω1 − πJ12 − πJ13)t2) + sin((Ω1 − πJ12 + πJ13)t2) + sin((Ω1 + πJ12 − πJ13)t2) + sin((Ω1 + πJ12 + πJ13)t2))
S22 = sin(Ω2t2) cos(πJ12t2) cos(πJ23t2) =
1
2
sin(Ω2t2)(cos(πJ12t2 − πJ23t2) − cos(πJ12t2 + πJ23t2))
=
1
4
(sin((Ω2 − πJ12 − πJ23)t2) + sin((Ω2 − πJ12 + πJ23)t2) + sin((Ω2 + πJ12 − πJ23)t2) + sin((Ω2 + πJ12 + πJ23)t2))
S32 = cos(Ω3t2) cos(πJ13t2) cos(πJ23t2) =
1
2
cos(Ω3t2)(cos(πJ13t2 − πJ23t2) − cos(πJ13t2 + πJ23t2))
=
1
4
(sin((Ω3 − πJ13 − πJ23)t2) + sin((Ω3 − πJ13 + πJ23)t2) + sin((Ω3 + πJ13 − πJ23)t2) + sin((Ω3 + πJ13 + πJ23)t2)).
(12.80)
The κ
4
Sj1Ijy components of the density matrix, converted to ˆρ(d) = κ
4
j k
ajkSj1Iky during the TOCSY mixing period, evolve
during t2 to
ˆρ(t2) = −κ
4
j k
ajkSj1(Ck2Iky − Sk2Ikx), and
Tr{ˆρ(t2) ˆM+} = Nγ
κ
2 j k
ajkSj1(iCk2 − Sk2) = iNγ
κ
2 j k
ajkSj1(Ck2 + iSk2) = iNγ
κ
2 j k
ajkSj1Ek2, (12.81)
If we introduce relaxation, apply phase correction and perform Fourier transformation in the direct dimension, discard the imaginary
component of the signal, multiply Eq. 12.81 by ”i” and combine it with Eq. 12.78, introduce relaxation and perform Fourier transformation
in the indirect dimension, we obtain a hypercomplex signal with the real component
{Y (ω1, ω2} = Nγ
κ
2 j k
ajkYj1Yk2, (12.82)
6
As discussed in Section 12.4.5, the product operators listed in Tables 12.2–12.5 diﬀer from normalized basis matrices by a factor of√
2, resulting in the factor of 2 in the traces Tr{InxIn+} and Tr{InyIn+}.
320
where
Ynl =
1
4
R2
R
2
2 + (Ωn − πJnn − πJnn − ωl)2
+
1
4
R2
R
2
2 + (Ωn − πJnn + πJnn − ωl)2
+
1
4
R2
R
2
2 + (Ωn + πJnn − πJnn − ωl)2
+
1
4
R2
R
2
2 + (Ωn + πJnn + πJnn − ωl)2
(12.83)
(n, n , n ∈ {1, 2, 3}).
12.4. SUPPORTING INFORMATION 321
Table 12.2: Cartesian basis for a three-spin system: population operators. Symbols ”+” and ”−” stand for +1 and
−1, respectively. All operators have to be divided by
√
2 to obtain orthonormal matrices.
It =
1
2











+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 +











I1z =
1
2











+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 0 0 − 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 −











I2z =
1
2











+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 −











I3z =
1
2











+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 −











2I1zI2z =
1
2











+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 0 0 − 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 +











2I1zI3z =
1
2











+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 0 0 − 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 +











2I2zI3z =
1
2











+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 +











4I1zI2zI3z =
1
2











+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 0 0 − 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 −











Table 12.3: Cartesian basis for a three-spin system: single/triple-quantum operators. Symbols ”+” and ”−” stand
for +1 and −1, respectively. All operators have to be divided by
√
2 to obtain orthonormal matrices.
4I1xI2xI3x =
1
2











0 0 0 0 0 0 0 +
0 0 0 0 0 0 + 0
0 0 0 0 0 + 0 0
0 0 0 0 + 0 0 0
0 0 0 + 0 0 0 0
0 0 + 0 0 0 0 0
0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0











4I1xI2yI3y =
1
2











0 0 0 0 0 0 0 −
0 0 0 0 0 0 − 0
0 0 0 0 0 + 0 0
0 0 0 0 + 0 0 0
0 0 0 + 0 0 0 0
0 0 + 0 0 0 0 0
0 − 0 0 0 0 0 0
− 0 0 0 0 0 0 0











4I1yI2xI3y =
1
2











0 0 0 0 0 0 0 −
0 0 0 0 0 0 + 0
0 0 0 0 0 − 0 0
0 0 0 0 + 0 0 0
0 0 0 + 0 0 0 0
0 0 − 0 0 0 0 0
0 + 0 0 0 0 0 0
− 0 0 0 0 0 0 0











4I1yI2yI3x =
1
2











0 0 0 0 0 0 0 −
0 0 0 0 0 0 + 0
0 0 0 0 0 + 0 0
0 0 0 0 − 0 0 0
0 0 0 − 0 0 0 0
0 0 + 0 0 0 0 0
0 + 0 0 0 0 0 0
− 0 0 0 0 0 0 0











4I1xI2xI3y =
1
2











0 0 0 0 0 0 0 −
0 0 0 0 0 0 + 0
0 0 0 0 0 − 0 0
0 0 0 0 + 0 0 0
0 0 0 − 0 0 0 0
0 0 + 0 0 0 0 0
0 − 0 0 0 0 0 0
+ 0 0 0 0 0 0 0











4I1xI2yI3x =
1
2











0 0 0 0 0 0 0 −
0 0 0 0 0 0 − 0
0 0 0 0 0 + 0 0
0 0 0 0 + 0 0 0
0 0 0 − 0 0 0 0
0 0 − 0 0 0 0 0
0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0











4I1yI2xI3x =
1
2











0 0 0 0 0 0 0 −
0 0 0 0 0 0 − 0
0 0 0 0 0 − 0 0
0 0 0 0 − 0 0 0
0 0 0 + 0 0 0 0
0 0 + 0 0 0 0 0
0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0











4I1yI2yI3y =
1
2











0 0 0 0 0 0 0 +
0 0 0 0 0 0 − 0
0 0 0 0 0 − 0 0
0 0 0 0 + 0 0 0
0 0 0 − 0 0 0 0
0 0 + 0 0 0 0 0
0 + 0 0 0 0 0 0
− 0 0 0 0 0 0 0











322
Table 12.4: Cartesian basis for a three-spin system: single-quantum operators. Symbols ”+” and ”−” stand for +1
and −1, respectively. All operators have to be divided by
√
2 to obtain orthonormal matrices.
I1x =
1
2











0 0 0 0 + 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 +
+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 + 0 0 0 0











2I1xI2z =
1
2











0 0 0 0 + 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 −
+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 − 0 0 0 0











2I1xI3z =
1
2











0 0 0 0 + 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 −
+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 − 0 0 0 0











4I1xI2zI3z =
1
2











0 0 0 0 + 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 +
+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 + 0 0 0 0











I1y =
i
2











0 0 0 0 − 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 −
+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 + 0 0 0 0











2I1yI2z =
i
2











0 0 0 0 − 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 +
+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 − 0 0 0 0











2I1yI3z =
i
2











0 0 0 0 − 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 +
+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 − 0 0 0 0











4I1yI2zI3z =
i
2











0 0 0 0 − 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 −
+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 + 0 0 0 0











I2x =
1
2











0 0 + 0 0 0 0 0
0 0 0 + 0 0 0 0
+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 +
0 0 0 0 + 0 0 0
0 0 0 0 0 + 0 0











2I1zI2x =
1
2











0 0 + 0 0 0 0 0
0 0 0 + 0 0 0 0
+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 −
0 0 0 0 − 0 0 0
0 0 0 0 0 − 0 0











2I2xI3z =
1
2











0 0 + 0 0 0 0 0
0 0 0 − 0 0 0 0
+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 −
0 0 0 0 + 0 0 0
0 0 0 0 0 − 0 0











4I1zI2xI3z =
1
2











0 0 + 0 0 0 0 0
0 0 0 − 0 0 0 0
+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 +
0 0 0 0 − 0 0 0
0 0 0 0 0 + 0 0











I2y =
i
2











0 0 − 0 0 0 0 0
0 0 0 − 0 0 0 0
+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 −
0 0 0 0 + 0 0 0
0 0 0 0 0 + 0 0











2I1zI2y =
i
2











0 0 − 0 0 0 0 0
0 0 0 − 0 0 0 0
+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 +
0 0 0 0 − 0 0 0
0 0 0 0 0 − 0 0











2I2yI3z =
i
2











0 0 − 0 0 0 0 0
0 0 0 + 0 0 0 0
+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 +
0 0 0 0 + 0 0 0
0 0 0 0 0 − 0 0











4I1zI2yI3z =
i
2











0 0 − 0 0 0 0 0
0 0 0 + 0 0 0 0
+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 −
0 0 0 0 − 0 0 0
0 0 0 0 0 + 0 0











I3x =
1
2











0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 0 0 +
0 0 0 0 0 0 + 0











2I1zI3x =
1
2











0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 − 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 − 0











2I2zI3x =
1
2











0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 − 0











4I1zI2zI3x =
1
2











0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 − 0 0 0
0 0 0 0 0 0 0 +
0 0 0 0 0 0 + 0











I3y =
i
2











0 − 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 + 0











2I1zI3y =
i
2











0 − 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 + 0 0 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 − 0 0 0
0 0 0 0 0 0 0 +
0 0 0 0 0 0 − 0











2I2zI3y =
i
2











0 − 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 0 0 − 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 0 0 +
0 0 0 0 0 0 − 0











4I1zI2zI3y =
i
2











0 − 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 − 0 0 0 0 0
0 0 0 0 0 + 0 0
0 0 0 0 − 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 + 0











12.4. SUPPORTING INFORMATION 323
Table 12.5: Cartesian basis for a three-spin system: zero/double-quantum operators. Symbols ”+” and ”−” stand
for +1 and −1, respectively. All operators have to be divided by
√
2 to obtain orthonormal matrices.
2I1xI2x =
1
2











0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 +
0 0 0 0 + 0 0 0
0 0 0 0 0 + 0 0
0 0 + 0 0 0 0 0
0 0 0 + 0 0 0 0
+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0











2I1yI2y =
1
2











0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 −
0 0 0 0 + 0 0 0
0 0 0 0 0 + 0 0
0 0 + 0 0 0 0 0
0 0 0 + 0 0 0 0
− 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0











4I1xI2xI3z =
1
2











0 0 0 0 0 0 + 0
0 0 0 0 0 0 0 −
0 0 0 0 + 0 0 0
0 0 0 0 0 − 0 0
0 0 + 0 0 0 0 0
0 0 0 − 0 0 0 0
+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0











4I1yI2yI3z =
1
2











0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 +
0 0 0 0 + 0 0 0
0 0 0 0 0 − 0 0
0 0 + 0 0 0 0 0
0 0 0 − 0 0 0 0
− 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0











2I1xI3x =
1
2











0 0 0 0 0 + 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 0 0 +
0 0 0 0 0 0 + 0
0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 + 0 0 0 0 0











2I1yI3y =
1
2











0 0 0 0 0 − 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 + 0
0 + 0 0 0 0 0 0
− 0 0 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 − 0 0 0 0 0











4I1xI2zI3x =
1
2











0 0 0 0 0 + 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 − 0
0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 − 0 0 0 0 0











4I1yI2zI3y =
1
2











0 0 0 0 0 − 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 0 0 +
0 0 0 0 0 0 − 0
0 + 0 0 0 0 0 0
− 0 0 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 + 0 0 0 0 0











2I2xI3x =
1
2











0 0 0 + 0 0 0 0
0 0 + 0 0 0 0 0
0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 0 0 0 0 +
0 0 0 0 0 0 + 0
0 0 0 0 0 + 0 0
0 0 0 0 + 0 0 0











2I2yI3y =
1
2











0 0 0 − 0 0 0 0
0 0 + 0 0 0 0 0
0 + 0 0 0 0 0 0
− 0 0 0 0 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 + 0
0 0 0 0 0 + 0 0
0 0 0 0 − 0 0 0











4I1zI2xI3x =
1
2











0 0 0 + 0 0 0 0
0 0 + 0 0 0 0 0
0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 − 0
0 0 0 0 0 − 0 0
0 0 0 0 − 0 0 0











4I1zI2yI3y =
1
2











0 0 0 − 0 0 0 0
0 0 + 0 0 0 0 0
0 + 0 0 0 0 0 0
− 0 0 0 0 0 0 0
0 0 0 0 0 0 0 +
0 0 0 0 0 0 − 0
0 0 0 0 0 − 0 0
0 0 0 0 + 0 0 0











2I1xI2y =
i
2











0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 −
0 0 0 0 + 0 0 0
0 0 0 0 0 + 0 0
0 0 − 0 0 0 0 0
0 0 0 − 0 0 0 0
+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0











2I1yI2x =
i
2











0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 −
0 0 0 0 − 0 0 0
0 0 0 0 0 − 0 0
0 0 + 0 0 0 0 0
0 0 0 + 0 0 0 0
+ 0 0 0 0 0 0 0
0 + 0 0 0 0 0 0











4I1xI2yI3z =
i
2











0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 +
0 0 0 0 + 0 0 0
0 0 0 0 0 − 0 0
0 0 − 0 0 0 0 0
0 0 0 + 0 0 0 0
+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0











4I1yI2xI3z =
i
2











0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 +
0 0 0 0 − 0 0 0
0 0 0 0 0 + 0 0
0 0 + 0 0 0 0 0
0 0 0 − 0 0 0 0
+ 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0











2I1xI3y =
i
2











0 0 0 0 0 − 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 + 0
0 − 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 + 0 0 0 0 0











2I1yI3x =
i
2











0 0 0 0 0 − 0 0
0 0 0 0 − 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 − 0
0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 + 0 0 0 0 0











4I1xI2zI3y =
i
2











0 0 0 0 0 − 0 0
0 0 0 0 + 0 0 0
0 0 0 0 0 0 0 +
0 0 0 0 0 0 − 0
0 − 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 + 0 0 0 0
0 0 − 0 0 0 0 0











4I1yI2zI3x =
i
2











0 0 0 0 0 − 0 0
0 0 0 0 − 0 0 0
0 0 0 0 0 0 0 +
0 0 0 0 0 0 + 0
0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 − 0 0 0 0
0 0 − 0 0 0 0 0











2I2xI3y =
i
2











0 0 0 − 0 0 0 0
0 0 + 0 0 0 0 0
0 − 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 + 0
0 0 0 0 0 − 0 0
0 0 0 0 + 0 0 0











2I2yI3x =
i
2











0 0 0 − 0 0 0 0
0 0 − 0 0 0 0 0
0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 0 0 0 0 −
0 0 0 0 0 0 − 0
0 0 0 0 0 + 0 0
0 0 0 0 + 0 0 0











4I1zI2xI3y =
i
2











0 0 0 − 0 0 0 0
0 0 0 0 0 0 0 0
0 − 0 0 0 0 0 0
0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0
0 0 0 0 0 0 − 0
0 0 0 0 0 0 0 0
0 0 0 0 − 0 0 0











4I1zI2yI3x =
i
2











0 0 0 − 0 0 0 0
0 0 − 0 0 0 0 0
0 + 0 0 0 0 0 0
+ 0 0 0 0 0 0 0
0 0 0 0 0 0 0 +
0 0 0 0 0 0 + 0
0 0 0 0 0 − 0 0
0 0 0 0 − 0 0 0











324
Lecture 13
Magnetic ﬁeld gradients
Literature: The use of magnetic ﬁeld gradients in NMR spectroscopy is nicely reviewed in K11
(in particular, K11.11–11.14, presented more systematically and in more detail than here) and also
presented in L4.7 and L12.4 (with detailed analysis in LA12), C4.3.3., and B19.5. Magnetic resonance
imaging is discussed in B22, the basic ideas of slice selection and frequency encoding are also described
in L12.5. A very nice introduction has been written by Lars G. Hanson (currently available at
http://www.drcmr.dk/).
13.1 Pulsed ﬁeld gradients in NMR spectroscopy
Resonance frequencies of nuclei depend on properties of the molecule (inherent properties of nuclei
and interactions of nuclei with their microscopic environment) and on the external magnetic ﬁeld.
The external magnetic ﬁeld is what we control and the molecular properties is what we study. We
try to keep the external magnetic ﬁeld as homogeneous as possible so that all nuclei feel the same
external ﬁeld B0 and their frequencies are modulated by their molecular environment only. Now we
learn a trick of the spin alchemy which is based on violating this paradigm. It is possible to create
a magnetic ﬁeld that is inhomogeneous in a controlled way. We will discuss an example when the
ﬁeld is linearly increasing along the z axis (Figure 13.1 left, Sections 13.4.1 and 13.4.2). A linear
gradient of magnetic ﬁeld (or simply ”a gradient” in the NMR jargon) is applied in the z direction.
The nuclei close to the bottom of the sample tube feel a weaker magnetic ﬁeld and have a lower
precession frequency, whereas the nuclei close to the top feel a stronger ﬁeld and have a higher
precession frequency in such case. If we label the gradient in the z direction ∂B0/∂z = Gz, the
frequency oﬀset Ω at diﬀerent at diﬀerent positions is given by
Ω (z) = Ω − γGzz. (13.1)
We can say that frequency carries information about position along the z axis.
If the gradient in the z direction is applied when the total magnetization vector rotates in the xy
plane, nuclei at diﬀerent height of the sample acquire diﬀerent frequencies of rotation (an analysis of
the density matrix evolution is presented in Section 13.4.1). In the individual slices, the coherence
is preserved. But after a while, vectors of local transverse polarization (magnetization) rotating at
diﬀerent frequencies in diﬀerent slices of the sample would point to all possible directions and they
would no longer add up to a measurable total magnetization. We can say (i) that the gradient allows
325
326
0
B 0
B
Figure 13.1: Magnetic ﬁeld gradients in the vertical (z, left) and horizontal (y, right) directions. Top, magnetic
induction lines and the corresponding schematic drawings of the gradients (used to present the gradients in pulse
sequence diagrams) are shown in purple and black, respectively. Bottom, local transverse polarizations (magnetization)
at diﬀerent positions in the sample tube for increasing gradients (indicated by the black schematic drawings below the
sample tubes). The arrows representing the transverse polarizations (magnetization) vectors are color-coded so that
blue corresponds to Mx, red corresponds to −Mx, and white corresponds to ±My. The round shape of the gradient
symbols indicates that the gradients were applied with smoothly changing amplitudes as discussed in Section 13.4.2.
13.1. PULSED FIELD GRADIENTS IN NMR SPECTROSCOPY 327
us to distinguish magnetic moments in diﬀerent slices, or (ii) that the gradient destroys the bulk
(net) transverse magnetization. The longitudinal polarizations are not inﬂuenced. We postpone
discussion of the ﬁrst point of view (selectivity introduced by the gradient) to Section 13.2 and now
explore the consequences of the loss of net magnetization.
At the ﬁrst glance, it seems that dephasing of coherences and the consequent loss of the signal
are completely useless and should be avoided in NMR experiments. It is not true, gradients are very
useful if they are applied correctly. The ﬁrst trick is to apply gradients that destroy coherences we
are not interested in. Such gradients have cleaning eﬀects and remove unwanted contributions from
the spectra.
Another trick is to recover the magnetization back. If we apply the same gradient for the same
time, but in the opposite direction (−z) later in the pulse sequence, the magnetic vectors are refocused
and the signal appears again. We see how an echo can be created from two opposite gradients.
There are also other ways of creating gradient echoes, presented in Figure 13.2. Instead of using
two opposite gradients, two identical gradients can be applied during the refocusing echo (described
in Section 10.7), one in the ﬁrst half of the echo and the other one in the other half (echo ”a” in
Figure 13.2). The gradients do nothing else but adding another source of frequency variation, on
the top of the chemical shift and J-coupling eﬀects. Magnetic moments of the nuclei aﬀected by
the 180 ◦
pulse in the middle of the echo get always refocused, no matter what was the origin of the
frequency variability. On the other hand, magnetic moments of nuclei not aﬀected by the 180 ◦
pulse
(e.g., 13
C or 15
N nuclei if radio ways are applied at the proton frequency) feel two identical gradients
and get dephased. We see the selective cleaning eﬀect of the gradient echo, e.g. preserving the Iy
and 2IxSz coherences but destroying unwanted Sx and Sy coherences. Gradients incorporated
into the decoupling echo (described in Section 10.8) have exactly the opposite eﬀect (echo ”b” in
Figure 13.2). In this spirit, gradients are frequently added to the echoes in the pulse sequence to
clean imperfections of the used pulses.
Application of a cleaning gradient and of gradient echoes in a real NMR experiment is presented in
Figure 13.3 (magenta and cyan symbols, respectively). Note that the cleaning (magenta) gradient is
applied when no coherence (transverse polarization) should contribute to the density matrix (ˆρ(d) =
It − κ1
2
2IzSz, cf. the magnetic moment distribution at ”d” in Figure 13.3). The cyan gradients are
applied during the simultaneous echoes and refocus coherences that evolve due to the J-coupling.
Figure 13.3 also shows another, more tricky use of gradients implemented in an improved version
of the HSQC experiment (blue/red and green symbols). The idea is to apply one gradient during
the time when the desired coherence rotates in the operator space (and the corresponding transverse
polarization rotates in the real space) with the frequency of 13
C (or 15
N) and the other gradient
during the time when the total magnetization rotates with the frequency of protons. In order to
do it, we must generate a space in the pulse sequence by including a refocusing echo (a typical
example of using refocusing echoes in situation when we need more space but do not want to change
evolution). The two applied gradients are not identical, they change the magnetic ﬁelds to diﬀerent
extent. The deviations of the ﬁeld must be exactly in the ratio of resonance frequencies of 13
C
and 1
H. Then, the gradients form a heteronuclear gradient echo. Note what happens to various
coherences of protons. The coherence which contributed to the polarization transfer to 13
C and back
experiences the gradients as an echo and gets refocused. On the other hand, population of protons
whose polarization was not transferred to 13
C (e.g. protons of water that are not 13
C-bonded) feels
328
τττ τ
a b
z
Unwanted
Wanted
G
Figure 13.2: Gradient echoes. Black rectangles and round shapes indicate pluses of radio waves and magnetic ﬁeld
gradients, respectively. Evolution of the phase of the desired and undesired transverse coherence (describing direction
of the corresponding transverse polarization) is shown as green and red lines, respectively. Values of the phase at
diﬀerent positions in the sample tubes correspond to the distances of the green and red lines from the central black
line.
13.2. MAGNETIC RESONANCE IMAGING 329
1
4J
1
4J
H1
N15
C13
or
1
4J
1
4J
t2
t1
y
Gz
GARP
x y/
e hfb c da g i
Figure 13.3: Gradient enhanced HSQC experiment. Cleaning gradients and gradient echoes are shown in magenta
and cyan, respectively. The heteronuclear gradient echo consists of a gradient shown in green, applied during the
refocusing echo between time instants ”f” and ”g” (when density matrix evolves with the 13
C or 15
N frequency),
and of another gradient applied during the last echo (when density matrix evolves with the proton frequency). The
latter gradient is shown in blue for recording the real component of hypercomplex data and in red for recording the
imaginary component.
just two gradients of diﬀerent strengths and its coherence is destroyed. The gradient echo makes the
experiment selective for protons correlated with carbons and suppresses the signal of uncorrelated
protons.
13.2 Magnetic resonance imaging
We now explore selectivity of gradients, mentioned in Section 13.1. During a ﬁeld gradient in the z
direction, the actual precession frequency depends on the position of the molecule along the z axis.
This relationship can be used to selectively acquire NMR signal only from molecules in a certain
slice perpendicular to the z axis. As discussed in Section 13.4.1, the pulse sequence presented in the
right panel of Figure 13.4 allows us to detect transverse magnetization in a vertical slice of a given
thickness (the signal is detected from the whole slice but not from outside the slice). The gradient
can be also applied in the x and y directions (right part of Figure 13.1). It is therefore possible to
select signal in sagittal, coronal, and axial slices of a human body as shown in Figure 13.5.
Gradients also allow us to investigate variations of local magnetization inside the selected slice.
One possibility, called frequency encoding and presented in Figure 13.6, is to apply a gradient during
signal acquisition and to convert the frequency of the Fourier-transformed spectrum to the position
information. In order to understand the principle, we recall that the direction of the transverse
magnetization is described by a phase φ having an initial value φ0 and increasing in time as
φ(t) = φ0 + Ω t = φ0 + (Ω − γGxx)t. (13.2)
330
H1
Gz
a
H1
Gz
slice select
a b c
Figure 13.4: Slice selection pulse sequence: the basic idea (left) and real application (right). Gradients of B0 in
the z direction are shown in green. The 90◦
radio wave pulses are shown schematically as ﬁlled black rectangles, the
actual modulation of the radio-wave amplitude is depicted in cyan.
Figure 13.5: Selection of sagittal, coronal, and axial slices by Gx, Gy, and Gz gradients, respectively.
13.2. MAGNETIC RESONANCE IMAGING 331
We measure the magnetization in discrete time intervals separated by time step ∆t
φ(t) = φ0 + (Ω − γGxx)n∆t. (13.3)
In the standard NMR experiment, no gradient is applied during signal acquisition (Gx = 0).
Fourier transformation then converts the signal varying in time (with the phase growing as Ωn∆t)
to a spectrum describing how the signal depends on frequency ω, which is a quantity reciprocal to
time (ω = 2π/t).
In the frequency encoding imaging, we work on resonance (Ω = 0) and typically set φ0 to a
convenient value. The signal varies in time as −γGxxn∆t. However, we are not interested in how
signal changes in time but in its variation in space. Therefore, we can rearrange (−γGxx)t to
x(−γGxt) = xkx, where kx has the same meaning as the x component of the wave vector describing
propagation of waves in space. We can treat the measured signal as a set of intensities acquired for
a series of incremented values kx (with the step ∆kx = −γGx∆t). Therefore, we can write
φ(kx) = φ0 + xn∆kx (13.4)
and apply Fourier transformation to convert the dependence on kx to the dependence on x, which
is a quantity reciprocal to kx (x = 2π/kx). More details are presented in see Section 13.4.6.
Another option, called phase encoding and presented in Figure 13.7, is to vary the strength of a
gradient applied for a constant time τx. In this case, the phase can be still described by Eq. 13.4, but
the incrementation step is given by ∆kx = −γ∆Gxτx. If we combine frequency encoding using the
gradient Gx with phase encoding using the gradient Gy, as depicted in the left panel of Figure 13.7,
we can image signal variation in the whole selected two-dimensional slice (see Section 13.4.7 for
details).
The slice-selective imaging techniques, discussed above, have one disadvantage. It is diﬃcult to
select a very thin slice. Therefore, the imaging has limited resolution in one dimension. An alternative
approach exists that is not restricted in this sense. It is possible to apply gradient encoding to all
three dimensions. An example of such a pulse sequence is shown in the right panel in Figure 13.7.
However, such a high-resolution 3D imaging is considerably more time consuming. To save time,
shorter that 90◦
pulses are often applied. Such short pulses leave a large portion of magnetization in
the z direction. Therefore, a next short pulse, generating some transverse polarization can be applied
immediately after signal acquisition without the need to wait for the return to the equilibrium. In this
fashion, several acquisitions may be performed in one TR period before the longitudinal magnetization
is completely ”consumed”. This signiﬁcantly reduces the measurement time.
Reconstruction of the two-dimensional image from frequency- and phase-encoded data can be
described in the same manner. Both frequency and phase encoding gradient introduce variation of
the magnetic ﬁeld, and consequently of the precession frequency, in the selected slice (xy plane in
our example). Linear variations of the magnetic ﬁeld create ”waves” of phases of the transverse
polarization, as shown in Figures 13.8 and 13.9. The waves propagate in the x or y direction,
respectively, if the gradients Gx and Gy are applied separately. Simultaneous application of both
gradients generates waves spreading in a direction given by the relative ratio of the gradient strengths
(Figure 13.8B). Each imaging experiment consists of a series of measurements with diﬀerent setting
of the gradients. Each combination of the gradients can be described by two parameters, kx and
332
H1
Gz
Gy
Gx
slice select
read
a b c
H1
Gz
t2t1
read
Gy
Gx
slice select
encode
a b c g
Figure 13.6: Pulse sequences allowing frequency encoded 1D (left) and 2D (right) imaging in the selected slice.
Gradients of B0 in the x, y, and z direction are shown in blue, red, and green, respectively. The 90◦
radio wave pulses
are shown schematically as ﬁlled black rectangles, the actual modulation of the radio-wave amplitude is depicted in
cyan.
1
H
Gz
TR
TE
read
encode
Gy
Gx
slice select
t
gfd ea b c h
H1
TE
TR
Gy
encode
xG
encode
t
x
y
τ
τ
Gz
read
b c hf ga ed
Figure 13.7: Examples of slice-selective 2D imaging experiment, combining phase and frequency encoding (left) and
of 3D phase encoding imaging experiment (right). The frequency and phase encode gradients are labeled ”read” and
”encode” respectively. TE and TR are echo time and repetition time, respectively. Gradients of B0 in the x, y, and
z direction are shown in blue, red, and green, respectively. The radio wave pulses (90◦
in the left panel and 10◦
in
the right panel) are shown schematically as ﬁlled black rectangles, the actual modulation of the radio-wave amplitude
is depicted in cyan. The 180◦
pulse in the left panel creates an echo that refocuses evolution of residual Ω (due to
ﬁeld inhomogeneity). The ﬁrst Gx gradient in the right panel allows to reduce phase encoding to a limited range of x.
Therefore fewer phase encoding steps can be used without problems with aliasing (longer ∆kx can be used to achieve
good resolution by having larger Nx∆kx with lower Nx).
13.2. MAGNETIC RESONANCE IMAGING 333
ky, that can be combined in a vector (vector k in Figure 13.8B). The values of kx and ky vary as
the acquisition time proceeds in the case of the frequency encoding gradient, or as the strength of
the phase encoding gradient is incremented (see Sections 13.4.6 and 13.4.7 for details). Each panel
in Figure 13.9 represents a phase wave for a particular value of kx and ky. In terms of the phase
waves, the direction of k deﬁnes the direction of the wave propagation and the magnitude of k says
how dense the waves are. We see that k behaves as a wave vector describing any other physical
waves (e.g. electromagnetic waves), and we can expect that signal reconstruction is based on similar
principles as analysis of diﬀraction patterns providing structure of the diﬀracting objects.
Instead of describing the image reconstruction technically (it is done in Sections 13.4.6 and 13.4.7),
here we try to get a general idea by inspecting Figure 13.8. For the sake of simplicity, we assume that
all observed nuclei have the same chemical shift. The chemical shift diﬀerences (e.g. between aliphatic
protons of lipids and protons in water) result in artifacts, displacements of the apparent positions
of the observed molecules in the image. Figure 13.8A shows transverse polarization phases in the
absence of gradients. The phases are aligned at the beginning of the experiment and move coherently,
i.e., do not move at all in the coordinate frame rotating with the frequency −ωradio (Figure 13.8A).
In the absence of the gradients (kx = ky = 0), the coherent arrangement of the phases depicted in
Figure 13.8A does not change (except for relaxation eﬀects and technical imperfections). We therefore
record a signal proportional to the number of observed nuclei in the slice and to the magnetic moment
distribution in equilibrium (our constant κ). Application of gradients redistributes the phases as
shown in Figure 13.8B. Local transverse polarizations (magnetizations) pointing in opposite directions
at diﬀerent sites of the slice cancel each other, and the net transverse magnetization of the whole
slice is very small (equal to zero in Figure 13.8B). We see that the gradients greatly reduce signal in
slices with a uniform distribution of magnetic moments (of the spin density). What happens if the
magnetic moments (the spin density) are not distributed uniformly, but have some structure? For
example, if bones (containing much less protons than soft tissues) intersect the slice? If the structure
is periodic (e.g. like ribs) and if it has a period and orientation matching the period and direction
of the phase waves, the signal may greatly increase because protons are concentrated in the regions
of the slice with a similar phase of transverse polarization (magnetization). An example is shown in
Figure 13.8C.
The example of the Figure 13.8C represents an extreme case of signal enhancement. Most structures
in the human body are not periodic as the ribs. But any deviation from uniform distribution of
protons perturbs the regular patterns of phase waves resulting in net transverse magnetization close
to zero. Each wave interferes with the given structure diﬀerently. Therefore, the signal obtained for
diﬀerent kx and ky varies. Mathematically, the set of all values of kx, ky (and kz in some experiments)
forms a two-dimensional (or three-dimensional) space, called k-space. Each combination of gradients
represents one point in the k-space. If we plot the values of the signal obtained for diﬀerent gradient
settings in the order of increasing kx and ky, we obtain a picture of the imaged object in the k-space.
The task of image reconstruction is to convert this picture into dependence of the spin density on
the coordinates x and y. A very simple example is provided in Figure 13.10. Although the signal
is calculated only for 25 diﬀerent k values in Figure 13.10, it exhibits some general features. For
example, comparison of data collected for shapes with increasing complexity documents that higher
values of kx, ky (data further from the middle of the k-space) reﬂect ﬁner structural details.
In reality, there is a straightforward relation between the shape of the imaged object in a real
334
A
x
y
B
k
C
Figure 13.8: Coherent phase distribution (A) and a phase wave generated by the gradients Gx and Gy in selected
axial slice with a uniform spin density (B) and with a low-spin density structure (C). The wave vector k is depicted
in Panel B.
space (described by coordinates x, y, and z in some experiments) and the shape of the object’s
picture in the k-space (described by ”coordinates” kx, ky, and kz in some experiments). As shown
in Sections 13.4.6 and 13.4.7, the dependence of the signal on the distribution of magnetic moments
(spin density) in the x, y plane (and in space in general) has a form of the Fourier transformation.
Therefore, the distribution of spin density, deﬁning the shape of the object, can be calculated simply
by applying the inverse Fourier transformation.
13.3 Weighting
NMR spectroscopy of diluted chemical compounds is often limited by the inherently low sensitivity
of NMR experiments. However, the highest possible sensitivity is not the ultimate goal of imaging.
It is much more important to obtain a high contrast. It does not help us to get a very bright image
of the human body if we cannot distinguish individual organs and ﬁner structural details. So far,
we discussed how magnetic resonance imaging reﬂects the variations in the local concentration of
magnetic moments (spin density). But the signal is also inﬂuenced by relaxation. Relaxation gives
us a unique opportunity to distinguish regions of the body where protons are present in similar concentrations
but in molecules with diﬀerent dynamics and consequently diﬀerent relaxation. Among
numerous, often sophisticated imaging techniques, three major approaches can be recognized.
• Spin density weighting. The highest possible signal, depending only on the spin density is obtained
if the experiment starts from thermodynamic equilibrium and the transverse relaxation
does not decrease the signal signiﬁcantly. This is the case if (i) the time between the individual
measurements is much longer than 1/R1 (where R1 is the relaxation constant of longitudinal
relaxation which drives the system back to the equilibrium) and (ii) the duration of the experiment
is much shorter than 1/R2 (where R2 is the relaxation constant of transverse relaxation
which is the source of the signal decay). Therefore, the spin density weighted experiments are
run with a short echo time TE and long repetition time TR.
13.3. WEIGHTING 335
A B C
D E F
G H I
Figure 13.9: Phase waves generated in the selected axial slice by the gradients Gx and Gy.
336
A
0 0 0 0 0
0 0 0 0 0
0 0 0 0 0
0 0 0 0 0
0 0 0 016
B
−4
0 0 0 0 0
0 0 0 0 0
0 0 0 0 0
0 0 0 0 0
80 0−4
C
0 0 0 0 0
0 0 0 0 0
+1 +1 +1 +1 +1
+1+1+1+1+1
10+2 −2 −2 +2
D
−1 +1 −1 −1 +1
−1 −1 −1 −1 −1
+1 +1 +1 +1 +1
+1 +1 +1 +1 +1
11−1 −1+3 +3
Figure 13.10: A simple example of image reconstruction. In each panel, phase waves (left) and obtained relative
signal intensities (right) are shown for 25 diﬀerent gradient settings (25 small squares). The phases are presented as
arrows, color-coded as in Figure 13.1 (+Mx in blue, −Mx in red, ±My in white). The signal intensities are displayed
as numbers and corresponding colors (positive and negative intensities are shown in blue and red). Imaging of an
object with uniform proton density (A) and with structures of three diﬀerent shapes (B–D) is presented. Matter with
high and low proton density is shown in cyan and yellow, respectively. The depicted waves correspond to the kx
values of 2∆kx, ∆kx, 0, −∆kx, and −2∆kx (top-to-bottom) and to ky values of 2∆ky, ∆ky, 0, −∆ky, and −2∆ky
(right-to-left).
13.3. WEIGHTING 337
• T2 weighting. The signal strongly depending on the relaxation constant R2 (or on the relaxation
time T2 = 1/R2) is obtained if (i) the time between the individual measurements is much longer
than 1/R1 and (ii) the duration of the experiment is such that the diﬀerences in the factors
e−R2TE
of diﬀerent molecules are most pronounced. Therefore, the T2 weighted experiments are
run with a long echo time TE and long repetition time TR. As R2 is mostly given by J(0) and
J(0) is proportional to the rotational correlation time (cf. Eq 2.6), the T2-weighted signal is
most attenuated for slowly reorienting molecules (molecules in ﬁrm tissues).
• T1 weighting. The signal strongly depending on the relaxation constant R1 (or on the relaxation
time T1 = 1/R1) is obtained if (i) the time between the individual measurements is comparable
to 1/R1 and (ii) the duration of the experiment is is much shorter than 1/R2. Therefore,
the T1 weighted experiments are run with a short echo time TE and short repetition time
TR. In contrast to R2, the major contribution to R1 is J(ω0), which has a maximum (in the
approximation of Eq. 9.32) for a rotational correlation time equal to 1/ω0, i.e. 3.75 ns at 1 T or
1.25 ns at 3 T. Therefore, the highest contrast of T1-weighted signal is obtained for molecules
with intermediate (low-nanosecond) dynamics (molecules in semi-ﬁrm tissues).
338
13.4 SUPPORTING INFORMATION
13.4.1 Coherence dephasing and slice selection by ﬁeld gradients
Quantitatively, the magnetic ﬁeld gradient in the direction z is deﬁned as Gz = ∆B0/∆z. The same applies to gradients applied in other
directions: Gx = ∆B0/∆x, Gy = ∆B0/∆y. Note that all gradient describe linear perturbations of the vertical magnetic ﬁeld B0. As the
precession frequency ω0, and consequently the frequency oﬀset Ω = ω0 −ωrot = ω0 −(−ωradio), are proportional to B0, the gradient makes
the frequency dependent on the position:
Ω (x) = Ω − γGxx, Ω (y) = Ω − γGyy, Ω (z) = Ω − γGzz, (13.5)
where we set the origins of the axes x, y, z at the place where the gradient has no eﬀect. For the sake of simplicity, we analyze the
eﬀect of gradients for magnetic moments not inﬂuenced by interactions with electrons and other magnetic dipoles (i.e., we assume that
all molecules have the same chemical shift and the dipole-dipole and J couplings are not present or can be neglected). We start with a
density matrix describing an ensemble of magnetic moments uniformly rotated by a 90◦ radio wave pulse from the equilibrium distribution
ˆρ(0) = It − κIy (Figure 13.4). Then we apply a gradient, e.g. in the z direction. The density matrix at evolves as
ˆρ(t) = It − κIycos(Ω (z)t) + κIxsin(Ω (z)t) = It − κIycos((Ω − γGzz)t) + κIxsin((Ω − γGzz)t) = It − κIycos φ(z, t) + κIxsin φ(z, t),
(13.6)
where the horizontal bar indicates ensemble averaging. The expected value of the transverse magnetization is
M+ (t) = Tr{ˆρ(t) ˆM+} = −N
γ2 2B0
2kBT
Tr{IyI+}cos((Ω − γGzz)t) + N
γ2 2B0
2kBT
Tr{IxI+}sin((Ω − γGzz)t)
= N
γ2 2B0
4kBT
ei π
2 ei(Ω−γGzz)t, (13.7)
Performing phase correction and including relaxation,
M+ (t) = N
γ2 2B0
4kBT
e−R2t
ei(Ωt−γGzz)t = N
γ2 2B0
4kBT
e−R2t
cos((Ω − γGzz)t) + i sin((Ω − γGzz)t) . (13.8)
If the gradient is suﬃciently strong, the sine and cosine terms oscillate rapidly with frequencies depending on z and their ensemble
averages tend to zero. If the gradient is applied during a 90◦ excitation pulse, the resonance condition is fulﬁlled exactly only for a certain
value of γGzz that matches the chemical shift (position-independent frequency oﬀset Ω), Ω = γGzz. Therefore, the strongest signal is
obtained from a slice of the signal at
z =
Ω
γGz
=
ω0 − ωrot
γGz
=
ω0 − (−ωradio)
γGz
. (13.9)
The reduction of the signal form other vertical positions is given by the excitation proﬁle of the 90◦ pulse. Thickness of the slice
depends on the value of Gz (the stronger Gz the thinner the slice) and the position z can be varied by changing the carrier frequency of
the radio wave ωradio. The dependence of the magnetization on the position in the investigated sample is used in diﬀerent manners in
NMR spectroscopy and imaging:
• In NMR spectroscopy of samples with relatively low concentration of the studied substance, the signal, obtained only from a
selected slice would be very low, often below the limit of detection. Usually, 90◦ pulse applied in the absence of gradients creates
transverse magnetization in the whole sample and a gradient is applied later to de-phase magnetization at diﬀerent positions. This
is the principle of the action of cleaning gradients and gradient echoes destroying coherences that are not interesting. The use of
gradients in NMR spectroscopy is discussed in Section 13.1, and further analyzed in Sections 13.4.2–13.4.3.
• If the concentration of the detected compound is suﬃciently high and the transverse magnetization in the selected slice is observable,
signal from diﬀerent slices can be compared and further investigated. This is interesting especially if the number of magnetic
moments per volume element, or the spin density N(z) varies in the z direction. Detection of the magnetization dependent on
spin density in individual selected slices is the basis of slice-selective imaging, described in Sections 13.4.5ﬀ.
13.4. SUPPORTING INFORMATION 339
13.4.2 Field gradients with smooth amplitude
In NMR pulse sequences, the gradient is usually not switched on and oﬀ suddenly. Instead, the linear magnetic ﬁeld perturbation is
increased and decreased in a smooth fashion, following for example a function sin(πt/τz) for a gradient that starts at t = 0 and is ﬁnished
at t = τz. In such a case, the total rotation angle of the transverse polarization (the phase φ) is
φ(z, t) = Ωt − γz
∆B0(t = τz/2)
∆z
τzˆ
0
sin
πt
τz
dt. (13.10)
Because the ratio
t´
0
f(t )dt
t
(13.11)
is constant (deﬁnition of the average value of f(t)), it is convenient to absorb the eﬀect of the smooth amplitude of the gradient into
the value of Gz:
φ(z, τz) = Ωτz − γz
∆B0(t = τz/2)
∆z
τz´
0
sin πt
τz
dt
τz
Gz
τz = Ωτz − γGzzτz. (13.12)
The equations describing actions of gradients can be modiﬁed for the gradients with a smooth amplitude (shaped gradients) by
changing t to τz.
13.4.3 Coherence selection by pulsed-ﬁeld gradients
Unwanted coherences can be suppressed by cleaning gradients and gradient echoes. The principle of cleaning gradients (e.g. the magenta
gradient in Figure 13.3) is fairly simple. Phases of magnetizations corresponding to coherence evolving during the gradient at diﬀerent
vertical positions in the sample are spread to the whole xy plane.
Product operator analysis of gradient echoes is also straightforward. Let us assume that the density matrix at the beginning of the
echoes presented in Figure 13.2 is
ˆρ(0) = It −
1
2
κ1Iy −
1
2
κ2Sy, (13.13)
where Iy is the desired coherence and Sy is the undesired one. The density matrix at the end of the ﬁrst delay τ in Figure 13.2 is
(cf. Eq. 13.6)
ˆρ(t) = It−
1
2
κ1cos(Ωτ − γ1Gzzτz) +
1
2
κ1Ixsin(Ωτ − γ1Gzzτz)−
1
2
κ2cos(Ωτ − γ2Gzzτz) +
1
2
κ2Ixsin(Ωτ − γ2Gzzτz)
= It−
1
2
κ1Iyc11 +
1
2
κ1Ixs11−
1
2
κ1Syc21 +
1
2
κ1Sxs21. (13.14)
The following 180◦ radio-wave pulse in Figure 13.2a, inﬂuencing only the wanted coherences converts the density matrix to
ˆρ(t) = It+
1
2
κ1Iyc11 +
1
2
κ1Ixs11−
1
2
κ1Syc21 +
1
2
κ1Sxs21, (13.15)
which evolves during the second delay τ to
ˆρ(t) = It+
1
2
κ1Iy(c11c12 + s11s12) +
1
2
κ1Ix(c11s12 − s11c12)−
1
2
κ1Sy(c21c22 − s21s22) +
1
2
κ1Sx(c21s22 + s21c22). (13.16)
As the second gradient in Figure 13.2a is the same as the ﬁrst one,
c11c12 + s11s12 = c2
11 + s2
11 = 1 c11s12 − s11c12 = c11s11 − c11s11 = 0, (13.17)
c21c22 − s21s22 = c2
21 − s2
21 c21s22 + s21c22 = øˆρ(b) =
1
2
It −
κ
2
I1y −
κ
2
I2yverlinec21s21 + c21s21, (13.18)
and
340
ˆρ(t) =
1
2
κ1Iy−
1
2
κ2Sycos(Ω(2τ) − γ2Gzz(2τz)) +
1
2
κ2Sxsin(Ω(2τ) − γ2Gzz(2τz)), (13.19)
where the red terms provide only a very small signal in the selected slice. The Sx and Sy symbols may represent a nucleus we are
not interested in, or a portion of the coherence of the observed nucleus that is not refocused due to the oﬀset eﬀects. The cyna gradient
echoes in Figure 13.3 represent such an application.
Analysis of the gradient echo in Figure 13.2b is similar. The 180◦ radio-wave pulse following the ﬁrst delay τ in Figure 13.2b inﬂuences
only the unwanted coherences and converts the density matrix to
ˆρ(t) = It−
1
2
κ1Iyc11 +
1
2
κ1Ixs11+
1
2
κ1Syc21 +
1
2
κ1Sxs21, (13.20)
which evolves during the second delay τ to
ˆρ(t) = It−
1
2
κ1Iy(c11c12 − s11s12) +
1
2
κ1Ix(c11s12 + s11c12)+
1
2
κ1Sy(c21c22 + s21s22) −
1
2
κ1Sx(c21s22 − s21c22). (13.21)
As the second gradients in Figure 13.2b has the opposite eﬀect than the ﬁrst one,
c11c12 − s11s12 = cos(Ωτ − γ1Gzzτz) cos(Ωτ + γ1Gzzτz) − sin(Ωτ − γ1Gzzτz) sin(Ωτ + γ1Gzzτz) = cos(2Ωτ), (13.22)
c11s12 + s11c12 = cos(Ωτ − γ1Gzzτz) sin(Ωτ + γ1Gzzτz) + cos(Ωτ − γ1Gzzτz) sin(Ωτ + γ1Gzzτz) = sin(2Ωτ), (13.23)
c21c22 + s21s22 = cos(Ωτ − γ1Gzzτz) cos(Ωτ + γ1Gzzτz) + sin(Ωτ − γ1Gzzτz) sin(Ωτ + γ1Gzzτz) = cos(2γ1Gzzτz), (13.24)
c21s22 − s21c22 = cos(Ωτ − γ1Gzzτz) sin(Ωτ + γ1Gzzτz) − sin(Ωτ − γ1Gzzτz) cos(Ωτ + γ1Gzzτz) = sin(2γ1Gzzτz), (13.25)
and
ˆρ(t) = −
1
2
κ1Iy cos(2Ωτ) +
1
2
κ1Ix sin(2Ωτ)+
1
2
κ2Sycos(2γ2Gzzτz) +
1
2
κ2Sxsin(2γ2Gzzτz), (13.26)
where the red terms again provide only a negligible signal (the averaged sines and cosines tend to zero because the arguments range
from −pi to π). Note that the chemical shift evolution is not refocused because no 180◦ pulse aﬀects the desired coherences in Figure 13.2b.
It suggests that the gradient echo presented in Figure 13.2b can be used to select the desired coherences during the incremented delays
introducing the indirect time dimension (t1).
13.4.4 Pulsed-ﬁeld gradients and frequency discrimination
The green and blue gradients in Figure 13.3 (gradient-enhanced HSQC experiment) represent a gradient echo that selects only the coherences
that evolve with the frequency of 13C or 15N in t1 and with the proton frequency in t2. Following the analysis presented in Section 11.5,
the density matrix at point f in Figure 13.3 is
ˆρ(f) = 1
2
It + 1
2
κ1 (c212IzSy − s212IzSx) + 1
2
κ2 (c21Sy − s21Sx).
The sequence in Figure 13.3 continues by a refocusing echo with the green gradient. The analysis in Section 11.5 shows that the red or
green terms do not contribute to the observed signal. Therefore, we analyze here only the fate of the blue terms. Combination of the
chemical shift evolution during t1 and during the gradient echo results in
ˆρ(g) = 1
2
κ1(cos(Ω2t1)sin(γ2Gz1τz1)+sin(Ω2t1)cos(γ2Gz1τz1))2IzSx+ 1
2
κ1(sin(Ω2t1)sin(γ2Gz1τz1)−cos(Ω2t1)cos(γ2Gz1τz1))2IzSy+. . .
= 1
2
κ1sin(Ω2t1 + γ2Gz1τz1)2IzSx − 1
2
κ1cos(Ω2t1 + γ2Gz1τz1)2IzSy + . . .
The following two 90◦ pulses with x phase convert −2IzSy to 2IySz, which evolves to Ix during the second INEPT, and 2IzSx to
a multiple-quantum coherence 2IySx, which does not contribute to the signal. The chemical shift evolution during the blue gradient
combines with the chemical shift evolution during t2 and results in
ˆρ(t2) = 1
2
κ1cos(Ω2t1 + γ2Gz1τz1)cos(Ω1t2 + γ1Gz2τz2)Ix − 1
2
κ1cos(Ω2t1 + γ2Gz1τz1)sin(Ω1t2 + γ1Gz2τz2)Iy + . . .
= 1
4
κ1(cos(Ω2t1 + Ω1t2 + γ2Gz1τz1 + γ1Gz2τz2) + cos(Ω2t1 − Ω1t2 + γ2Gz1τz1 − γ1Gz2τz2))Ix
+1
4
κ1(−sin(Ω2t1 + Ω1t2 + γ2Gz1τz1 + γ1Gz2τz2) + sin(Ω2t1 − Ω1t2 + γ2Gz1τz1 − γ1Gz2τz2))Iy + . . .
Calculating the trace of ˆρ(t2) ˆM+, proportional to ˆρ(t2)I+, shows that the observable signal is modulated as
1
4
κ1(cos(Ω2t1 + Ω1t2 + γ2Gz1τz1 + γ1Gz2τz2) + cos(Ω2t1 − Ω1t2 + γ2Gz1τz1 − γ1Gz2τz2))
+ i
4
κ1(−sin(Ω2t1 + Ω1t2 + γ2Gz1τz1 + γ1Gz2τz2) + sin(Ω2t1 − Ω1t2 + γ2Gz1τz1 − γ1Gz2τz2))
= 1
4
e+i(Ω2t1)e−i(Ω1t2)e+i(γ2Gz1τz1−γ1Gz2τz2) + e−i(Ω2t1)e−i(Ω1t2)e−i(γ2Gz1τz1+γ1Gz2τz2) .
The green and blue gradients make an echo if +γ2Gz1τz1 − γ1Gz2τz2 = 0 (note that ˆρ evolves with the chemical shift Ω2, proportional
13.4. SUPPORTING INFORMATION 341
to γ2, during the green gradient, but with the chemical shift Ω1, proportional to γ1, during the blue gradient). With such setting of the
gradients, the signal is modulated as
e+i(Ω2t1)
e−i(Ω1t2)
+ e−i(Ω2t1)
e−i(Ω1t2)
e−i(γ2Gz1τz1+γ1Gz2τz2)
, (13.27)
where the red term results only in a very weak signal in the selected slice. This is a remarkable result, showing that frequency
discrimination is achieved without the need to repeat the acquisition with a phase of a 90◦ pulse shifted by 90 ◦ (note that Fourier
transform of ei(Ωt) is a signal with only one peak at the right frequency Ω). However, Fourier transformation of the obtained signal yields
phase-twisted peaks, as described in Section 9.5.1. Therefore, the acquisition is repeated while shifting a phase of the 90◦ pulse following
the green gradient by 90 ◦ (the red mark y in Figure 13.3). In the repeated acquisition, the two 90◦ pulses convert 2IzSx to 2IzSy,
which evolves to Ix during the second INEPT, and 2IzSy to a multiple-quantum coherence 2IySy, which does not contribute to the
signal. The chemical shift evolution during the second gradient (labeled in red for the repeated experiment) and during t1
ˆρ(t2) = 1
2
κ1sin(Ω2t1 + γ2Gz1τz1)cos(Ω1t2 + γ1Gz2τz2)Ix − 1
2
κ1sin(Ω2t1 + γ2Gz1τz1)sin(Ω1t2 + γ1Gz2τz2)Iy + . . .
= 1
4
κ1(sin(Ω2t1 + Ω1t2 + γ2Gz1τz1 + γ1Gz2τz2) + sin(Ω2t1 − Ω1t2 + γ2Gz1τz1 − γ1Gz2τz2))Ix
+1
4
κ1(cos(Ω2t1 + Ω1t2 + γ2Gz1τz1 + γ1Gz2τz2) − cos(Ω2t1 − Ω1t2 + γ2Gz1τz1 − γ1Gz2τz2))Iy + . . .
Calculating the trace of ˆρ(t2) ˆM+, proportional to ˆρ(t2)I+, shows that the observable signal is modulated as
1
4
κ1(sin(Ω2t1 + Ω1t2 + γ2Gz1τz1 + γ1Gz2τz2) + sin(Ω2t1 − Ω1t2 + γ2Gz1τz1 − γ1Gz2τz2))
+ i
4
κ1(cos(Ω2t1 + Ω1t2 + γ2Gz1τz1 + γ1Gz2τz2) − cos(Ω2t1 − Ω1t2 + γ2Gz1τz1 − γ1Gz2τz2))
= 1
4
e−i(Ω2t1)e−i(Ω1t2)e−i(γ2Gz1τz1+γ1Gz2τz2) − e+i(Ω2t1)e−i(Ω1t2)e+i(γ2Gz1τz1−γ1Gz2τz2) eiπ/2.
Now the red gradient is applied in the opposite direction than the blue one. The green and red gradients make an echo if γ2Gz1τz1 +
γ1Gz2τz2 = 0. With such setting of the gradients and after correcting the unimportant phase shift by π/2 (the last exponential term), the
signal is modulated as
e−i(Ω2t1)
e−i(Ω1t2)
− e+i(Ω2t1)
e−i(Ω1t2)
e+i(γ2Gz1τz1−γ1Gz2τz2)
, (13.28)
where the red term again results only in a very weak signal in the selected slice. Note that the signal acquired with the ”blue” phase of
the 13C (or 15N) pulse x (Eq. 13.27) is modulated by frequencies with the opposite signs in the direct and direct dimensions, whereas the
signal acquired with the ”red” phase of the 13C (or 15N) pulse y (Eq. 13.28) is modulated by frequencies with the same signs in the direct
and direct dimensions. Such signals are labeled N and P, or anti-echo and echo, respectively, in the literature. Sum and diﬀerence of Eqs.
13.27 and 13.28 yields a signal which is cosine and sine modulated, respectively, in the indirect dimension and can be processed as described
in Section 9.5.1 to provide frequency-discriminated, purely absorptive real part of the signal. However, it takes twice as long to acquire
the signal of the same size, compared to the discrimination by the States-Haberkorn-Ruben method without the gradient. Therefore, the
signal-to-noise ratio is lower by a factor of
√
2 for the same measurement time. A more sophisticated gradient HSQC experiment preserving
the original sensitivity (except for some relaxation loss) has been also developed and is used routinely.
13.4.5 Slice-selective imaging
As described in Section 13.4.1, a 90 ◦ pulse applied during a gradient Gz fully rotates magnetization to the xy plane at z = Ω/(γGz). The
more z diﬀers from Ω/(γGz), the lower is the signal.
In practice, we prefer to select signal from a region of a well deﬁned thickness. This is achieved by applying simultaneously the gradient
and a radio wave with the amplitude modulated so that magnetic moments with frequencies in a certain interval are rotated by an angle
close to 90 ◦, whereas magnetic moments with frequencies outside the selected interval are almost unaﬀected. The amplitude modulation
is achieved by dividing a radio-wave pulse into a series of short pulses with diﬀerent B1, as described in Section 1.5.10. The modulation is
shown in cyan in Figure 13.4. Then, the resonance condition −ωradio = ω0 − γGzz is fulﬁlled in an interval of z deﬁned by the range of the
frequencies aﬀected by the radio-wave pulse. The amplitude-modulated radio-wave pulse is usually relatively long and magnetic moments
with diﬀerent precession frequencies (within the selected range) have enough time to rotate signiﬁcantly during the pulse. This rotation,
diﬀerent for diﬀerent vertical positions inside the selected slice, is refocused by a negative gradient. It can be shown that the gradients
make an exact echo if the negative gradient corresponds to the second half of the positive gradient (between the middle and end of the
amplitude-modulated radio-wave pulse, see Figure 13.4). The exact mathematical prove is not easy, but approximative solutions can be
obtained more easily. The ﬁrst insight can be obtained if we imagine that the chosen range of frequencies was selectively excited by a
narrow pulse in the middle of the selection gradient (black bar in Figure 13.4). Then, no transverse magnetization exists before the pulse
and the transverse magnetization created by the pulse experiences a gradient echo consisting of the second half of the positive selection
gradient and by the negative refocusing gradient.
Such ﬁltering of the signal according to the z coordinate of the observed molecule is the basis of slice-selective imaging techniques.
The gradients applied in the x or y direction can be used in the same manner to select slices perpendicular to the x or y axis, respectively.
In human body imaging, the coordinate system is used so that Gx, Gy, and Gz selects sagittal, coronal, and axial slices, respectively (see
Figure 13.5).
13.4.6 Frequency encoding gradients
We now proceed to the imaging in the slice selected at z ≈ (ω0 −(−ωradio))/(γGz). In order to describe imaging in the x direction based on
frequency encoding, we analyze how the density matrix evolves during the Gx gradient in Figure 13.6. The density matrix at the beginning
of Gx is ˆρ(c) = It − κIy in the selected slice and ˆρ(c) = It everywhere else. During Gx, ˆρ(t)) in the slice evolves as
342
ˆρ(t) = It − κIycos((Ω − γGxx)t) + κIxsin((Ω − γGxx)t), (13.29)
which can be also written as
ˆρ(x) = It − κIycos((Ωt − γGxtx) + κIxsin((Ωt − γGxtx) = It − κIycos((Ωt − kxx) + κIxsin((Ωt − kxx), (13.30)
where kx is the x-component of the wave vector k in Figure 13.8. Introducing relaxation and performing phase correction,
M+ (t) =
γ2 2B0
4kBT
e−R2τz eiΩt−R2t
N(x)e−iγGxxt. (13.31)
Expressing the ensemble averaging explicitly,
M+ (t) =
γ2 2B0
4kBT
e−R2τz eiΩt−R2t
Lxˆ
0
N(x)e−iγGxxt
dx =
γ2 2B0
4kBT
e−R2τz eiΩt−R2t
Lxˆ
0
N(x)e−ikxx
dx, (13.32)
where Lx is the size of the imaged object in the x direction.
Fourier transformation of M+ (t) gives a spectrum corresponding to
Y (ω) =
γ2 2B0
4kBT
e−R2τz
Lxˆ
0
N(x)
R2
R2
2 + (Ω − γGxx − ω)2
+ i
Ω − γGxx − ω
R2
2 + (Ω − γGxx − ω)2
dx, (13.33)
with the spatial distribution encoded in the apparent frequency Ω = Ω − γGxx.
In reality, the signal is stored as N discrete data points sampled with a time increment ∆t. The value kx = γGxt = γGx · n∆t can
be written as n∆kx, where ∆kx = γGx · ∆t. The sampled time points correspond to n∆t = n∆kx/(γGx). Considering ∆t∆f = 1/N
(Eq. 3.7), ∆kx = γGx/(N∆f). The second integral in Eq. 13.32 has the form of the Fourier transformation (as N(x) = 0 for x < 0 and
x > L, the integration can be extended to ±∞). The distribution of the spin density N(x) can be evaluated at discrete values of x = j∆x
by the inverse discrete Fourier transformation of the signal sampled at n∆t = n∆kx/(γGx):
N(x) = Nj =
4kBT∆kx
γ2 2B0e−R2τz
N−1
n=0
M+ ne
−(iΩ−R2) ∆kx·n
γGx ei2π j·n
N . (13.34)
Note that all features of discrete Fourier transformation (e.g. aliasing) are relevant for image reconstruction.
Extending the discussion to the two-dimensional experiment (right panel in Figure 13.6), is straightforward:
x = jx∆x kx = γGxt2 = γGx · nx∆t2 ∆kx = γGx · ∆t2 = γGx/(Nx∆f2) (13.35)
y = jy∆y ky = γGyt1 = γGy · ny∆t1 ∆ky = γGy · ∆t1 = γGy/(Ny∆f1), (13.36)
and
N(x, y) = Njx,jy = ∆kx∆ky
4kBT
γ2 2B0e−R2τz
Nx−1
nx=0
Ny−1
ny=0
M+ nx,ny e
−(iΩ−R2) ∆kx·nx
γGx
+
∆ky·ny
γGy
e
i2π jx·nx
Nx
+
jy·ny
Ny
. (13.37)
13.4.7 Phase encoding gradients
In order to describe imaging in the y direction based on phase encoding, we analyze how the density matrix evolves during the Gy gradient
in the pulse sequence presented in the left panel in Figure 13.7. The gradient is placed in a refocusing echo of the duration TE. We
ignore the possible phase shift and assume that the density matrix at the beginning of Gy is ˆρ(d) = It + κIy inside the selected slice and
ˆρ(d) = It everywhere else. During Gy, ˆρ(d)) evolves to
ˆρ(e) = It + κIycos((Ω − γGyy)τy) − κIxsin((Ω − γGyy)τy), (13.38)
where τy is the duration of the gradient. Expressing ˆρ(e) as a function of y,
ˆρ(y) = It + κIycos((Ωτy − γGyτyy) − κIxsin((Ωτy − γGyτyy) = It + κIycos((Ωτy − kyy) − κIxsin((Ωτy − kyy). (13.39)
During imaging, τy is kept constant and the phase shift Ωτy is refocused by the echo. The parameter that is varied is the strength of
the gradient Gy, gradually decreased from the originally positive value to a negative one by increments ∆Gy.
13.4. SUPPORTING INFORMATION 343
Then, a negative pre-phasing gradient Gx is applied for a time period equal to the half of the total acquisition time Nx∆t/2. Ignoring
the phase shifts Ωτx and −ΩNx∆t/2 that get refocused at TE, the density matrix at the beginning of data acquisition is
ˆρ(f) = It + κIy cos(−γGyτyy) cos +γGx
Nx
2
∆tx − sin(−γGyτyy) sin +γGx
Nx
2
∆tx
− κIx sin(−γGyτyy) cos +γGx
Nx
2
∆tx − cos(−γGyτyy) sin +γGx
Nx
2
∆tx (13.40)
and further evolves during the acquisition as
ˆρ(x, y) = It + κIycxcy − sxsy − κIxsxcy + cxsy, (13.41)
where
sx = sin(kxx) = − sin
Nx
2
− nx ∆kxx cx = cos(kxx) = cos
Nx
2
− nx ∆kxx (13.42)
sy = sin(kyy) = − sin
Ny
2
− ny ∆kyy cy = cos(kyy) = cos
Ny
2
− ny ∆kyy (13.43)
x = jx∆x kx = kx(0) + γGxt = −
Nx
2
− nx γGx∆t ∆kx = −γGx · ∆t = γGx/(Nx∆f) (13.44)
y = jy∆y ky = ky(0) − nyγ∆Gyτy =
Ny
2
− ny γτy∆Gy ∆ky = −γ∆Gy · τy. (13.45)
The pre-phasing gradient makes the evolution of the density matrix to start from negative kx and pass kx = 0 in the middle of the
experiment. The modulation by kxx and kyy thus has the same form.
Using standard trigonometric relations,
ˆρ(x, y) = It + κIycos
Nx
2
− nx ∆kxx +
Ny
2
− ny ∆kyy + κIxsin
Nx
2
− nx ∆kxx +
Ny
2
− ny ∆kyy , (13.46)
Introducing relaxation and performing phase correction,
M+ (kx, ky) =
γ2 2B0
4kBT
e
−R2 TE− Nx
2
−nx
∆kx
γGx N(x, y)e
−i Nx
2
−nx ∆kxx+
Ny
2
−ny ∆kyy
. (13.47)
Expressing the ensemble averaging explicitly,
M+ (kx, ky) =
γ2 2B0
4kBT
e
−R2 TE− Nx
2
−nx
∆kx
γGx
Lxˆ
0
Lyˆ
0
N(x, y)e
−i Nx
2
−nx ∆kxx+
Ny
2
−ny ∆kyy
dxdy. (13.48)
Inverse discrete Fourier transformation converts the signal into the two-dimensional image
N(x, y) = Njx,jy =
4kBT∆kx∆ky
γ2 2B0e−R2TE
Nx
2
−1
nx=− Nx
2
Ny
2
−1
ny=−
Ny
2
M+ nx,ny e
−R2
Nx
2
−nx
∆kx
γGx e
i2π jx·nx
Nx
+
jy·ny
Ny
. (13.49)
The analysis can be easily extended to the three-dimensional imaging experiment presented in the right panel in Figure 13.7, where
two phase-encoding gradients Gx and Gy are applied (the frequency encoding gradient is Gz). The evolution of the density matrix matrix
from ˆρ(d) introduces the modulation
ˆρ(x, y) = It + κIycxcycz − sxsycz − sxcysz − cxcycz − κIxsxcycz + cxsycz + cxcysz − sxsysz, (13.50)
where
344
sx = sin(kxx) = − sin
Nx
2
− nx ∆kxx cx = cos(kxx) = cos
Nx
2
− nx ∆kxx (13.51)
sy = sin(kyy) = − sin
Ny
2
− ny ∆kyy cy = cos(kyy) = cos
Ny
2
− ny ∆kyy (13.52)
sz = sin(kzz) = − sin
Nz
2
− nz ∆kzz cz = cos(kzz) = cos
Nz
2
− nz ∆kzz (13.53)
x = jx∆x kx = kx(0) − nxγ∆Gxτx =
Nx
2
− nx γτx∆Gx ∆kx = −γ∆Gx · τx (13.54)
y = jy∆y ky = ky(0) − nyγ∆Gyτy =
Ny
2
− ny γτy∆Gy ∆ky = −γ∆Gy · τy (13.55)
z = jz∆z kz = kz(0) + γGzt = −
Nz
2
− nz γGz∆t ∆kz = −γGz · ∆t = γGz/(Nz∆f). (13.56)
The corresponding signal is
M+ (kx, ky, kz) =
γ2 2B0
4kBT
e
−R2 TE− Nz
2
−nz
∆kz
γGz
Lxˆ
0
Lyˆ
0
Lzˆ
0
N(x, y, z)e
−i Nx
2
−nx ∆kxx+
Ny
2
−ny ∆kyy+ Nz
2
−nz ∆kzz
dxdydz,
(13.57)
and the inverse discrete Fourier transformation converts it into the three-dimensional image
N(x, y, z) = Njx,jy,jz =
4kBT∆kx∆ky∆kz
γ2 2B0e−R2TE
Nx
2
−1
nx=− Nx
2
Ny
2
−1
ny=−
Ny
2
Nz
2
−1
nz=− Nz
2
M+ nx,nz,nz e
−R2
Nz
2
−nz
∆kz
γGz e
i2π jx·nx
Nx
+
jy·ny
Ny
+ jz·nz
Nz
.
(13.58)