Professional Documents
Culture Documents
https://doi.org/10.1038/s41557-018-0046-3
Human genome function is underpinned by the primary storage of genetic information in canonical B-form DNA, with a sec-
ond layer of DNA structure providing regulatory control. I-motif structures are thought to form in cytosine-rich regions of the
genome and to have regulatory functions; however, in vivo evidence for the existence of such structures has so far remained
elusive. Here we report the generation and characterization of an antibody fragment (iMab) that recognizes i-motif structures
with high selectivity and affinity, enabling the detection of i-motifs in the nuclei of human cells. We demonstrate that the in
vivo formation of such structures is cell-cycle and pH dependent. Furthermore, we provide evidence that i-motif structures are
formed in regulatory regions of the human genome, including promoters and telomeric regions. Our results support the notion
that i-motif structures provide key regulatory roles in the genome.
D
NA has a well-known propensity to adopt alternative non-B- Taken together, while previous in vitro and indirect evidence
form conformations in vitro, including G-quadruplex (G4)1 has supported the notion that i-motif structures are formed in vivo,
and intercalated motif (i-motif) structures2 (Fig. 1a). Of the direct evidence has remained elusive. Here, we report the genera-
two structures, G4 DNA formed within guanine (G)-rich regions tion and characterization of a human antibody fragment that recog-
of the genome is by far the more studied. Computational analysis3,4 nizes i-motif structures with high selectivity and affinity. We utilize
and high-throughput G4 sequencing5 have revealed that G4s are this reagent to demonstrate the first direct visualization of i-motif
enriched in the regulatory regions of the genome. Furthermore, sev- structures in the nuclei of human cells.
eral studies have investigated regulatory functions of G4s in cellular
pathways6,7. In spite of in silico and in vitro characterizations of G4s, Results
the in vivo existence of them in human cells remained speculative Selection and characterization of an antibody fragment specific for
until the visualization of these structures using an antibody frag- human telomere i-motif. We utilized the Garvan-2 human single-
ment that recognizes G4 in a structure-specific manner8. chain variable fragment (scFv) library32–34 to select i-motif-specific
In marked contrast, insights into the biological role of i-motif antibody fragments. Three rounds of phage selections were per-
DNA are limited. This structure is formed via a stack of intercalat- formed to isolate binders to the human telomere i-motif (hTelo
ing hemiprotonated C-neutral C base pairs (C+:C), which are stabi- i-motif), a well-characterized i-motif35 whose structure has been
lized in slightly acidic pH (Fig. 1a)9. As C-rich regions are common reported2 (Fig. 1b and Supplementary Table 1). Binders were charac-
within the human genome, i-motifs have been characterized in vitro, terized using soluble fragment enzyme-linked immunosorbent assay
derived from telomeres2,10, centromeres11,12 and promoter regions of (ELISA) followed by bio-layer interferometry (BLI) off-rate ranking,
proto-oncogenes13, using a range of biophysical techniques. These resulting in the identification of a lead candidate (which we termed
biophysical analyses demonstrated the formation of both intra- and iMab). A detailed protocol of the antibody selection procedure is
intermolecular structures, with the overall stability of the structure provided in Supplementary Method 1 and Supplementary Table 2.
dependent on the number of cytosines in the i-motif core, as well as We next used BLI to determine the kinetics and affinity of
the length and composition of loops13–16. the interaction between purified iMab and the hTelo i-motif. We
Despite strong evidence that shows C-rich sequences can adopt observed a highly stable complex, characterized by a slow dissocia-
i-motif structures in vitro, the in vivo existence of this four-stranded tion rate and a KD in the low nanomolar range (ka = 3.2 × 104 s−1 M−1,
DNA structure in the human genome is still a matter of scientific kd = 1.9 × 10−3 s−1, KD = 59 nM) (Fig. 1c). We next evaluated the spec-
debate. In particular, observations that in vitro formation of the ificity of iMab for the hTelo i-motif by ELISA. This analysis revealed
i-motif structure is dependent on acidic conditions have raised highly specific binding, as indicated by the absence of binding to
questions concerning biological relevance17,18. On the other hand, a wide range of control protein and nucleic acid antigens includ-
the existence of i-motif DNA in vivo is supported by studies that ing double-stranded DNA, hairpin DNA, microRNA, streptavidin,
demonstrated that this structure can form at physiological pH under neutravidin, hen egg-white lysozyme and neuropeptide Y (NPY)
conditions of molecular crowding19–21 and negative superhelicity (Fig. 1d). In addition, no binding was observed to a mutant hTelo
induced during transcription22. Recently, genomic sequences that oligonucleotide (Fig. 1e), which lacks the i-motif structure, as indi-
form stable i-motif structures at neutral pH have been reported23–25. cated by circular dichroism (CD) spectroscopy (Fig. 1f). To investi-
Furthermore, several studies have indicated that i-motif formation gate whether iMab is capable of binding to partially folded i-motif
modulates replication26 and transcription27–31. structures36, we tested truncated constructs with either three or two
Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, Sydney, New South Wales, Australia. 2St Vincent’s Clinical
1
School, Faculty of Medicine, University of New South Wales, Kensington, Sydney, New South Wales, Australia. 3Children’s Medical Research Institute,
University of Sydney, Westmead, Sydney, New South Wales, Australia. *e-mail: m.dinger@garvan.org.au; d.christ@garvan.org.au
a R b
N
N N
H
N H
N H N
H O
H N
H N R
N O
N
N
O G-quadruplex Cytosine
R N H N
N
O
H (G4) H
N H N
H
N
Thymine
H N H
N N H N N O
R R Adenine
N i-motif O
H
N N
R (iM) H
N
H
c Association Dissociation d e
10 ka (M
–1 –1
s ) = 3.157×10
4 –1
kd (s ) = 1.881×10
–3
2.0 hTelo i-motif 2.0
9
dsDNA
8 hTelo i-motif
hTelo i-motif
6 Streptavidin x3 C-runs
5 250 nM Neutravidin
1.0 1.0 hTelo i-motif
4 Lysozyme x2 C-runs
3 125 nM NPY peptide
2 0.5 0.5 hTelo i-motif
1 62.5 nM mut
0 0.0 0.0
0 60 120 180 240
5
25
8
16
32
64
5
25
5
1
2
4
8
16
32
64
12
0.
12
0.
Time (s)
0.
0.
0.
0.
iMab concentration (nM) iMab concentration (nM)
f 6 × 105
g h
12 Association Dissociation 12 Association Dissociation
Molar ellipticity (deg cm2 dmol–1)
hTelo i-motif 11 11
10 10
4 × 105
hTelo i-motif 9 9 Continued at pH 6
x3 C-runs 8 pH 6 8 pH 6
Binding (nm)
Binding (nm)
2 × 10 5 7 pH 7 7
hTelo i-motif 6 6
x2 C-runs 5 5
0 4 4
pH 8
3 3
hTelo i-motif Switched to pH 9
2 2
–2 × 105 mut
1 1
0 pH 9
0
220 240 260 280 300 320 0 60 120 180 240 0 60 120 180 240
Wavelength (nm) Time (s) Time (s)
Fig. 1 | Characterization of an hTelo i-motif specific antibody fragment (iMab). a, Schematic of i-motif and G4 structures. b, Structure and schematic
drawing of human telomeric (hTelo) i-motif (PDB: 1EL2). c, Bio-layer interferometry (BLI) analysis of binding of different concentrations of iMab to hTelo
i-motif at pH 6 (association constant, ka = 3.2 × 104 s−1 M−1; dissociation constant, kd = 1.9 × 10−3 s−1; equilibrium constant, KD = 59 nM). d, Examination of
iMab binding to control oligonucleotides and proteins (ELISA format). Plotted values are means ±s.d. of three replicates. e, Examination of iMab binding
to full-length hTelo i-motif, as well as to truncated (x3 and x2 C-runs) and mutant species (ELISA format). Plotted values are means ±s.d. of three
replicates. f, Circular dichroism (CD) spectra of full-length hTelo i-motif, truncated (x3 and x2 C-runs) and mutant species. g, BLI analysis of binding of
iMab to hTelo i-motif at conditions at or above pH 6. h, BLI analysis of binding of iMab to hTelo i-motif at pH 6 and dissociation at pH 6 and 9, respectively.
tandem runs of cytosines. Indeed, these truncated hTelo i-motifs constant concentration of iMab binding to the hTelo i-motif under
displayed reduced CD ellipticity compared to wild-type (wt) hTelo conditions ranging from pH 6 to 9 (the antibody itself was highly
i-motif (Fig. 1f), suggesting an inability to form a fully folded i-motif stable under all analysed conditions; Supplementary Fig. 1). This
structure. The two truncated constructs displayed either consider- experiment indeed indicated considerable reduction of antibody
ably reduced binding (three C-runs) or no detectable binding (two binding at higher pH and no detectable binding at pH 9 (Fig. 1g). In
C-runs), indicating that recognition by iMab is highly structure addition, antibody–i-motif complex formed at pH 6 could be read-
dependent (Fig. 1e). In contrast, wt hTelo displayed a high degree of ily dissociated when switching to pH 9 buffer conditions (Fig. 1h).
foldedness, as indicated by positive ellipticity at ~285 nm and nega- Next we investigated the influence of iMab on the i-motif structure.
tive ellipticity at ~255 nm (Fig. 1f). This was highly comparable to CD spectroscopy was performed for the hTelo i-motif in the pres-
a previously reported CD spectrum for wt hTelo35, and folded wt ence or absence of iMab across different pH values. This experiment
hTelo was recognized by iMab under all analysed conditions. revealed that the addition of iMab does not result in any wavelength
Next, the pH dependence of i-motif stability and iMab binding shift irrespective of pH (Supplementary Fig. 2), implying that iMab
was investigated. In general, i-motif structures are considered most binding does not detectably induce or alter the i-motif structure.
stable at acidic pH37, reflecting the negative logarithmic acid dissoci- Taken together, our results demonstrate that iMab recognizes the
ation constant (pKa-) of cytosine and the requirement for the forma- folded hTelo i-motif structure with high affinity and specificity.
tion of hemiprotonated C+:C pairs35. We thus hypothesized that pH
and the binding of iMab should be inversely correlated, assuming iMab recognizes i-motifs in a structure-specific manner. In
that the antibody predominantly recognizes folded i-motif rather addition to telomere regions, C-rich sequences capable of adopt-
than unfolded species. We investigated this notion, using BLI and a ing i-motif structures in vitro are commonly found in other regions
© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved.
NAture CHeMIstry Articles
iM 1G22 iM 1A83 iM c-MYC 3′
2.0 Apparent KD = 635 pM 2.0 Apparent KD = 575 pM 2.0 Apparent KD = 409 pM
5′
Absorbance (450 nm)
5
25
5
1
2
4
8
16
32
64
5
25
5
1
2
4
8
16
32
64
2JSM 148D 1XAV
12
0.
12
0.
12
0.
0.
0.
0.
0.
0.
0.
iMab concentration (nM) iMab concentration (nM) iMab concentration (nM)
5
25
5
1
2
4
8
16
32
64
5
25
5
1
2
4
8
16
32
64
2GKU 2F8U 2M27
12
0.
12
0.
12
0.
0.
0.
0.
0.
0.
0.
iMab concentration (nM) iMab concentration (nM) iMab concentration (nM)
Fig. 2 | Evaluation of the specificity of iMab for different i-motif and G4 structures. iMab recognizes a wide range of i-motif DNA structures with high
affinity; it does not detectably bind to G4 structures. Plotted values are means ±s.d. of three replicates. Structure and PDB identifiers are provided;
predicted conformations are provided where structural information is unavailable. Green, cytosine; yellow, adenine; purple, thymine; blue, guanine.
throughout the genome. Several of these i-motifs have been bio- did not show any binding to i-motifs and G4s (Supplementary
physically characterized, including sequences from centromeres Fig. 3), iMab recognized i-motifs and differentiated them from G4s
and the promoters of proto-oncogenes13,18. In addition to the hTelo (Supplementary Fig. 3). As a second control, we utilized the widely
i-motif structure described above, we evaluated the binding of used commercially available G4 quadruplex antibody BG48. As
iMab to six other well-defined intramolecular i-motif structures expected, this antibody showed substantial binding to all analysed
(Supplementary Table 1) by ELISA. These include 1G2211 and G4 structures (Supplementary Fig. 3); however, BG4 also displayed
1A8338, whose structures have been solved, and four other i-motifs, detectable cross-reactivity to i-motifs (Supplementary Fig. 3) and
c-MYC22, RET39, VEGF40 and Nano41. As depicted in Fig. 2, iMab competed with iMab for binding to several i-motifs (Supplementary
has high apparent affinities for all of these i-motifs, ranging from Fig. 4).
approximately 290 picomolar to around 10 nanomolar. These results
suggest iMab is an i-motif structure-specific antibody and capable I-motif structures are formed in the nuclei of human cells. We
of recognizing this structural motif with high affinity, irrespective then used iMab for indirect immunofluorescent staining of three
of sequence diversity. different human cell lines. This staining revealed punctate foci in the
nuclei of MCF7, U2OS and HeLa cell lines that were dependent on
iMab differentiates i-motifs from G4s. Unlike i-motifs, for which the presence of iMab (Fig. 3a). Next, we investigated whether DNase
only a small number of structures are available in the Protein Data I digestion can affect the number of visualized foci. This experi-
Bank (PDB)42, structures of many G4s have been reported; these ment showed that DNase I treatment significantly decreased the
structures demonstrate extensive structural and topological diver- number of foci in the nuclei (Fig. 3a,b; see Supplementary Method 2
sity43. Therefore, it is conceivable that i-motifs and G4s could dis- for counting of the number of foci). Limited residual foci were
play common epitopes. Moreover, in genomic DNA, i-motifs are observed after treatment, presumably due to nuclease-resistant
often accompanied by complementary G-rich sequences, which can DNA structures or shielding by proteins. Another possibility is the
adopt G4 structures. For applications such as immunostaining, a formation of RNA i-motifs that are not removed by DNase I diges-
high level of specificity and the absence of binding to G4s is therefore tion. A small decrease in the number of iMab foci after RNase A
likely to be important. To evaluate iMab cross-reactivity we exam- treatment supports this speculation (Fig. 3b). As a further control,
ined binding to six G4s of known structures, which were selected we blocked iMab by pre-incubation with a fivefold molar excess of
to represent a diverse range of conformations (Supplementary folded hTelo i-motif oligonucleotide before staining and observed
Table 1): promoter G4s c-MYC44, BCL245, VEGF46 and telomeric a significant decline in the immunofluorescence signal (Fig. 3b).
G4s 2JSM47, 2GKU48 and aptamer G4 TBA49. As outlined in Fig. 2, In addition, DP47DPK9 germline antibody staining did not show
ELISA results show that iMab does not display any detectable bind- detectable punctate foci (Fig. 3b).
ing to these molecules, suggesting that it is highly capable of differ- To validate that the detected foci correspond to i-motif struc-
entiating between G4 and i-motif structures. tures, we transfected HeLa cells with folded i-motif and G4 oligo-
To confirm these results, binding was analysed by BLI using two nucleotides. Cells were transfected with 1, 2 and 4 µg of folded hTelo
additional antibodies as controls. As a negative control, we used the i-motif. Staining cells after 16 h using iMab showed an increase in
DP47DPK9 germline scFv antibody32, from which iMab and the the number of foci that directly correlated with the amount of trans-
parental Garvan-2 scFv library were derived. While DP47DPK9 fected hTelo i-motif (Fig. 3c). The same experiment was performed
a +iMab –iMab + DNase I / + iMab mimosine arrests them at the G1/S boundary51. Staining of these
cells showed that i-motif formation is at the highest level at this
phase of the cell cycle. Three hours after being released from the
mimosine-induced blockade, when the cells synchronously begin
MCF7
S phase, the number of iMab foci was reduced compared to the
G1/S boundary phase (Fig. 4a,b). Similar trends of i-motif forma-
tion during cell-cycle progression were observed in the MCF7 cell
line (Supplementary Fig. 5).
160
140
intensity is inversely correlated with intracellular pH). As depicted
Number of
Number of
80
120
100
80
in Fig. 5d, compared to 5% CO2, we observed ~20% lower and
40 60 ~30% higher fluorescence at 2% and 8% CO2, respectively. These
40
20 results indicate that intracellular pH can be decreased by incu-
0 0
bating cells at higher CO2 concentrations, resulting in increased
G elo cu A
μg
μg
μg
μg
μg
μg
μg
PK dy
t
d
en
m i-m ate
hT ein se
7D tibo
9)
as
4
2
4
0
2
1
1
m
(D ne otif
ith pr Na
b
at
Amount of transfected experiments using HEK 293T cells and pHrodo Green, a different
P4 an
D
tre
oligonucleotide
o
li
w ab
er
© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved.
NAture CHeMIstry Articles
a Serum starving Mimosine arrest Mimosine release b
(G0/G1) (G1/S boundary) (early S phase )
140 ****
***
****
120
80
Sub-G0
G0/G1
S
G2/M
Multi-nuclear
cells
Sub-G0
G0/G1
S
G2/M
Multi-nuclear
cells
Sub-G0
G0/G1
S
G2/M
Multi-nuclear
cells
4.0 × 103 4.0 × 103 4.0 × 103 60
Cell count
Cell count
2.0 × 103 2.0 × 103 2.0 × 103 20
S
ry
G
da
rly
0/
0 0 0
Ea
un
G
bo
0 0 0
3
3
3
S
15 10
20 10
25 10
10
15 10
20 10
25 10
10
15 10
20 10
25 10
10
10 10
10 10
10 10
1/
G
×
×
×
×
0
0
50
50
50
B610_20-A :: PI B610_20-A :: PI B610_20-A :: PI
Fig. 4 | Visualization and quantification of i-motif structures during cell-cycle progression in HeLa cells. a, Imaging of iMab foci in the nuclei of HeLa
cells using confocal microscopy. Cells were synchronized at the G0/G1, G1/S boundary and early S phases. Nuclei were counterstained with DAPI. Scale
bars, 5 µm. Flow cytometry graphs outline the population of synchronized and propidium iodide stained cells in each phase. b, Quantification of the iMab
foci in the nuclei of HeLa cells synchronized at the G0/G1, G1/S boundary and early S phases. iMab foci of 200–300 nuclei were counted per condition.
Boxes represent the 25th to 75th percentiles. Horizontal lines and ‘+’ indicate medians and means, respectively. Whiskers indicate highest and lowest
values of the results. ***P < 0.001, ****P < 0.0001. Statistical significance was assessed by one-way ANOVA using Tukey’s multiple comparisons test.
b 120 c d
140
****
**
pH of MCF7
Number of foci per nucleus
culture 120
**** ****
Relative intracellular
80 100
2% CO2 7.98 ± 0.10 80
60
40 5% CO2 7.51 ± 0.07
40
2
O
O
C
C
2%
5%
8%
2%
5%
8%
Fig. 5 | Investigation of the effect of pH variation on the formation of i-motif structures in MCF7 cells. a, Imaging of iMab foci in the nuclei of MCF7 cells
after incubation at different CO2 concentrations for 2.5 h using confocal microscopy. Confocal images show iMab foci in the nuclei of MCF7 cells. Nuclei
were counterstained with DAPI. Scale bars, 3 µm. b, Quantification of iMab foci in the nuclei of MCF7 cells after 2.5 h variation of CO2 concentration.
iMab foci of 200–300 nuclei were counted per condition. Boxes represent 25th to 75th percentiles. Horizontal lines and ‘+’ indicate medians and means,
respectively. Whiskers indicate highest and lowest values of the results. **P < 0.01, ****P < 0.0001; statistical significance was assessed by one-way
ANOVA using Tukey’s multiple comparisons test. c, pH values of MCF7 cultures after 2.5 h variation in CO2 concentration. d, Relative intracellular pHrodo
Red fluorescence across different CO2 concentrations in MCF7 cells. Error bars represent mean ±s.e.m. of three replicates. ****P < 0.0001; statistical
significance was assessed by one-way ANOVA using Tukey’s multiple comparisons test.
a iMab TRF2 Co-localization Merged of i-motif structures has considerably slowed research in the i-motif
field. However, it is possible that the transient nature of the i-motif
structure, far from being a biophysical artefact, may actually medi-
ate important regulatory roles; more specifically, variation in local
conditions such as pH may be modulating function by controlling
i-motif stability, for instance within promoter and telomere regions.
b iMab E12/E47 Co-localization Merged We conclude that our results support the notion that C-rich
sequences within the genome form i-motif structures in the nuclei of
human cells and control regulatory functions. Therefore, it is conceiv-
able that any changes, such as mutations that alter i-motif stability and
conformation, can affect their regulatory role, as has been previously
reported for other DNA and RNA structures61,62. Our results thereby
Fig. 6 | Evidence of i-motifs formation in regulatory regions. a, Imaging of
provide a core foundation for future studies exploring the biological
iMab and TRF2 co-localization at telomeres in the nucleus of an U2OS cell
role of this common genomic DNA structure, and for validation as a
using confocal microscopy. b, Imaging of iMab and E12/E47 co-localization
therapeutic target in cancer and other pathological conditions.
at promoters in the nucleus of an U2OS cell using confocal microscopy.
Nuclei were counterstained with DAPI. Scale bars, 3 µm. Imaris software Methods
was used to analyse co-localizations. Material and methods are described in the Supplementary Information.
© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved.
NAture CHeMIstry Articles
19. Cui, J., Waltman, P., Le, V. H. & Lewis, E. A. The effect of molecular crowding 45. Dai, J., Chen, D., Jones, R. A., Hurley, L. H. & Yang, D. NMR solution
on the stability of human c-MYC promoter sequence I-motif at neutral pH. structure of the major G-quadruplex structure formed in the human BCL2
Molecules 18, 12751–12767 (2013). promoter region. Nucleic Acids Res. 34, 5133–5144 (2006).
20. Rajendran, A., Nakano, S. & Sugimoto, N. Molecular crowding of the 46. Agrawal, P., Hatzakis, E., Guo, K., Carver, M. & Yang, D. Solution structure of
cosolutes induces an intramolecular i-motif structure of triplet repeat DNA the major G-quadruplex formed in the human VEGF promoter in K+:
oligomers at neutral pH. Chem. Commun. 46, 1299–1301 (2010). insights into loop interactions of the parallel G-quadruplexes. Nucleic Acids
21. Li, H., Hai, J., Zhou, J. & Yuan, G. The formation and characteristics of the Res. 41, 10584–10592 (2013).
i-motif structure within the promoter of the c-myb proto-oncogene. 47. Phan, A. T., Kuryavyi, V., Luu, K. N. & Patel, D. J. Structure of two
J. Photochem. Photobiol. B 162, 625–632 (2016). intramolecular G-quadruplexes formed by natural human telomere sequences
22. Sun, D. & Hurley, L. H. The importance of negative superhelicity in inducing in K+ solution. Nucleic Acids Res. 35, 6517–6525 (2007).
the formation of G-quadruplex and i-motif structures in the c-Myc promoter: 48. Luu, K. N., Phan, A. T., Kuryavyi, V., Lacroix, L. & Patel, D. J. Structure of
implications for drug targeting and control of gene expression. J. Med Chem. the human telomere in K+ solution: an intramolecular (3+1) G-quadruplex
52, 2863–2874 (2009). scaffold. J. Am. Chem. Soc. 128, 9963–9970 (2006).
23. Wright, E. P., Huppert, J. L. & Waller, Z. A. E. Identification of multiple 49. Schultze, P., Macaya, R. F. & Feigon, J. Three-dimensional solution structure
genomic DNA sequences which form i-motif structures at neutral pH. of the thrombin-binding DNA aptamer d(GGTTGGTGTGGTTGG). J. Mol.
Nucleic Acids Res. 45, 2951–2959 (2017). Biol. 235, 1532–1547 (1994).
24. Fleming, A. M. et al. 4n–1 is a ‘sweet spot’ in DNA i-motif folding of 2′ 50. Kuryavyi, V., Phan, A. T. & Patel, D. J. Solution structures of all parallel-
-deoxycytidine homopolymers. J. Am. Chem. Soc. 139, 4682–4689 (2017). stranded monomeric and dimeric G-quadruplex scaffolds of the human c-kit2
25. Mir, B. et al. Prevalent sequences in the human genome can form mini i-motif promoter. Nucleic Acids Res. 38, 6757–6773 (2010).
structures at physiological pH. J. Am. Chem. Soc. 139, 13985–13988 (2017). 51. Jackman, J. & O’Connor, P. in Current Protocols in Cell Biology Ch. 8 (John
26. Takahashi, S., Brazier, J. A. & Sugimoto, N. Topological impact of Wiley and Sons, 1998).
noncanonical DNA structures on Klenow fragment of DNA polymerase. Proc. 52. Simon, S., Roy, D. & Schindler, M. Intracellular pH and the control of
Natl Acad. Sci. USA 114, 9605–9610 (2017). multidrug resistance. Proc. Natl Acad. Sci. USA 91, 1128–1132 (1994).
27. Kang, H. J., Kendrick, S., Hecht, S. M. & Hurley, L. H. The transcriptional 53. Huang, Z. & Huang, Y. The change of intracellular pH is involved in the
complex between the BCL2 i-motif and hnRNP LL is a molecular switch for cisplatin-resistance of human lung adenocarcinoma A549/DDP cells. Cancer
control of gene expression that can be modulated by small molecules. Invest. 23, 26–32 (2005).
J. Am. Chem. Soc. 136, 4172–4185 (2014). 54. Smogorzewska, A. et al. Control of human telomere length by TRF1 and
28. Kendrick, S. et al. The dynamic character of the BCL2 promoter i-motif TRF2. Mol. Cell Biol. 20, 1659–1668 (2000).
provides a mechanism for modulation of gene expression by compounds that 55. Kendrick, S., Akiyama, Y., Hecht, S. M. & Hurley, L. H. The i-motif in the
bind selectively to the alternative DNA hairpin structure. J. Am. Chem. Soc. bcl-2 P1 promoter forms an unexpectedly stable structure with a unique 8:5:7
136, 4161–4171 (2014). loop folding pattern. J. Am. Chem. Soc. 131, 17667–17676 (2009).
29. Sutherland , C., Cui, Y., Mao, H. & Hurley , L. H. Mechanosensor mechanism 56. Roy, B. et al. Interaction of individual structural domains of hnRNP LL
controls the G-quadruplex/i-motif molecular switch in the MYCpromoter with the BCL2 promoter i-motif DNA. J. Am. Chem. Soc. 138,
NHE III1. J. Am. Chem. Soc. 138, 14138–14151 (2016). 10950–10962 (2016).
30. Kaiser, C. E. et al. Insight into the complexity of the i-motif and 57. Aronheim, A., Shiran, R., Rosen, A. & Walker, M. D. The E2A gene product
G-quadruplex DNA structures formed in the KRAS promoter and subsequent contains two separable and functionally distinct transcription activation
drug-induced gene repression. J. Am. Chem. Soc. 139, 8522–8536 (2017). domains. Proc. Natl Acad. Sci. USA 90, 8063–8067 (1993).
31. Brown, R. V. et al. The consequences of overlapping G-quadruplexes and 58. Schaffitzel, C. et al. In vitro generated antibodies specific for telomeric
i-motifs in the platelet-derived growth factor receptor beta core promoter guanine-quadruplex DNA react with Stylonychia lemnae macronuclei. Proc.
nuclease hypersensitive element can explain the unexpected effects of Natl Acad. Sci. USA 98, 8572–8577 (2001).
mutations and provide opportunities for selective targeting of both structures 59. Xu, B., Devi, G. & Shao, F. Regulation of telomeric i-motif stability by
by small molecules to downregulate gene expression. J. Am. Chem. Soc. 139, 5-methylcytosine and 5-hydroxymethylcytosine modification. Org. Biomol.
7456–7475 (2017). Chem. 13, 5646–5651 (2015).
32. Dudgeon, K. et al. General strategy for the generation of human antibody 60. Cui, Y., Kong, D., Ghimire, C., Xu, C. & Mao, H. Mutually exclusive
variable domains with increased aggregation resistance. Proc. Natl Acad. Sci. formation of G-quadruplex and i-motif is a general phenomenon governed by
USA 109, 10879–10884 (2012). steric hindrance in duplex DNA. Biochemistry 55, 2291–2299 (2016).
33. Rouet, R., Dudgeon, K. & Christ, D. Generation of human single domain antibody 61. Wells, R. D. Non-B DNA conformations, mutagenesis and disease. Trends
repertoires by Kunkel mutagenesis. Methods Mol. Biol. 907, 195–209 (2012). Biochem. Sci. 32, 271–278 (2007).
34. Rouet, R., Dudgeon, K., Christie, M., Langley, D. & Christ, D. Fully human 62. Zeraati, M. et al. Cancer-associated noncoding mutations affect RNA
VH single domains that rival the stability and cleft recognition of camelid G-quadruplex-mediated regulation of gene expression. Sci. Rep. 7, 708 (2017).
antibodies. J. Biol. Chem. 290, 11905–11917 (2015).
35. Manzini, G., Yathindra, N. & Xodo, L. E. Evidence for intramolecularly Acknowledgements
folded i-DNA structures in biologically relevant CCC-repeat sequences. The authors thank S. Thompson (Leica Microsystems Australia) and P. Young
Nucleic Acids Res. 22, 4634–4640 (1994). (University of Sydney) for access to the Leica 3XSTED instrument. This work was
36. Dhakal, S., Lafontaine, J. L., Yu, Z., Koirala, D. & Mao, H. Intramolecular supported by Program Grants 1113904, Project Grant 1148051, Development Grants
folding in human ILPR fragment with three C-rich repeats. PLoS One 7, 1113790 and 1076356 and Fellowship 105146 from the National Health and Medical
e39271 (2012). Research Council (NHMRC) and Discovery Grants 160104915 and 140103465 from the
37. Kim, B. G. & Chalikian, T. V. Thermodynamic linkage analysis of pH-induced Australian Research Council (ARC).
folding and unfolding transitions of i-motifs. Biophys. Chem. 216,
19–22 (2016).
38. Han, X., Leroy, J. L. & Gueron, M. An intramolecular i-motif: the solution Authors contributions
structure and base-pair opening kinetics of d(5mCCT3CCT3ACCT3CC). J. M.Z., M.E.D. and D.C. conceived the project. M.Z. designed and performed the
Mol. Biol. 278, 949–965 (1998). experiments. D.B.L. and R.R. contributed phage display experiments. P.S. contributed in
39. Guo, K. et al. Formation of pseudosymmetrical G-quadruplex and i-motif vitro characterization of the antibody. A.L.M. and T.M.B. contributed biophysical studies.
structures in the proximal promoter region of the RET oncogene. W.E.H. contributed microscopy. M.Z., M.E.D. and D.C. analysed the data and wrote the
J. Am. Chem. Soc. 129, 10220–10228 (2007). paper. All authors reviewed the manuscript.
40. Guo, K., Gokhale, V., Hurley, L. H. & Sun, D. Intramolecularly folded
G-quadruplex and i-motif structures in the proximal promoter of the vascular Competing interests
endothelial growth factor gene. Nucleic Acids Res. 36, 4598–4608 (2008). The authors declare no competing interests.
41. Li, T. & Famulok, M. I-motif-programmed functionalization of DNA
nanocircles. J. Am. Chem. Soc. 135, 1593–1599 (2013).
42. Berman, H. M. et al. The Protein Data Bank. Nucleic Acids Res. 28, 235–242 Additional information
(2000). Supplementary information is available for this paper at https://doi.org/10.1038/
43. Karsisiotis, A. I., O’Kane, C. & Webba da Silva, M. DNA quadruplex folding s41557-018-0046-3.
formalism—a tutorial on quadruplex topologies. Methods 64, 28–35 (2013). Reprints and permissions information is available at www.nature.com/reprints.
44. Ambrus, A., Chen, D., Dai, J., Jones, R. A. & Yang, D. Solution structure of
the biologically relevant G-quadruplex element in the human c-MYC Correspondence and requests for materials should be addressed to M.E.D. or D.C.
promoter. Implications for G-quadruplex stabilization. Biochemistry 44, Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in
2048–2058 (2005). published maps and institutional affiliations.
Please do not complete any field with "not applicable" or n/a. Refer to the help text for what text to use if an item is not relevant to your study.
For final submission: please carefully check your responses for accuracy; you will not be able to make changes later.
` Experimental design
1. Sample size
Describe how sample size was determined. iMab foci of 300 - 400 nuclei were counted per condition for each biological replicate. Means
shown for clarity purposes.
2. Data exclusions
Describe any data exclusions. No data were excluded from the analyses.
3. Replication
Describe the measures taken to verify the reproducibility For each experiment and condition, at least three independent technical replicates were
of the experimental findings. performed for each of the two biological replicates. All observations reported in the
manuscript were reproducible.
4. Randomization
Describe how samples/organisms/participants were N/A
allocated into experimental groups.
5. Blinding
Describe whether the investigators were blinded to N/A
group allocation during data collection and/or analysis.
Note: all in vivo studies must report how sample size was determined and whether blinding and randomization were used.
6. Statistical parameters
For all figures and tables that use statistical methods, confirm that the following items are present in relevant figure legends (or in the
Methods section if additional space is needed).
n/a Confirmed
The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement (animals, litters, cultures, etc.)
A description of how samples were collected, noting whether measurements were taken from distinct samples or whether the same
sample was measured repeatedly
A statement indicating how many times each experiment was replicated
The statistical test(s) used and whether they are one- or two-sided
Only common tests should be described solely by name; describe more complex techniques in the Methods section.
Provide confidence intervals or give results of significance tests (e.g. P values) as exact values whenever appropriate and with effect sizes noted.
A clear description of statistics including central tendency (e.g. median, mean) and variation (e.g. standard deviation, interquartile range)
Clearly defined error bars in all relevant figure captions (with explicit mention of central tendency and variation)
See the web collection on statistics for biologists for further resources and guidance.
1
` Software
For manuscripts utilizing custom algorithms or software that are central to the paper but not yet described in the published literature, software must be made
available to editors and reviewers upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). Nature Methods guidance for
providing algorithms and software for publication provides further information on this topic.
b. Describe the method of cell line authentication used. MCF7: validated by Cellbank Australia using short tandem repeat (STR) profiling
HeLa: not done
HEK 293T: validated by Cellbank Australia using STR profiling
U2OS: not done
c. Report whether the cell lines were tested for All cell lines were tested and are negative for mycoplasma
mycoplasma contamination.
d. If any of the cell lines used are listed in the database Cell lines used in this study were chosen as models and results did not show any cell line
of commonly misidentified cell lines maintained by dependency.
ICLAC, provide a scientific rationale for their use.