0% found this document useful (0 votes)
138 views270 pages

Biomod

Biomod manual

Uploaded by

Jhe-Yu Liu
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
138 views270 pages

Biomod

Biomod manual

Uploaded by

Jhe-Yu Liu
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 270

BIOMOD Handbook

Shawn Douglas
This book is for sale at http://leanpub.com/biomod

This version was published on 2018-02-28

This is a Leanpub book. Leanpub empowers authors and publishers with the Lean Publishing
process. Lean Publishing is the act of publishing an in-progress ebook using lightweight tools
and many iterations to get reader feedback, pivot until you have the right book and build
traction once you do.

© 2017 - 2018 Shawn Douglas


Tweet This Book!
Please help Shawn Douglas by spreading the word about this book on Twitter!
The suggested tweet for this book is:
Check out the BIOMOD Handbook: leanpub.com/biomod
The suggested hashtag for this book is #BIOMOD.
Find out what other people are saying about the book by clicking on this link to search for this
hashtag on Twitter:
#BIOMOD
Contents

About . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Support . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

Chapter 1 Let’s go to BIOMOD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2


1. Welcome to the World of DNA Molecule Design . . . . . . . . . . . . . . . . . . . . 2
2. What is BIOMOD? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3. Judging System of BIOMOD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
4. Days at the Jamboree . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
5. Learn from Previous Winners . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
6. BIOMOD Calendar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
7. Tips on Group Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
8. Tips on Website Creation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
9. Tips on Making YouTube Video . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
10. Tips on Presentation in English . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
11. What You’ll Get From BIOMOD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31

Chapter 2 Before Starting Your Project . . . . . . . . . . . . . . . . . . . . . . . . . . . 35


12. Subject Matter of Project . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
13. Types of Brainstorming . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
14. Design Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
15. How to Collect Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
16. More About Papers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46

Chapter 3 What Does Design of DNA Molecules Make Possible? . . . . . . . . . . . . . . 50


17. Structural DNA Nanotechnology: Overview . . . . . . . . . . . . . . . . . . . . . . 50
18. Two-Dimensional DNA Origami . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
19. Three-Dimensional DNA Origami . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
20. DNA Tiles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
21. DNA-Modified Particles (SNA) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
22. DNA Hydrogels . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
23. DNA Nanomachines and Nanorobots: Overview . . . . . . . . . . . . . . . . . . . 72
24. DNA Tweezers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
25. DNA Walker . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
26. DNA Nanomachines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
CONTENTS

27. Autonomous DNA Reaction Systems . . . . . . . . . . . . . . . . . . . . . . . . . 90


28. DNA Computing: Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
29. DNA Logic Gates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
30. DNAzyme Logic Gates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
31. Entropy-Driven Gate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
32. Seesaw Gate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
33. DNA Toolbox . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
34. DNA-Based Finite State Machine . . . . . . . . . . . . . . . . . . . . . . . . . . . 114
35. Traveling Salesman Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117

Chapter 4 Basic Textbook Knowledge . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121


36. Importance of Multidisciplinary Knowledge: Overview . . . . . . . . . . . . . . . 121
37. Thermodynamics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
38. Free Energy and Chemical Equilibrium . . . . . . . . . . . . . . . . . . . . . . . . 128
39. Reaction Kinetics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134
40. Molecular Structure of DNA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 139
41. Thermodynamics of DNA Hybridization . . . . . . . . . . . . . . . . . . . . . . . . 143
42. Strand Displacement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 148
43. Secondary Structures of DNA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153
44. DNA Synthesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157

Chapter 5 Techniques/Materials to Boost DNA . . . . . . . . . . . . . . . . . . . . . . . 161


45. Enzymes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161
46. Polymerase Reactions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 166
47. Restriction Enzymes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169
48. Artificial Bases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 172
49. Non-Watson-Crick Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
50. Protein Motors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189
51. FRET and Molecular Beacons . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 194
52. Artificial Cell and Liposome (and Some Related Topics) . . . . . . . . . . . . . . . 197

Chapter 6 Software Techniques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 201


53. Basic Software Programs: Overview . . . . . . . . . . . . . . . . . . . . . . . . . . 201
54. Document/Table/Graph Preparation . . . . . . . . . . . . . . . . . . . . . . . . . 205
55. Preparation of Presentation Slides . . . . . . . . . . . . . . . . . . . . . . . . . . 208
56. Figure Preparation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 211
57. 3D Animation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216
58. Image Processing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 221
59. Video Editing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226
60. Numerical Analysis Library . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 229
61. Software for Molecular Design of DNA . . . . . . . . . . . . . . . . . . . . . . . . . 234
62. DNA Secondary Structure Prediction . . . . . . . . . . . . . . . . . . . . . . . . . 238
63. Sequence Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 242
CONTENTS

64. Design of DNA Origami . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247


65. Simulation of DNA Origami . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 252
66. Molecular Modeling of DNA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 256
67. Simulation of DNA Logic Gate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 259
About
The BIOMOD Handbook is an English-language translation of the following book:

Open Access PDF¹.


The authors retain the copyright to the original book, and the CBI society holds the rights to
copy/distribute/edit/republish Japanese version.

Support
Translation and editing costs were paid by the BIOMOD Foundation with support from many
sponsors².
S.M.D. was supported by a Burroughs Wellcome Fund Career Award at the Scientific Interface,
the Pew-Stewart Scholars Program for Cancer Research, the Del E. Webb Foundation, and
National Science Foundation (CCF-1317694, CCF-1453847, SI2-1740282), Army Research Office
(W911NF-14-1-0507), Office of Naval Research (GRANT12390474), and the UCSF Program for
Breakthrough Biomedical Research.

¹http://cbi-society.org/home/documents/eBook/Biomod2016.pdf
²http://biomod.net/sponsors/
Chapter 1 Let’s go to BIOMOD
1. Welcome to the World of DNA Molecule Design
It is now possible to design biological macromolecules, typically DNA sequences, to create com-
plex nanodevices and nanostructures. This is a revolution that is comparable to the industrial
revolution but occurring at the molecular level. Molecule design will could into a gargantuan
field of technology comparable to silicon semiconductor technology developed in the 20th
century. Welcome to a world of molecular design that holds infinite potential.

Fundamental Differences between Manufactured Articles and


Organisms
Our world holds an endless variety of manufactured articles made of metal, plastic, wood and
clay, and other materials. Shapes of these objects are formed through cutting and molding of
bulk materials. Essentially all manufactured articles, including buildings, machines, and daily
goods are made in this way, which requires bulk materials and tools and factories to work them.
On the other hand, how are organisms made? We eat and degrade other organisms, which
are then turned into energy and life materials for our bodies without the use of any tools or
factories. We turn the materials into a highly structured system by means of incredibly complex
chemical reactions. Cells in our body are each made of innumerable biological macromolecules
synthesized with extreme precision. They are by far more complex and far larger in scale than
the most complex semiconductor circuit ever designed.

The Central Dogma of Biological Design


We know a great deal about chemical reactions that occur inside organisms. At the center of
these processes is a series of conversion reactions involving three kinds of macromolecules:
DNA → RNA → protein. The sequences of these macromolecule chains (e.g. base and amino acid
sequences) are of significant. DNA, composed of strings four types of bases (A, C, T, and G), is a
macromolecule that stores genetic information. It is converted to an intermediate string of RNA
(transcription), which in turn is converted to amino acid sequences of proteins (translation).
Therefore, the shape and other properties of any protein primarily depend on the base sequence
of the corresponding DNA. Proteins are responsible for innumerable biological functions. In
other words, biological functions are written in the base sequence of DNA.
Chapter 1 Let’s go to BIOMOD 3

Manufacturing Based on Biological Principles


Molecular biology, which began with the discovery of DNA by Watson and Crick in 1950s (see sec-
tion 41), has developed into one of the largest field of science, i.e., life sciences. Scientists have
studied the constituent parts of the biological system one by one, and functional relationships
between the parts are becomingly better understood. Richard Feynman said, “What I cannot
create, I do not understand.” In terms of contraposition, anything can be manufactured once
its mechanism is understood. Thus, the idea of applying biological principles to manufactured
articles emerged. It began with the idea by N. Seeman to produce DNA-based nanostructures
(1984). By regarding DNA not as only a genetic information carrier but also as a nanostructure
material, Seeman noticed that only mixing DNA molecules of particular base sequences could
produce nanomaterials of various shapes.

DNA-based Manufacturing: Design of DNA Molecules


It is still a tall order to generate, or to design the amino acid sequence of, a protein with a
desired shape and/or function. There are 20 different amino acids, several hundreds to several
thousands of which polymerize to form a string of protein. Therefore, there are 20N possible
sequences for a protein of N amino acid residues. The number of possible sequences, or
sequence space, is too huge to identify a particular desired sequence. In the case of DNA,
however, there are only 4 kinds of bases (A, C, G, and T), resulting in 4N sequences at most. This
means that the power of a standard personal computer suffices to search the sequence space of
DNA (see sections 63 and 64). Only since the late 1990s, however, have researchers attempted to
generate a wide variety of parts using only DNA. BIOMOD is a two-day jamboree and competition
to bring together people interested in the design of DNA molecules.

Biology vs Molecular Design

Structural DNA Nanotechnology (see section 17)


The purpose of structural nanotechnology is to manufacture nanostructures of various shapes
using DNA. One typical example is the methodology of “DNA Origami”, a design technology that
uses the principles of folding dictated by DNA to create molecules with complex shapes on a
Chapter 1 Let’s go to BIOMOD 4

computer. Stuctural DNA nanotechnology also tries to combine DNA nanostructures with other
molecules such as proteins and lipids (see section 49).

Molecular Robotics (see section 24)


A molecular robot is a DNA-based structure combined with a sensor, actuator, and DNA com-
puter. A molecular robot is a molecule-sized system that can autonomously react to its en-
vironment, e.g., one that can find diseased cells and deliver a drug to them. Structural DNA
nanotechnology has allowed some static structures to be made, but has not yet successfully
created many dynamic structures.

DNA Computing (see section 29)


Based on the complementarity of DNA, a reaction system can be made in which the concen-
tration of a particular DNA molecule in a solution reflects the result of a logical operation. This
field has progressed rapidly in recent years, now allowing successful fabrication of a complex
molecular computing circuit with several hundred logic gates.

Moore’s Law in Design of DNA Molecules


Moore’s law is the observation that the packaging density in an integrated circuit increases
exponentially over time. We now realize that this principle also applies to DNA molecule design.
The length of designable DNA has reached an order of several ten thousand bases and is
expected to keep increasing. The time is coming when large and complex articles can be made
at the level of molecular design.

Increasing Complexity of DNA Molecular System


Chapter 1 Let’s go to BIOMOD 5

Molecular Robotics Research Group and Molecular Robotics Project


DNA computing also started relatively early in Japan; Wiplash PCR (see section 35) and DNA
memory (see section 28) were both devised in Japan. The Molecular Robotics Research Groupᵃ,
started as a SICE technical division in 2010, is a pioneering group aiming at combining DNA
computing with structural DNA nanotechnology and further applying these breakthroughs to
robotics.
A big research projectᵇ proposed by this group has been funded through a Grant-in-Aid for
Scientific Research on Innovative Areas in the field of molecular robotics (2012-2016).
The purposes of this project include the enlightenment of students and training of young
researchers. It supports various aspects of BIOMOD (see section 2).
ᵃhttp://molbot.org/
ᵇhttp://www.molecular-robotics.org/

—Satoshi Murata (Tohoku University)

2. What is BIOMOD?
This section describes previous competitions of BIOMOD, its relation to Robocon Japan, and
how it all started.

Introduction
Welcome to BIOMOD. In a few months, you will be trying to create an unprecedented molecular
robot/device by trial and error, from the brainstorming of project ideas to the final presentation.
Before that, you should make sure you understand the logic and rules of the BIOMOD competi-
tion.

Background
Progress in molecular biology, organic chemistry, and other disciplines in the past several
decades has revealed a trove of information about biomolecules. In particular, we have gained
a great deal of information on nucleic acids (DNA and RNA) and related matters. The famous
discovery of the double helix structure by Watson and Crick was followed by the establishment
of enzymatic manipulation, and the introduction of novel functions by means of artificial
bases. Such progress in basic science and the resultant technologies together provide a basis
to develop novel fields in engineering such as modern biotechnology and nanotechnology.
Only decades ago, we considered the rules of biological phenomena to be ungovernable using
Chapter 1 Let’s go to BIOMOD 6

engineering methodology. Now we understand them by means of biology, chemistry, and


computer science, and are ready to apply our knowledge to manufacturing. We are facing a
historical turning point.

DNA Nanotechnology and BIOMOD


Progress in nucleic acid science has led to novel fields of engineering, one of which is DNA
Nanotechnology. DNA Nanotechnology originated from an idea of making a “nanoscale DNA
scaffold” to facilitate the crystallization of a protein. This idea was proposed by Nadrian (“Ned”)
Seeman, an American chemist, in the 1980s. Ned Seeman and many researchers have worked
for decades since then to turn this vision into reality. A major breakthrough came in 2005
when Paul Rothemund developed a method to program many short DNA strands with a long
“scaffold” template strand to self-assemble into a desired shape. He termed this method “DNA
Origami”. In addition to creating custom shapes, DNA has also been used to create nucleic acid-
based “sensors” made of aptamers, and repurposed into “information processing systems” and
“motors” based on “strand displacement” reactions. These are some of the key innovations of
DNA Nanotechnology (see corresponding sections and columns for details and references of
these technologies).
As a natural consequence of such progress in biomolecule manipulation technology, people
have come to think of applying these technologies to designing/assembling non-biological
systems at the molecular level. In Japan, the Molecular Robotics Research Group [1] was born
in February 2010 as a SICE technical division. As of 2015, a Grant-in-Aid for Scientific Research
on Innovative Areas “Molecular Robotics” [2], proposed by this group, was launched and now
supports various aspects of Japanese BIOMOD teams as the organizer. This handbook was
written by the leading researchers from the Molecular Robotics Research Group. Meanwhile, in
2010, American groups published on “prototype” molecular robots based on DNA origami core
structures.
At the end of 2010, Shawn Douglas, a postdoctoral researcher at Harvard (and now faculty at
University of California, San Francisco) and one of the USA-based authors of a molecular robot
paper, proposed organizing the first BIOMOD to research groups all over the world. One goal of
BIOMOD was to offer undergraduate students direct hands-on experience in the field just as it
just gaining momentum. The philosophy of BIOMOD is that students should bring their creative
enthusiasm to the forefront of scientific research and drive it forward, rather than repeating
cookie-cutter laboratory exercises that mimic past breakthroughs, long after those advances
entered the history books.

BIOMOD’s Relationship to Robocon in Japan


BIOMOD and robotics are closely intertwined. When Japanese researchers received the pro-
posal, they wanted to apply know-how from Robocon, a popular engineering competition in
Japan, to participating in this new competition.
Chapter 1 Let’s go to BIOMOD 7

Originally, BIOMOD planned to borrow the entire competition framework of iGEM (International
Genetically Engineered Machine competition) [5], a student competition in synthetic biology.
The idea was that incorporating the similarly standardized methodology of DNA origami and
CAD (for easy design of DNA origami) into the framework of iGEM and applying it to BIOMOD
should result in similarly rapid progress in the competition, and thus in the field of molecular
robotics.
However, Japanese researchers thought that the judging system of iGEM was not fully transpar-
ent, and, in terms of both scientific progress and education, they wanted a more transparent and
more equitable judging system that would still be suitable for student participants and teachers.
As a result, we proposed the introduction of as fair judging criteria as possible, and a Robocon-
like pilot competition and Molecular Robotics Award, which in the long term could turn into a
more Robocon-like competition. Our proposal was to introduce the essence of the open and fair
judging system of Robocon into the framework of iGEM. Our proposal combined advantages of
two competitions of totally different origins and most of the ideas were adopted. However, the
proposed judging system based on point scoring is still utilized essentially in its original form
in BIOMOD. After the Robocon-like pilot competition and Molecular Robotics Award at the 1st
competition in 2011, we concluded that it was too early to have a Robocon-like competition as
part of BIOMOD. The award remains as a special award given to the project contributed the most
to molecular robotics among the projects of all participating teams.
—Shogo Hamada (Cornell University)

3. Judging System of BIOMOD


BIOMOD is comprised of 3 categories: Website, YouTube, and Presentation. It’s important to un-
derstand the judging system in order for your project to be competitive. This section describes
the types of awards in BIOMOD and the scoring system to determine the award winners.

Overview of Judging System of BIOMOD


About a week before the final judging, each participating team must upload a Website that
provides a detailed description of the team’s project and a project video (YouTube) of about 3
minutes long. The video should give a lucid explanation of the project. At the final judging, each
team is given 12 minutes to do a presentation on their project. Based on these three categories,
Top Prizes (overall score ranking), Gold, Silver, and Bronze Award winners are determined.

Kinds of Awards of BIOMOD


- Bronze: Team satisfied all minimum requirements for judging (i.e. submitted a complete
project Website and uploaded a YouTube video).
Chapter 1 Let’s go to BIOMOD 8

- Silver: Satisfied criteria for Bronze, plus at least one device (part of the system) in the team’s
design worked as expected.
- Gold: Satisfied criteria for Silver, and overall point score from Website + YouTube + Jamboree
presentation are in top 50% of all teams.
Most of the awards celebrate top three teams. The Website, YouTube, and presentation scores
are independent of each other. As of March 2016, the Tohoku University team is the only
Japanese team to won first place in any of the major categories (YouTube Award, Presentation
Award, and Grand Prize in 2012, and YouTube Award and Grand Prize in 2015). In the Website
category, the best Japanese team was the Tohoku University team in 2014, which won 2nd place.
The Audience Choice Award, not included in overall score ranking, is given to the best presen-
tation determined by vote of student participants. Usually, the Audience Choice Award winners
are the same as the Best Presentation Award winners.
There are two more awards, each of which is given to only one team: the Best T-shirt Award
(TEAM TITECH won this in 2013) based on the vote of participants and MOLBOT Award supported
by Molecular Robotics Research Group from Japan (see the column for details).

Specifics of Judging System of BIOMOD


A perfect overall score is 100, with the Website category worth up to 50 points, YouTube worth
up to 25 points, and presentation worth up to 25 points. Judging criteria for the three categories
are fully provided on the judging criteria page [1]. Because the judges also read this page to learn
how to judge, you should thoroughly read and understand the judging criteria. Even with an epic
project, you cannot win the competition if you ignore key elements of the judging criteria.

About Judges
For each category, the judges consist of mentors of participating teams and invited experts.
Judging is done on an anonymous basis, and each mentor is not allowed to evaluate her/his
team. For each team, online content evaluation (Website and YouTube) is allotted to five
randomly chosen judges. Live presentation evaluation is done by all judges present. Highest
and lowest judges’ scores are excluded, and the three remaining judges’ scores are averaged.

Judging Criteria for Websites


Criteria include the evaluation of three elements: Project Ideas (worth up to 20 points), Project
Documentation (worth up to 20 points), and Project Execution (worth up to 10 points).
- Project Ideas. (1) Relevance (worth up to 5 points): Has the team made a strong case that
their project idea is scientifically and/or technologically interesting? (2) Specification (worth
up to 5 points): Are the project goals well-defined? (3) Feasibility (worth up to 5 points): Was
it reasonable to expect that the proposed solution could be implemented by a BIOMOD team
Chapter 1 Let’s go to BIOMOD 9

in one summer? (4) Merit (worth up to 5 points): Is the proposed solution a good one? Is it
particularly elegant or innovative?
- Project Documentation. (1) Clarity (worth up to 10 points): Is the project description well-
written and easy to understand? Does it include the necessary background information and
motivation for the project, methods, results, and discussion? Are the figures easy to understand?
(2) Transparency (worth up to 5 points): Are all of the raw experimental data and source files
easily accessible? Would it be straightforward to attempt to reproduce the team’s results? (3)
Layout (worth up to 5 points): Is the team’s project page arranged in a clear and logical fashion?
- Execution: Did the team accomplish what they set out to do?

Judging Criteria for YouTube Videos


Criteria include the evaluation of three elements: Overall Impact (worth up to 10 points), Clarity
(worth up to 10 points), and Production (worth up to 5 points).
- Overall Impact: Was the video interesting? Did you want to watch it more than once? A beautiful
image, interesting trick, and unexpected story will help. It’s best to keep the video under five
minutes because most people will not want to watch a video longer than four minutes twice.
- Clarity: Was the project described in a simple and clear manner that could be easily understood
by a wide audience? Basically, any BIOMOD project of any team is complicated. You must
describe your project simply and without too many details. Watch the videos made by any of
the top three teams in previous competitions and you can learn a lot.
- Production: Were the sound and video high quality? Were the images focused and scaled
properly? In most cases, a high scorer in other criteria also gets a high score here. Top teams
score similarly for this criterion; noisy video, either visual or auditory, can easily lose points.

Judging Criteria for Presentations


Criteria includes evaluation of three elements: Content (worth up to 10 points), Delivery (worth
up to 10 points), and Impact (worth up to 5 points).
- Content: Were the slides clear and easy to understand? Did the project narrative have a logical
flow, with clearly stated goals and results?
- Delivery: Did the speaker(s) give a well-rehearsed, well-paced presentation? Did the speaker(s)
engage with the audience and maintain good eye contact?
- Impact: Was the presentation interesting? Fun? Clever? Memorable?

MOLBOT Award
The MOLBOT Award, sponsored by Molecular Robotics Research Group from Japan, is given
to the project that most contributed to molecular robotics in the BIOMOD competition of the
year. The winner is determined by the judges, based only on the live presentation score. Thus,
in order to win this award, you must emphasize in the presentation how you tried to assemble
Chapter 1 Let’s go to BIOMOD 10

a molecular robot.
Japanese teams have been dominating in this category. In the 1st and 3rd competitions,
Japanese teams won this award. In 2014, the Columbia University team won the award, a non-
Japanese winner for the first time. Competition for this award will become more and more
intense. However, because molecular robot projects are of high difficulty and thus not much
attempted by the students, a project with a good concept can compensate for less than prefect
results. I hope the readers of this book try for this award.

—Kei Fujiwara (Keio University)

4. Days at the Jamboree


The BIOMOD Jamboree was held at Harvard Univeristy but has now been moved to the Uni-
versity of California, San Francisco, USA [1]. “Jamboree”, which sounds unfamiliar to Japanese,
means a competition to be enjoyed like a festival. BIOMOD is and should be fun, in addition to
being a learning experience.

Schedule before/during BIOMOD Jamboree


After you upload your Website and YouTube, and prepare your presentation materials, you are
heading for the BIOMOD Jamboree (the final). The Jamboree schedule is divided into three
parts: Registration on Day 1, Presentations on Day 2, and Awards & Closing Ceremony on Day
3.

Day 1: Registration
Most teams will fly into San Francisco International Airport (SFO). Japanese teams can take a
direct flight from Narita. To get from the airport to hotel and/or registration, car service apps
such as Lyft or Uber can be used from anywhere in San Francisco and surrounding areas.
Download the app on your smart phone and set up an account. Both apps use your smart
phone’s GPS to detect your location and connect you with the closest available drivers. You
will receive a text message with a photograph of your background-checked driver and their car,
you’ll be able to track their approach, and get a text message when they arrive. Payment can
be split among passengers and will be securely charged to the credit card on file. No cash will
change hands.
At the registration desk in the hotel lobby, you will get a registration packet with a name badge,
and a BIOMOD original T-shirt. If you arrive after 20:00, you can register at the competition venue
on the morning of the Jamboree.
Chapter 1 Let’s go to BIOMOD 11

On Day 1, conference rooms in the hotel are usually available for practicing presentations. This
is a good idea, but you have reserve them. Rehearsing is crucial to success on Day 2. Some of
the teams keep rehearsing until midnight. Sometimes teams are still rehearsing on the day of
presentation, but generally this didn’t seem to result in a good presentation. Make everything
ready by the day of presentation.

Day 2: Start of the Jamboree


This is the day the Jamboree really kicks off. Participants gather in a room, and the session starts
at 08:00 AM. Dr. Shawn Douglas gives the opening remarks, after which the participating teams
present one after another. Each team is allotted twelve minutes of presentation plus 3 minutes of
discussion. All the teams have different vibes which contributes to the festival-like atmosphere.
In some teams, everyone wears a solid suit, some teams entertain the audience with cosplay
or tricks, and some teams appeal to the audience with a play or cross-talk. The photos on the
BIOMOD Facebook page³ give a good idea of the atmosphere.
There are two 15 minute coffee breaks, one in the morning and one in the afternoon, and you
can drink some coffee and eat snacks to relax, or do a final check of your presentation. At lunch
(usually buffet), people eat in ones, twos and threes, and seem a little too nervous to become
friends with other teams yet. There is a group photo time just before/after lunch. During the
coffee break, a photographer comes around to take team photos. After the presentations are
done, Day 2 is over. You can enjoy dinner and relax. The Jamboree included a party night in
2013 but not in 2014.

A Day of 25 Hours
Daylight Saving Time ends in the first week in November, often around the same time when
BIOMOD is held. This means that clocks are set back one hour at 2:00 a.m. There is no special
ceremony such as bell ringing, because it is midnight. Nevertheless, it is surely a special night
that you cannot experience in Japan or other countries.

Day 3: End of Jamboree


Day 3 begins with a Sponsor Workshop by Autodesk (this was after the presentation of the
last team in 2014-2015), followed by the Awards & Closing Ceremony. In the ceremony, Bronze,
Silver, and then Gold Award winners are called up and receive award certificates. Then Category
Awards, MOLBOT Award, Grand Prizes, etc. are awarded. Videos of the top 3 YouTube Award
winners from the previous year are run on a large screen at the competition place at full volume.
There is a photo session, but no winner’s speech.
³https://www.facebook.com/biomod/photos/
Chapter 1 Let’s go to BIOMOD 12

After the Jamboree is over, you are free until the return flight. Let’s stroll around UCSF and San
Francisco. Some sites of interest are listed in the column.

Talk to Other Teams


During the BIOMOD Jamboree, you will meet students from various countries at coffee break,
lunch, and social times, to whom you can talk, with a little courage, to speak English. Japanese
students are relatively shy and tend to flock together, possibly due to lack of confidence in
English skills. You should try and talk to students from different countries. Exchanging greetings,
asking where they come from, introducing your team, and asking about theirs will start the
conversation. Tell them how you enjoyed their project, and they will be more excited about
the conversation. The foreign student participants I talked to always welcomed me. Only the
Japanese students, who didn’t want to speak English, appeared a bit standoff ish. It would be
a pity if you end up getting new but only Japanese friends at the international competition. Try
and explore the world.

How to Enjoy San Francisco


Once BIOMOD is over, you are free, released from days of struggle. The world looks brighter
again. Now you can enjoy San Francisco. There are a bunch of sights including the souvenir
and other shops, restaurants, and so on at the Ferry Building, Golden Gate Park, and Alcatraz.
San Francisco has great restaurants. If you are in the mood for Mexican food, head to the
Mission neighborhood. You may wish to visit La Taqueriaᵃ (2889 Mission St, near 25th st), which
is often crowded since it was ranked the Best Burrito in Americaᵇ, although the tacos may be
superior to the burrito. Other great options include Taqueria Vallarta (3033 24th Street, near
Harrison) with excellent $2.50 street tacos, or Taquerias El Farolito (2779 Mission St @ 24th) for
great burritos.
ᵃhttps://www.facebook.com/LaTaqSF/
ᵇhttps://fivethirtyeight.com/burrito/

— Kei Fujiwara (Keio University)

5. Learn from Previous Winners


BIOMOD is a biomolecule design competition, and there is a high degree of freedom in what you
do. In this section, we will take a look at the projects of previous Grand, Website, YouTube, and
Presentation Award winners to understand the project genres and styles that can end up with a
high score.
Chapter 1 Let’s go to BIOMOD 13

Previous Grand Prize Winners


Below are the projects that won a Grand Prize in the 1st to 4th competitions. In the 1st compe-
tition, the winning project was a DNA/RNA nanostructure with potential clinical applicability,
demonstrating that a nanoscale octahedron carrying a siRNA drug with potential therapeutic
use was actually able to act on a cell. In the 2nd, the winning team created a DNA nanostructure
that made a channel in a cell membrane and actively transported DNA molecules. It featured
a DNA transport system driven by the stretching/contraction of “porter” DNA molecules. In
the 3rd, it was a system with improved arrangeability that consisted of a DNA nanostructure
combined with a nanodiamond useful for bioimaging. In the 4th, the winning project was a
sensor whose design was based on cooperation between neighboring molecular beacons, in
which a DNA nanostructure recognized an environmental substance to emit light with high
precision.

Highly Evaluated Projects


These previous Grand Prize winners all focused on DNA-based nanostructures. Because the
establishment of BIOMOD itself was an effort to spur innovation in the technology of molecular
self-assembly of DNA at the nanoscale, projects based on this technology tend to get a high
score.
On the other hand, projects focusing on the construction of protein-based systems tend not
to win as much. This is probably because most biological systems depend on combinations
of proteins, so that designed assemblies may not get high scores without going beyond these
principles somehow.
In between these two types of projects are those that apply the sequence specificity of DNA
to logical operation or DNA amplification. This is probably because they score high in terms of
design and execution, but tend to score low in terms of clarity in presentation and Website. To
avoid this, you should use simple schematics and similes instead of symbols like A-B-C and D-
B’-A’ (the trade might like the latter, but not other people).

Highly Evaluated Websites


You can read any of the Websites of previous participants [1]. If a team deleted their data, you
can trace the page history and access it.
Example of good Websites include those of Team Dresden (Germany) 2012-2014, and Team
TITECH, 2012. The most likely “winner’s formats” for BIOMOD can be found in the Websites of
the Harvard University team, 2012, the LMU team, 2013, and the Tohoku University team, 2014.
Their websites describe briefly results that align with BIOMOD’s goals and accord well with all
judging criteria.
Your Website should include a figure that describes the degree of Execution of the project.
The judges will refer to this figure to understand other data in the Website page and properly
Chapter 1 Let’s go to BIOMOD 14

evaluate what your team achieved. If there is no such figure, they will start looking for what has
not been achieved.
Since Execution and Feasibility are each worth up to 15 points and Relevance and Merit are
each worth up to only 5 points. Therefore, a subject matter that is fairly interesting (Relevance ₃
points₎ but has high feasibility ₍12 points; thus high Execution points) will result in a better total
score. Therefore, it is better to assemble an achievable unique structure, instead of trying for
medical and industrial products that are not easily achievable (and not unique) and thus are
likely to lose points.

Highly Evaluated YouTube Videos


There are two major factions competing for YouTube Award: one of them clearly explains their
project based on beautiful images (“image faction”), and the other based on comedic value
(“comedy faction”). In short, clarity is a prerequisite, and you just choose which faction you
want to belong to: image or comedy. Image won in 2011, 2013, and 2014, and the comedy in
2012. The comedy faction dominates in the 2nd and 3rd place. An image faction team will need
a flawless project (no excuses) and a very high quality video to win. If you want to win 1st place,
try for the quality of the 2013 winner (3D) or the 2014 winner (2D) project.
A comedy faction team will need something more than ordinary gags to make an audience
of different nationalities laugh. Intelligent comedy (for example, the Tohoku University team,
2012-2013) or a standard funny video (for example, Team TITECH, 2011-2013) have been winning
choices.
Because YouTube does not need to include the results of the project, a team with a good project
idea but without experimental results can win this Award with good technique and scenario. If
you are not successful in the experiment, put all your efforts into YouTube.

Highly Evaluated Presentations


Remember that this is not a presentation in a meeting. Judges do not always have an interest
in your project or your presentation. You must appeal to them in order to win. Previous winners
tried in various ways to make the audience interested and the project well understood: e.g.,
through drama, by the use of props and characters, and through cross-talk. In particular, cross-
talk of two or more, with one speaker pointing out illogicalities and obscurities in the story and
another solving the problems, can be appealing to the audience. Such efforts will be reflected
in the Delivery score of the Presentation category.
The keys to preparing good slides are the same among presentation styles. Refer to the presen-
tation tips section (section 10) for details. Presentations with fewer words and more animation
were more easily understood in the previous competitions. A short presentation is not a problem
but a long one will offend the judges and consequently may lose points. When the Tohoku
University team won Best Presentation Award, the presentation was about 10 minutes long.
Chapter 1 Let’s go to BIOMOD 15

If you are unable to memorize the script for the presentation in any way, do not worry. You
can perform a puppet show while a narrator reads the script behind the curtain. You can also
have one of the members show you cue cards from an audience seat, or you can have the script
written on your hand or accessory (fan, scroll, or the like).
Chapter 1 Let’s go to BIOMOD 16

Do NOT rely on judges to fill in the blanks


Students tend to make the mistake of thinking that the judges will understand your project
because you have told them or written it down. The judges have a very short time to evaluate
your team, usually 15 minutes and up to 1 hour at most. To make judges “understand” in such
a short period of time, being logically clear and straightforward is not enough. You must check
every detail. Can the judges understand the presentation only by looking at the figures? Is
the script polished and of a minimal length? Are the necessity of each experiment and the
interpretation of the results clearly described? Will people think the project is great at a glance
at the front-page image?

—Kei Fujiwara (Keio University)

6. BIOMOD Calendar
This chapter describes a standard working schedule and team organization. Because BIOMOD
is a team competition with a defined deadline, choice of team organization is very important.
Appropriate team organization will depend on your team members and on support from labo-
ratories.

Standard Schedule for BIOMOD


BIOMOD starts in Spring and ends in early Winter. The 6 months will pass quickly. You have to
work with it in parallel to your regular classes, so the schedule is relatively tight. A standard
schedule is as follows.
April Start a team
June Literature review and brainstorming of project ideas
August Start the experiment
September Regional BIOMOD practice @ Japan
October Prepare for the presentation
November Final @ San Francisco

The team starts in April. The team can be put together in various ways: with friends, with
members of a single laboratory, etc. The BIOMOD website (HP) recommends no more than 10
team members. Somewhere between 4 and 10 seems appropriate. Some say that the team
members should have different backgrounds and others like a group with similar backgrounds.
Either seems fine. Having different backgrounds will introduce a variety of ideas and skills into
the team, but runs the risk of causing the project to be less focused. Similar backgrounds will
Chapter 1 Let’s go to BIOMOD 17

facilitate mutual understanding, but may not be good for the brainstorming of project ideas.
Good team organization is key.
Once the team is agreed on, you need to go through lectures and textbooks and literature
reviews to find project ideas. The University of Tokyo Kashiwa team assigned relevant chapters
of the textbooks “Molecular Biology of the Cell” and “DNA Nanoengineering” to team members
in a journal club format. Note that too much textbook learning will take away the time and
energy for the brainstorming of project ideas, and may make it feel like another regular class,
resulting in a passive attitude of the members. The Kashiwa team also introduced a literature
review in a journal format, assigning about 2 papers per student. Based on such knowledge, you
will start brainstorming project ideas. This brainstorming is an important factor for the success
of the project, and is described in another section.

Typical team meeting in Japan

When summer vacation comes, the experiment starts. The experiment can be carried out by all
of the team members or only by the experimenter(s), depending on the team. In the case of the
University of Tokyo Kashiwa team, every member worked on the experiment at least 3 days a
week during the summer vacation (except during Obon or other holidays). Many of the members
worked for much longer hours.
A month after the beginning of the summer vacation, there is a trial in Japan. It is held at the
Hongo campus of Tokyo University in the beginning of September every year. At this domestic
trial, the primary purpose is to determine the direction of many aspects of the team, so the
project does not need to be complete at this point. Nevertheless, because you will be judged
in the same format as the final, you have to make some preparations for it. The presentation
must be in English, so you will have to do exercises in English in addition to presentation. The
result of the domestic trial does not necessarily correlate with the result at the final. Many teams
were motivated by their defeat at the trial and had a good result at the final. If your result at the
domestic trial is not good, just cheer up and work hard again toward the final.
After the summer vacation passes and Autumn comes, the team will have to push toward the
final. There are the experiments to be done, the presentations to be prepared, and the Website
and YouTube to be made. You must appropriately assign tasks to the members. All of these steps
Chapter 1 Let’s go to BIOMOD 18

will test the cooperation of the team members.


The final is the presentation in San Francisco. You may feel nervous to see foreign teams, but
they are too. Make the best preparations that you can and rehearse so you are prepared. The
result will reflect the amount of your preparations. In BIOMOD, many of the presentations are in
a drama format. Observe previous presentations to learn how to appeal to the audience.

Team Organization
As in a club or a part-time job, the most important position is the leader. However, the type
of leader required will depend on the team members so there is no single universal solution.
Usually, anyone who wants to be the leader should be. In addition to this general leader, the
University of Tokyo Kashiwa team also included sub-leaders such as a principal experimenter.
This was because the team leader, who oversees the entire project, might not to be able to make
an objective decision due to being too concerned about the experiment if she/he were concur-
rently the leader of the experiment. Because BIOMOD is a short-term battle, it is necessary to
balance ideals with reality and short-term considerations with long-term considerations. The
leader should be able to consider everything and make decisions when necessary. The principal
experimenter is responsible for the execution of the experiment, and should communicate with
the leader and the mentors and TA to decide the experimental plan and the goal, and share
information with other experimenters to direct the experiment. During this process, data should
be filed and processed whenever it is appropriate. In the battle against the deadline, there is a
great difference between preparing a Website and its figures from scratch and preparing it from
figures already finished or half-finished for the presentation slides.

Brainstorming Project Ideas


The success of a project principally depends on its ideas. You should spend as much energy and
time as possible on generating project ideas. If nobody comes up with a good idea, you can
modify existing projects or brainstorm ideas with other members. Alternatively, you can find
problems that you want to solve in your daily life. Note that, although brainstorming is fun, ideas
are often raised but not discussed. You may need an organizer, someone to gather feedback, or
make other rules. You should present your ideas, even if premature, to other members, and have
them scored by other members (if possible, like at the competition). This will reveal advantages
and disadvantages of the ideas, which will be helpful in choosing the subject matter. On the
University of Tokyo Kashiwa team, groups of 2-3 members each presented their ideas weekly
for a month (once a week, about 4 times in total), and then all voted on their topic. Once you
have chosen the subject matter, discuss it with the mentors and TA to decide the experiment
required and the goal to be achieved by BIOMOD. You may have to switch to a more practical
plan in the course of the project. The domestic trial is a good opportunity to assess this; take
advantage of it as much as possible.
— Hisashi Tadakuma (Kyoto University)
Chapter 1 Let’s go to BIOMOD 19

7. Tips on Group Work


In BIOMOD, undergraduates with almost no laboratory research experience make a team, and go
through project proposal, planning, experiment, and making a video, Website, and presentation
slides in 6 months. The efficiency of group work [0] is a critical factor in success. Here are some
tips that have been/will be effective.

What Is Group Work


Group work (teamwork) originated in the Stone Age, when people needed to cooperate, i.e., to
make a team, to hunt large animals that they were unable to fight against individually [1]. In the
three elements essential for the success of a project, “human, material, and financial resources”,
the effectiveness of the latter two depends on the cooperation of members.

Divide the Work


Even Leonardo da Vinci, an “all-round genius”, might not be able to win BIOMOD all by himself.
Assign tasks, such as leader, designer, experimenter, Web manager, Youtube producer, BGM
producer, CG producer, and presenter, to members, taking the schedule of each task into
consideration. Members usually work independently with minimal teamwork and the results
combined are enough to win the competition [2].If the project is well defined and strong points
of individual members are known, task assignment will be effective. As the project proceeds,
task reassignment may be possible based on the strong points and weak points of the members.
The strong point, if any, of a freshman who is new to BIOMOD is difficult to see. Work in
an atmosphere that allows the members to find out strong points of each other. Around the
sophomore grade, students begin to exhibit interesting personal variation. Produce a more than
additive effect out of individual strong points through meetings and information sharing.

Group Work Cycle


Up until the deadline (see section 6), follow a pattern of “Meeting (report/discussion, task
assignment, deadlines set) → Execution of individual tasks (experiment, data analysis, report
writing) → [Repeat these 2 steps] → Jamboree”.

Before Meeting
Because you and the members have separate brains, you need meetings. Find a place for both
meeting and working where the team members can come together anytime and even make
noise. You can use club rooms in the University or meeting room rentals, but you may have
extra troubles in the final stage before the deadline if the meeting spot is located far from the
Chapter 1 Let’s go to BIOMOD 20

advisers. In Tohoku University, it should be in a student room. I never recommend a house of any
member for these reasons: The place should be equipped with a fast Internet connection and
a whiteboard, to which a hard copy can be also attached with a magnet. Digitizing technology
such as electronic whiteboards is attractive but still expensive and slow in response, and also
not good at distinguishing texts from figures. Both handouts and whiteboard content can be
conveniently recorded by cellular phone → digitized → shared. The frequency of meeting at
Tohoku University was once a week usually and twice a week or on demand from Autumn to the
deadline. Put the calendar in the meeting room so that everyone can see it, with the schedules of
the different tasks aligned with each other from the start to November, which facilitates sharing
schedules. Count down by the week will make people aware of the deadline. Also write in the
schedules of the advisers.

What to do at meetings
Check the entire schedule/current problems/process steps/task assignment, share information,
and discuss possible solutions to problems. Brainstorming of project ideas may seem to be the
most important task, but will be discussed later.
Meeting Organization
- Chaired by the team leader. Does not need to be a funny comedian.
- A record keeper is a must. Let the proceedings be read by everyone just after the meeting.
- Distribute handouts. One-way traffic information through slides is insufficient.
- Bring a list of the problems to be discussed.
- Any problem unsolved at the meeting should be assigned to any of the members, with a defined
deadline.
- Thank the observer(s).
- Do not get personal.
- Do not repeat the discussion after the meeting. If you have to, do it at the next meeting.
- Time >>>> (Impassable barrier) >>>> Money
Regularly holding meetings can help you identify problems, and thus are good for the mental
and physical health of the members.

Brainstorming Project Ideas


There is a saying that a good idea shall win BIOMOD (see section 12). An excellent idea excites
you and will have a long half-life. It may even allow the undergraduates to compete with
professionals. Thus, you should brainstorm ideas (see section 13) at a different time from
meeting times. However, a feasibility study is the highest hurdle, and the most time-consuming.
Once you have explained an idea to others, the idea has to be evaluated. Either bring it back
home for reconsideration or have it judged by the advisers if it is promising. Of course, the
greatest idea would be one that has never occurred to (outwits) the advisers. The advisers are
professional scientists, who have survived by outwitting existing ideas, so that they would be
Chapter 1 Let’s go to BIOMOD 21

pleased to be outwitted.
Share Your Information with Other Members
If you don’t share information among the team members, it can result in a tragedy such
as a plurality of members happening to do the same task separately. Share the address of
team members to send/receive a meeting schedule/room number, sick note, etc. Personal
information management is important.
- E-mail: Be polite. Avoid sending a file to the mailing list (share a link to cloud storage).
- Twitter: Fast. Stable. For announcements. Not suitable for discussion.
- Facebook: Pretty fast. Communication before/after journal club or meeting. Not suitable for
storing experimental data.
- Texting / Line: Fast. Stable. For announcements.
- Skype and other TV meeting systems: Basically for information sharing between two people.
Check also prepared data and discussion. Seems to allow a conclusion faster than telephone.
Facial effects?
- Google Docs: Fast. Fairly stable. Because it allows simultaneous editing, you and other mem-
bers can edit the proceedings over a Skype chat to save time. Also this can be run on a projector
for brainstorming of project ideas.
- Evernote: Powerful note-taking tool. Fast. Group sharing is a little bit slow (as of March 2015).
- Face-to-face discussion: Super effective. Courtesy should exist even among intimate friends.
Prior appointment and punctuality essential for persons who are busy or whom you meet for
the first time.
- Telephone: Fast. Stable. A double-edged sword that is convenient for the caller but takes away
the time of the other once answered. Do not hesitate to use in emergency.

How to Share Data


Whether it relates to group work or BIOMOD, those who have materials shall survive. Every detail
of your experiment must be recorded in a laboratory notebook. It is convenient to share digital
data, including experimental data and presentation data, on a cloud storage which is accessible
to every member. Be careful not to delete the data during editing. Free data storage service
include Google Drive (15 GB for free), One Drive (15 GB for free), Drop Box (2 GB for free), iCloud
(5 GB for free), and so on. You can set up your own server, which is safer, but its maintenance
will take a lot of time. Previous Tohoku University teams all used Google Drive. When you have
uploaded data, don’t forget to tell everyone. It’s important to standardize the format of files.
The background in image files should be transparent, which reduces time lost due to piles of
file editing. Image files should be in PNG format which is small in size. See sections 9 and 59 for
video, sections 56 and 57 for image files, and sections 54 and 56 for other key contents.

Conclusion
At BIOMOD, your project will be evaluated by other people. In order to get as good an evaluation
as possible, you should have as many people as possible comment on the project beforehand.
Chapter 1 Let’s go to BIOMOD 22

You must decide which comments to listen to and which to discard. No questions asked at the
competition should be new to you. Always prepare for the discussion as a team.
— Shinichiro M. Nomura (Tohoku University)

8. Tips on Website Creation


Website is worth up a relatively high number of points, which makes it very important at the
BIOMOD competition. It also reflects the character of your team explicitly, including the layout,
so you should never make light of it. Refer to sections 53-59 for the basic procedure to make a
Website.

Importance of Website
A good Website is key to success, because it accounts for up to 50 points of a perfect total score
of 100 points. In order to win the competition, it is important to make the team Website clear
and interesting.
Judging criteria for Website
- Project Idea (worth up to 20 points): Relevance, feasibility, etc.
- Project Documentation (worth up to 20 points): Layout, clarity, etc.
- Project Execution (worth up to 10 points): Did the team accomplish what they set out to do?

Design of Website
The Website should consist of more than one page for reasons of clarity. A standard construction
is as follows.

1. Top Page: Usually this includes figures and YouTube so that the overview and results of
the project can be understood at a glance. Some teams only describe the project overview
here.
2. Project Overview: Briefly describe the background and goal of the project, using a fig-
ure(s). In particular, clearly describe “the necessity of the use of a DNA nanostructure for
the purpose” to justify the project.
3. Experiments/Calculation: Arrange figures into a clear story, not just a list of the results. If
you have too many figures to show, you can include less important figures in a supple-
mentary information page.
4. Discussion: In most cases, the project will not proceed as expected. Describe the results
and unsolved problems of the project clearly so that it is useful for future BIOMODers and
researchers interested in the BIOMOD pages.
Chapter 1 Let’s go to BIOMOD 23

5. Protocols: Like paper writing, the project will be forgotten if others cannot reproduce your
results based on the Website. Describe your protocols in as clear and detailed a manner
as possible. The underclass students will enjoy learning from it in the next year.
6. About the Team: Introduce your team members. Also include acknowledgements and
links to the sponsors who support multiple aspects of BIOMOD.

How to Setup A Website


There are many ways including GitLab Pages, GitHub Pages, OpenWetWare, Weebly, or Word-
press.
GitLab Pages⁴ is currently the preferred option for hosting BIOMOD websites. It allows the
greatest flexibility in that any major static-site generator can be used. On the other hand, it may
requiring learning some skills with HTML and git-based version control.
GitHub Pages⁵ was introduced as a website option for BIOMOD in 2014. Like GitLab, it supports
simultaneous editing and allows you to easily restore previous versions because it retains
the edit history. GitHub Desktop⁶ can be used to highlight changes to your files, so that it is
convenient for simultaneous editing.
Early BIOMOD teams used OpenWetWare⁷. OpenWetWare seems good in that it can be operated
intuitively, and has been employed since the first competition so that it has a lot of scripts and
other merits. It is also highly compatible with image and video files. However, it does not support
simultaneous editing of a single page, which means you will have to work in a place where you
can talk with other editors just before the deadline, for the sake of efficiency.
Weebly⁸, and Wordpress⁹ are allowed. These are likely the most user-friendly options if your
team does not have a technical lead, but they can be inconvenient to archive for viewing by
future generations of BIOMOD students.

Tips on Website Writing


Website writing is not a task to be done just before BIOMOD. Whenever it’s appropriate, process
obtained data to facilitate Website writing. For this purpose, draw hypothetical figures based on
the expected results that you will need to demonstrate the story, before the experiment starts.
At first such a figure might be hard to imagine. Published information and discussion with the
mentors and TA will help you. You should train in this way already (plan backward from the goal).
It will be useful when you join a laboratory.
⁴https://pages.gitlab.io/
⁵https://pages.github.com/
⁶https://desktop.github.com/
⁷http://www.openwetware.org/wiki/Biomod
⁸https://www.weebly.com/
⁹https://wordpress.org/
Chapter 1 Let’s go to BIOMOD 24

Technically, fixing the format (size, font, etc.) of the figures to be uploaded on Website and them
filing them will facilitate Website writing. Figure titles, legends, and other considerations should
be attached to the figures, so that a person who processes/uploads the figures will not have to
ask for the information later . Another hurdle in Website writing is English. In order to make
a good Website in English, read papers to learn technical terms/phrases, and let your draft be
reviewed by the mentor, TA, etc.
— Hisashi Tadakuma (Kyoto University)

9. Tips on Making YouTube Video


The YouTube video is the introduction for the judges to come and see the content of your project.
Below are some tips on making it.

Why Video Is Important


The video category makes BIOMOD unique among and other student competitions. You have
to upload a video of about 2 to 3 minute long that introduces the content of your project on
Youtube, Vimeo, or any other online video-hosting website, which is then scored by judges. The
judging criteria are as follows.
Online video (worth up to 25 points/perfect total score of 100)
- Overall impact: Was the video interesting? Did you want to watch more than once? (worth up
to 10 points)
- Clarity: Was the project described in a simple and clear manner that could be easily understood
by a wide audience? (worth up to 10 points)
- Production: Was the sound and video high quality? Were the images focused and scaled
properly? (worth up to 5 points)
The video is the introduction to your project. First impression is always important. Most people
will decide whether to read over the detailed content of your project based on the video. This
also applies to the judges to some extent. An easy-to-understand video will make the judges
understand your Website and Presentation to a greater extent, and even make them more eager
to read/listen. Also, a clear and impressive video will be helpful in describing the project in the
following Website and at the Presentation. This section will introduce tips on making a video in
the order of the process steps.

Story
However beautiful your graphics are, the most important thing in your video is the story. A good
video must be based on a story that clearly describes the project and at the same time has an
Chapter 1 Let’s go to BIOMOD 25

impact.
This piece is dependent on how well you brainstormed project ideas to decide the subject
matter. Always keep in mind whether a given idea will turn into an interesting video (or how
the idea can turn into a story with an impact).

Planning
Empirically, many teams are panicked and start making the video when the last month comes.
You should avoid this if possible. I recommend that each team have a director. The director
should watch and study previous BIOMOD videos, and practice camerawork, video editing, and
the like by herself/himself. If the director is also an audio engineer and CG engineer, she/he
should practice these tasks, too. Filming a documentary of a team meeting and editing it will be
an effective practice. After the domestic trial (or alternatively some modification of the project),
you will have picked the specific subject matter of the project. Ideally, the director should have
acquired enough know-how to start shooting the video by this time.
Before shooting, you should check the schedule. After you begin shooting, you should collect
80% of the required shots in the first 20% of the remaining days. In the last lap, you will be often
occupied by the experiment or Website writing. Finishing predetermined parts of the video as
soon as possible will help out later. The entire process can be viewed as follows.

Video Timeline

Scenario
To start with, decide exactly what kind of flavor and story the video should have. The scenario
is important, which you have to fully work out in advance to make the entire process of video
production go smoothly. Needless to say, the scenario also affects the description of your
project in Website and Presentation categories, so that it should consist of a story acceptable
to all members and a clear description. You can run a scenario competition to choose one of
those written by the team members. Alternatively, you can borrow a standard genre format as a
sample form to write your scenario, though it is not good for your originality. In order to engage
the audience within just 3 minutes, you can adapt styles of well-known video games, movies,
and TV dramas to introduce your story. (Anyway, enjoy this process.)
Chapter 1 Let’s go to BIOMOD 26

At this point, the director should, if necessary, correct the scenario in terms of the impact of
the story, clarity of the description, and even the feasibility of the story, based on the know-
how obtained so far. If the description part cannot include many CG shots, look for alternative
means, such as reduction of the shots. You can also write individual lines for specific actors
(“Ategaki” in Japanese), so that these amateurs can act easily. Location shooting outside the
University will usually take too much time. You can write the scenario so that it will take as
little time as possible. A green/blue screen is convenient, but note that it takes much time in
the postproduction step.
Once the casting has been set, gather the actors in one place to rehearse the voice recording at
least one time. In particular, if the actors are to talk in English without any subtitles, every actor
has to do a thorough rehearsal in advance. Here, the director should watch the actors speaking
carefully. If there is anyone who cannot talk in English in any way, all the lines can be displayed
as English subtitles. In this case, you need English subtitles translated from the original script
written in Japanese. (Each subtitle should be of a length that can deliver the meaning of the
subtitle at a glance.)

Storyboard
Once the scenario has been written, the entire story should be somewhat clear. The director has
one more step to prepare for the shooting: the storyboard. This places individual shots into the
scenario (storyboarding), based on which specific camera techniques, SE, and BGM are assigned
to the shots. Once the shots are determined, get a stopwatch and count the individual shot
lengths, with their lines and the acting imagined in your head, to adjust these and other factors
including staging. If any subtitles are to be used, balance the lengths with those of the shots and
lines at this step. (Actually, some directors skip the latter steps and start shooting at once. Since
the total length of the video and the deadline are predetermined and tight, these steps will help
you imagine the actual shots and avoid retakes at the filming site, especially if you are new to
filming.) At this point, you should be able to imagine the entire video in your head and to share
the images with the actors and film crews. If there are any props required, manufacture them
during this step. If you have a CG engineer and/or a sound engineer, let them start their job at
this point. Confirm that the lengths of the shots do not exceed the total film length and that the
shots balance with each other in length.

Filming
A rule for filming is to do it at one sitting, to film all required shots in as short a period of time
as possible. First of all, decide the order in which the shots in the storyboard will be filmed.
Because you already have details of each shot in the storyboard, you don’t need to start from
the first page of the storyboard. Basically, you will film the shots as you have imagined them from
the storyboard. Of course, you can adopt a good shot that happens accidentally. Easy shots that
can be filmed independently of others, such as landscape shots, may be assigned to different
members.
Chapter 1 Let’s go to BIOMOD 27

Postproduction (Editing, Sound, and VFX)


Put all the shots together, roughly or not. (This can be done in parallel with the filming.) If there
are any shots lacking, do additional filming at this off-line step. Put the selected shots into place
according to the storyboard, like puzzle pieces, and fine-tune the length of each shot to adjust
the overall balance and the tempo of the shots. Also arrange the sounds roughly in place, and
then edit the timing and the sounds themselves in detail. Synchronizing the sound with the
shots makes your video look solid. (When you have no sound engineer, you can rely on the
loops included in Garageband, free sound resources available online, and the like.) When you are
almost done, you should improve the sound quality by dubbing the voices. At every opportunity,
show the video to other members and get feedback. Let them point out problems that the editor
cannot notice, such as any gap in the story or script, and improve it. (Such unclear problems are
usually corrected at the scenario writing step, but a kind of tempo, joining of shots, or staging
may sometimes make the audience misinterpret the story in an unexpected manner.) A finished
storyboard is advantageous because you can get this feedback at an early stage even when the
shots are yet to come.

Conclusion
This section has summarized the basic process for making a BIOMOD video, starting from the
importance of the video and along the actual video production process. Of course you don’t
necessarily have to go through all of these process steps, but with such a process always in your
mind you can make a better video. The judges and the audience are the same as you_—humans.
Thus, your video should be amusing and interesting to yourselves too.
— Shogo Hamada (Cornell University)

10. Tips on Presentation in English


For Japanese students, the greatest problem at BIOMOD is giving a presentation in English. You
have to get through it in order to get a good score. This might be your first presentation, and in
English. What if you cannot understand the questions? There are a lot of what-ifs, so just do your
best to prepare.

Logical Story Will Get You Through


The greatest problem with the presentation in English is not your English skills. It is whether
the story of your presentation is based on a clear logic (see section 12). A clear story will not be
difficult for you to express in English. The logic of presentation does not have many variations
and the following framework will always work.
- Background: Why you chose that subject matter.
Chapter 1 Let’s go to BIOMOD 28

- Purpose: What goal you set.


- Methods: What experiment you did to reach the goal.
- Results: What the result of the experiment was.
This framework is very different from “Introduction-Development-Twist-Conclusion”, which you
learned for school composition. You can use “Twist” (i.e., subject matter and/or scene change)
in a school composition but not in a presentation, especially in English. You should develop
the story of a single subject matter in a straightforward manner from the start to the end: i.e.,
Introduction-Development-“Do it straight”-Conclusion.
- Introduction: Background.
- Development: The resultant subject matter of your project (Purpose).
- Do it straight: Develop the story of the subject matter (Methods and Results).
- Conclusion: What the results mean and what they make possible.
This “straightness” can be depicted as follows.

Good vs Bad Presentation Flow

Staging in Presentation
Presenting at BIOMOD is very different from presenting at scientific meetings in general. At
scientific meetings, you will only have to describe the content of your study and its value as
clearly as possible. BIOMOD, as a student competition, allows you to present more freely in an
original way, and originality is also evaluated. In fact, the judging criteria include how impressive
and interesting your presentation is (see section 3). Therefore, staging of your presentation is
important.

Staging Techniques
- Dramatization: Incorporation of some sort of fictional drama.
- Dialogue: Cross-talk of more than one member.
- Cosplay: Use of a puppet, Kigurumi (character costume), or the like (in the middle of the
presentation).
- Visual effects: Use of a video projector.
Chapter 1 Let’s go to BIOMOD 29

- Joke: Talk with humor and wit.


- Audience participation: Ask the audience to move their bodies.
A typical staging is to explain the motivation or background of the project in a dramatic way,
e.g., a story. For example: “A crime organization generated a dangerous virus. The properties of
the virus were… We had to find measures against it as soon as possible. Therefore we started…”
Such a story makes the subject matter more interesting to the audience.
In addition, the presentation in this format allows for more than one character, i.e. speakers,
instead of only one of the authors at an ordinary meeting. This format makes a conversational
style possible, where one speaker can ask a question on behalf of the audience and another can
answer it, which makes the audience able to follow the presentation more easily. For Japanese
speakers, it helps by lessening the burden of memorizing lines in English.
Jokes can spice up your presentation. If you succeed in making the audience laugh, then you
can say the presentation is going well. However, it does not mean that you can use any joke.
Avoid jokes about social taboos (those related to sex and religion) or things that may bother
other teams (confetti etc.). It will only annoy people. It’s best not to distribute goods either.
At BIOMOD, there have been a variety of styles of presentation. In 2014, there was a team that
incorporated audience participation into the presentation for the first time. They asked the
audience to make a motion that is related to the story of the presentation (to wave or to pretend
to have a balloon over their heads), which helped the audience understand the project. I expect
a more innovative idea from you.

Rehearsal, Rehearsal, Rehearsal


In order to be successful in your presentation in English, you have to rehearse. Eye contact is
always demanded in a presentation in English, and the speaker has to look at the audience while
talking. As a result, the speaker cannot read the script and must memorize the lines. Because
of this, the script must be polished and short. Have the mentors and native English speakers
critique the script in advance. Although it may be practically difficult, the script should be ready
by one week before Jamboree.
Once the script is done, you should rehearse by reading the lines slowly and with pauses. Let
someone good at English correct any incorrect pronunciation or intonation, before it’s too late.
Rehearse again and again on a stopwatch. The number of rehearsals will be proportional to the
quality of your presentation. You can do rehearsals on the way to the USA or at the hotel, too.
Try to rehearse just before the competition. (Because the presenter’s talk must be synchronized
with the slides, the slide assistant should join at least a few rehearsals.)

Another hurdle: Discussion


The presentation is over. Fortunately the audience seems to have enjoyed your presentation.
Then the discussion time comes. The impression of the discussion will greatly affect the score,
Chapter 1 Let’s go to BIOMOD 30

although it is not included in the list of judging criteria. Non-native English speakers like
Japanese students are greatly handicapped. You might not hear the question well, and, if you
were able to understand it, you might not know how to answer. The team members on the stage
look at each other’s faces, standing still in an awkward silence… which is the worst situation of
all. Below is the list of what you can do in advance to avoid this. At the domestic trial held around
September, you are also required to do the presentation and discussion in English, where you
can get a hint of the atmosphere at the final.

How to Get Ready for Discussion


- Prepare a list of expected questions. (Also prepare answers to difficult questions.)
- Assign different expected questions to different members. You should have a rehearsal for
responding to the question(s) assigned to you within 3 seconds.
- Appoint a good English-speaking member (e.g., an overseas student) as the answerer.
- The domestic trial serves as a good rehearsal. Fix the problems found in your presentation
before the final.
— Satoshi Murata (Tohoku University)
Chapter 1 Let’s go to BIOMOD 31

11. What You’ll Get From BIOMOD


The summer vacation is not long enough to prepare for BIOMOD. You will continue to struggle
with preparation in September and October. Sometimes you might have to work all night, and
sometimes you might quarrel with the team members. What will you get after all that? Here are
some recollections of previous student participants and a comment of a mentor.

Recollection of Student Participants


You can best understand what BIOMOD is like from the voice of previous student participants.
Many of the previous student participants have reported on BIOMOD in Newsletters of Grant-
in-Aid for Scientific Research on Innovative Areas “Molecular Robotics” [1]. Here are just two of
them [2].

Ko-ichiro Katayama (Team Sendai 2013)


When looking back on these 6 months, it seems like I was haunted by BIOMOD,
24 hours a day. I was always thinking about BIOMOD: design of experiments, a
protocol to be modified, unfinished Website articles and presentation slides, etc.
What was the reason for such obsession? The pressure as the previous winner and
responsibility as an upper class student were surely part of it. However, the greatest
reason was that it was so much fun.

The greatest feature of BIOMOD is that it is a “competition of total skills to work on a


project from scratch”. You have to decide what to do and how to achieve it, conceive
all the ideas by yourselves, and work by trial and error toward the goal. You have
to not only carry out the experiment, but also prepare a Website, YouTube video,
and presentation that provide a clear and attractive explanation of your project. It
requires a wide variety of skills of the members and good total skills as the team.
Surely, it was a hard thing. However, to work hard with other team members for
BIOMOD was so much fun to me that, after BIOMOD, I felt they were the most fruitful,
really happy days in my university life.

On the way to BIOMOD, I acquired skills too. BIOMOD forced me to acquire basic
skills in laboratory experiments, PC skills to handle Website and use Illustrator, and
other skills. For me, however, the greatest thing about it was that it taught me the
importance of English.

At the final, I was shocked that my English skills were so awful compared to the
presenters of foreign teams and to other Japanese teams. BIOMOD was a miniature
of the world to me, and taught me that English was essential to work in an interna-
tional field. I want to improve my English skills not only in terms of writing/reading
but also communication, to be good at using English as a tool.
Chapter 1 Let’s go to BIOMOD 32

Risa Matsudo (Titech Nano-Jugglers 2012)


If someone asks me: “What did you do during this summer vacation?” I will say:
“BIOMOD!” with pride.

In April of sophomore year, I felt somewhat unsatisfied with my university life. I had
been thinking that a university student should actively study what she/he wanted at
her/his own will, but I was always occupied with tasks given by others. That was why
I was moved by the passion of the upper class students for BIOMOD and decided to
join the team.

In the brainstorming of project ideas, I often spent a day in the library looking for
something useful in the literature. After the project was set up, often the experiment
didn’t work well for many days and we had to discuss the possible cause for
several hours. At BIOMOD Japan, we had to make a presentation using slides and
a script that had been just prepared that morning. During the presentation I felt so
desperate and nervous that my hands started shaking and I was not able to smile
naturally. When we won Grand Prize, I was so happy that I could not stop crying.

The days before the final passed quickly. The place of BIOMOD, Harvard University,
had a large, relaxing campus surrounded by greenery. This allowed everyone of
us to carry out her/his role in the presentation, which was based on the motto of
the team “Involvement of all members,” without any pressure. I don’t know if it
was because we kept on improving both Website and YouTube until just before the
deadline, but we won more than one award, which I’m really proud of. The lobster
served at the night of the Awards Ceremony really tasted good.

I owe all the results and pleasure to the advice and cooperation of the mentors.
At the same time, I’m very grateful to the support from my family and friends, and
hard work of the team members. In order to make it better and better, I spent a very
enjoyable, fruitful 6 months discussing, colliding, studying, and laughing a lot with
the team members.

Hard work with the team members toward this one goal made me very happy,
making it a lifetime memory. To devote one summer to BIOMOD will certainly
reward you in the end.

Mentor’s comment (Tohoku University, 2011-2013)


Observing the challenge of about 30 students preparing for BIOMOD, I was surprised
by their hidden talents. One student acquired 3D graphics skills starting as a rank
Chapter 1 Let’s go to BIOMOD 33

amateur, one came to make a high-level video, another made a molecular dynamics
program, etc. None of these students seemed to have such talents when we first
met, and even a few months after the project started. Probably they were not
aware of it either. They happened to participate in BIOMOD, were forced to devote
themselves to it, and improved to a great extent.

Some of the students were not able to contribute much to the BIOMOD project, but
were kind of awakened after BIOMOD. For example, one of the students improved in
English skills from around 700 points of TOEIC to a nearly perfect score in two years,
and even mastered French by the junior grade. More than one student obtained
experimental results enough to write a paper in one year. I have worked in more
than 5 laboratories, but I have never encountered such a group of students who
had so many hidden talents or were awakened at such a high rate.

You will get not only skills/abilities/ambitions, but also a friendship with other mem-
bers through the hard work during Summer. BIOMOD is tough. Before Summer, you
might not find a good project. During Summer, the experiment might not proceed
as you have expected. Your Website and YouTube might still be unfinished when
Autumn comes. Criticisms from the mentors will often make you feel down. You
are sure to feel nervous before the deadline and the presentation, and might be
frustrated with a member who is not cooperative. The leader might sometimes
request too much from you. After you have got over all these troubles, however,
the team members will become comrades in arms.

Such skills, awakening, and friendships won’t come for free. You may get these
things after you have really devoted yourself to the project, working late, forgetting
about lunch until you have done everything you can. At a meeting after a crushing
defeat at the domestic trial, one of the students, who had always seemed to be a
happy-go-lucky one, groaned out: “I want to win. We have to change ourselves.”
Through these moments of passionate determination, the students really start
improving through BIOMOD.

— Kei Fujiwara (Keio University)


Chapter 1 Let’s go to BIOMOD 34

Tokyo University, Kashiwa, 2014


Chapter 2 Before Starting Your
Project
12. Subject Matter of Project
It may be difficult to choose a topic for your project. In addition to accommodating different
intersts of the team members, you will have to learn about designing biomolecules. Below are
some important points to consider when choosing a project.

What Do You Want to Do? Setting Priorities


BIOMOD is not an ordinary competition in which participants compete for a single, defined goal.
That is, you can choose your own goal as well as your methods of reaching it. Anything goes as
long as the subject matter of the project relates to “designing biomolecules”. In such an open-
ended project, having team consensus is important. Teams should have a solid consensus about
the group’s activities.
Your team should agree as to how to prioritize these considerations:
- Something no one has tried
- Applicability and practicality
- Is it of interest to other people
- Showy staging
- Importance of technical execution of project

Brainstorming
Once these priorities have been agreed on, you can start to narrow down the specifics of the
project. It can be any project that relates to the designing of biomolecules. You may not exactly
know what it is like to design a biomolecule, so you should consult this book, YouTube videos of
previous participants, and discuss the question with upper-class students and teachers. Then
you need to discuss your possible ideas with other team members to generate more ideas and
refine the ideas that come up–brainstorming. There are different types of brainstorming (see
section: Types of Brainstorming).
Chapter 2 Before Starting Your Project 36

Turn Tough Times into Opportunities


Sometimes brainstorming may not bring new ideas and you may be caught in a circle of fruitless
discussion. However, this is really an opportunity for you to generate good ideas. Brainstorm
over several sessions until, after days of discussion, you think you have investigated as many
things as possible but in vain, and you can come up with no more ideas. At this point, consult
the mentors. They will give you some good hints.

A Good Idea Has Clear Logic


What is a good idea, and how can you judge whether an idea is a good one. A good idea needs
to be logical. “Logical” means:

1. The team’s priority is clear.


2. The question is in line with that priority.
3. The process to solve that question is fairly clear.

You should be able to summarize these three points clearly and simply. For example, a good
idea might be summarized as:

1. The subject matter must be universal.


2. Let’s make a tool that is applicable to a wide variety of situations.
3. This should be feasible by combining/developing tools from previously reported studies.

However, the last one might be difficult for you to judge. Again, you can consult the mentors.

Turning an Idea into a Topic


An idea with clear logic is relatively easy to turn into a project. Next, you should make a list of
prerequisites:
- What will be required to complete the project ?
- Are there methods available to demonstrate (experiment with) it?
- Are the necessary materials available?
- Is the experimental equipment available?
- Do we enough bandwidth? Enough time?
Chapter 2 Before Starting Your Project 37

Keep in Mind the System for Judging the BIOMOD Competition


You should design your project with an aim to win, not just participate. At BIOMOD, all projects
are scored by judges. Judges are selected from the mentors of participating teams, who are
professionals in this field. You have to satisfy these people to get a high score, regardless of
how confident you are in your project. Check the judging criteria of BIOMOD as you design your
project (see section 3).
This competition is good practice for getting grants, where your work is also evaluated by others.
Evaluations depend on the value system of the community of researchers, and of academic and
human society.

Rise and Fall of a Project


Once you have chosen a topic, the project starts. You don’t have much time until the final
(see section 6). However, sometimes you may have to change topics. If you have enough team
members, you can run more than one project in parallel and then narrow the list down to a single
project that seems promising. In the course of the experiment, however, you may encounter
an obstacle that you cannot overcome, in which case you may have to reconsider the subject
matter itself. In that case, you should construct the overall logic of a new topic all over again.
The judges and the audience will only see the final presentation. Changing topics midstream is
difficult, but sometimes necessary. In the process of reconsideration, the logic is polished and
becomes clearer. The victory will be yours after you get through it.

Copyright
Your YouTube video and project Wiki for BIOMOD will be made public online. You must make
sure that the figures, videos, and sounds included are either original or in the public domain.
Any publication that you consulted should be cited with appropriate information (authors,
paper title, journal, year, and so on). Respect for the work of others is very important in the
academic world too.

—Satoshi Murata (Tohoku University)

13. Types of Brainstorming


Four eyes see more than two. Generating ideas in a group setting can be far more productive
than individuals trying to generate ideas on their own. Brainstorming should be done in a
divergent, free-flowing manner [1]. This section introduces brainstorming and other techniques
to generate ideas in a group setting.
Chapter 2 Before Starting Your Project 38

Brainstorming
Brainstorming is a techniques to find new ideas and perspectives through divergent thinking.
More than one member gathers to have a free, extensive discussion on a given subject matter.
It is the most orthodox process for generating creative ideas, and can be conducted in a very
straightforward manner.
First, decide together on a concrete topic. For example, the subject matter “how to make every
worker wear a hard hat” is better than “how to reduce construction site accidents”. Then,
individual members write ideas relating to the subject matter on a large sheet of paper or
whiteboard, and all discuss them one by one. The number of members and the length of the
whole process are not particularly limited, but several to 10 members and about 1 hour work
well. If you want go it for longer, the group should take a break about every 1 hour to refresh
yourselves.
In brainstorming, you can present essentially any idea but you should note the following four
points:

1. Welcome wild ideas: Free yourself from common sense and good sense to present any
idea. Never mind if it is strange or unusual.
2. Withhold criticism: Evaluation of proposed ideas should be put on hold. Do not criticize
other members’ ideas or your own, however ridiculous or uninteresting they seem.
3. Go for quantity: Instead of looking for the finest ideas, present as many ideas as possible.
4. Combine and improve ideas: Active utilization of existing ideas is also important. Mod-
ifying and combining different ideas can effectively generate new advanced ideas. As
described in (2), do not criticize ideas of others unless it improves the ideas and as a
consequence generates new ideas.

How to Generate Ideas


It is up to you how to generate ideas. Below are some tips on generating ideas.

1. Free association: this includes association by contrast, association by contiguity, and


association by similarity.
- Example of association by contrast: “day / night”, “female / male”
- Example of association by contiguity: “sea / beach”, “car / road”
- Example of association by similarity: “ball / earth”, “bird / airplane”
2. Nonrandom association: this includes “forced association” and “analogy”. Forced as-
sociation is a technique to impose some condition on the subject matter, and then
generate ideas. Analogy is a technique to get inspiration from something that is essentially
analogous to the subject matter.
- Example of forced association: Choose “novel scissors” as the subject matter. Impose a
Chapter 2 Before Starting Your Project 39

condition such as “they are to be transformed”, “ they are for seniors”, etc.
- Example of analogy: Choose “novel scissors” as the subject matter. Think “The role
of scissors is to cut” -> “Cutting is the purpose of guillotines, too” -> “Let’s make novel
scissors based on guillotines”.
3. Different perspectives: Think about different “products”, “properties”, “functions”, “com-
binations”, and so on, from “bird’s eye view/bug’s eye view” or the like [2].

Brainwriting
“Brainwriting” is similar to brainstorming [3]. It is effective in generating many ideas in a short
time, generating ideas in a group of rather reticent people, etc. Below is an example of the
process, applied to six people.

1. Participants sit in a circle or square.


2. Prepare six worksheets, each with 6 rows and 3 columns.
3. Distribute the worksheets to the participants, one sheet per participant. Write the subject
matter at the top of the worksheet.
4. Generate 3 ideas that relate to the subject matter and write them in the 3 blanks in the
first row, within 3 to 5 minutes.
5. Then, pass on your worksheet to the neighbor on your left and receive another worksheet
from your neighbor.
6. Look at the ideas in the first row in the worksheet that was passed to you, and write 3
new ideas in the 3 blanks in the second row. Again, pass on the worksheet in the same
pattern–to the left.

7. Every time you receive a worksheet, add new ideas inspired by the ideas in it (you can develop
ideas of the previous participant or propose totally different ideas inspired by them).
Repeat the steps 6 times in total. This takes 30 minutes or less, and you will get as many as 108
ideas (= 6 members x 3 ideas x 6 rounds).

Mind Map
In this technique, a list of ideas inspired by the subject matter is generated by drawing pictures.

1. Draw the subject matter in the center of a blank page.


2. Write ideas inspired by the subject matter around it, and connect the subject matter and
each idea with a line.
3. Add new ideas inspired by each written idea. Connect each of the former ideas and the
latter with a line.
You can make use of curves, colored pencils, or something to make it more fun and
visually interesting. This idea generation technique can be used in an individual setting.
Chapter 2 Before Starting Your Project 40

KJ Method
The 3 techniques described above are for divergent idea generation. Sometimes you may have
to sort the ideas generated. One technique to sort ideas is the KJ method [4], which can be
summarized as follows.

1. Record ideas on cards (or notes). One idea per card.


2. Sort the cards into groups based on the content, and give a representative title to each
group.
3. Sort the groups into larger clusters based on the title, and give a representative title to
each cluster. Repeat this step.
In the course of this process, the divergent list of ideas gains become s organized by theme.

—Ken Sugawara (Tohoku University)

14. Design Process


BIOMOD is a competition for “design” of biomolecules. “Design” is to prepare a plan for man-
ufacturing a product. Basically it is the same as designing a handicraft for summer homework,
but has to be done more carefully because molecules are not visible to the naked eye.

Fiat Specification
As described in section 12, the logical basis of the project is important. If the logic is clear
enough, the functions and properties of the desired product will be self-evident. Specification is
making a list of the functions and properties the project is aiming for. The design process starts
with specification.
The difficulty of a design will depend on the specification. A specification that includes conflict-
ing requirements (e.g., a car with good speed and high fuel efficiency) is not feasible (i.e. has
no design solution). However, an easily achievable specification, e.g. one that doesn’t need an
elaborate design may result in an ordinary product.

Breakdown of Design Process


Below is a breakdown of the design process. Repeat steps 1 to 6 as many times as necessary. In
other words, successful design by its nature requires trial and error.
Chapter 2 Before Starting Your Project 41

Design process

1. Establish specification: Define the desired functions.


2. Choice of parts: Choose parts required for the functions.
3. Design: Combine the parts into a system.
4. Experiment: Test the system by simulation and/or experiment.
5. Evaluation: Analyze the results to identify problems.

6. Repeat the process from step 3 (or 2 or 1).

At each step, check the following points. Check points of each step:

1. Is specification the right level of difficulty—not too easy and not too hard.
2. Choice of parts: Are they readily available, e.g. mass produced (see section 73). Avoid too
many parts (different molecules). Cost should be appropriate (see section 73).
3. Design: Avoid any non-practical assumption or wishful thinking.
4. Experiment: Simulation model possible? (see section 62). Experimentable? (see sections
69 and 70).
5. Evaluation: Reproducible? Accurate? (see section 70)

In principle, the design should proceed from the whole to details, from abstractness to concrete-
ness.

1. Conceptual design: Specify the required functions based on the overall image.
2. Preliminary design: Specify the structure and shape to achieve each function.
3. Detailed design: Specify the size and material of each part.

The process of dividing a system into smaller concrete parts is called decomposition, and the
process of combining the resultant parts back is called integration. Successful design requires
going from decomposition to integration and vice versa to specify details of the system. Remem-
ber this important constraint: one function per part. Any part with more than one function will
hinder design changes and error corrections. Such “optimization” should be done, if any, after
the basic design has been established.
Chapter 2 Before Starting Your Project 42

Decomposition and Integration

Talents for Handicraft


Designing DNA molecules is like a form of handicraft. Handicraft is a standard of summer
homework. Most of you must have experience with it, such as when you have made something
from waste cardboard or disposable chopsticks, but may not have succeeded in making what
you had expected. Most of you probably got something that didn’t look like what you were
envisioning at all. In handicraft, most things don’t proceed as expected. Unexpected problems
will happen, and your determination will be tested.
Required talents for handicraft:
- Pessimism that most things don’t proceed as expected.
- Imagination: the ability to visualize a thing before it is made.
- Observation: the ability to find the cause of a trouble.
- Flexibility: the ability to make use of resources around you.
- Stamina: the ability to seek for the best result.
- Imagination, or the ability to visualize, means having a concrete image of an object, to clearly
imagine its shape, size, weight, texture, motion, and so on. In molecular design, you have to
learn chemistry and molecular biology in order to imagine the properties and motion of a
molecule. It is not possible in a day, so start with anything that interests you in a textbook,
making steady efforts (see chapter 4). Various examples of research will be helpful (see chapter
3).
- Observation is the ability to identify something that is wrong. This is done by examining
experimental results, making a hypothesis about the cause of any trouble, modifying the
conditions, and working with patience. If you can find out the cause of the problem, you can
find a solution.
- Flexibility is the ability to make use of various resources around you, to find useful things and
possible applications of things from everyday life. However, flexibility doesn’t mean anything
goes: thoughtless application of these things to your project might distort it.
Chapter 2 Before Starting Your Project 43

- Stamina is the patience to pursue optimization. If you have made a system but are still anxious
about possible hidden problems in it, and if it seems more and more incomplete to you as you
keep working with it, then you surely have a talent for handicraft.
You are going to combine all these abilities to design a molecular system. There is no easy
way out, and remember that you should not make the schedule too tight. Otherwise you can’t
examine the design from various perspectives, and you can’t deal with troubles. Good schedule
management is a prerequisite for a good design (see sections 6 and 7). In most cases, it is no use
making a final spurt just before the deadline in order to solve any technical problems. You will
have nothing to learn from “Project X”-like success stories.

MIRACLES
One more thing is important to designing. Only one MIRACLE at a time. MIRACLE means
an unknown element required to attain the desired function, which is not described in any
textbook or paper and thus has to be newly developed. A project without any MIRACLE tends
to be just a combination of previously reported results, and thus not very exciting. On the
contrary, a project with more than one MIRACLE will not be achievable in 6 months and thus is
unsuitable for BIOMOD.

—Satoshi Murata (Tohoku University)

15. How to Collect Information


You need information to choose the subject matter of the project, to conduct the project, to
learn something, to make a research plan, to carry out an experiment, etcetera. This section
introduces some information sources and how to access them [1].

Textbooks and Published Papers


In the course of your BIOMOD project, you will have to study many things. First, what is DNA?
What can you do with DNA? You have found a good subject matter, but hasn’t someone already
done it? How to construct an experiment like this? There are two main kinds of information
sources to study these things: books (textbooks) and papers. Books are suitable for learning
basic knowledge systematically. Papers describe the results of specific research in detail, and
are more timely than books. In the actual course of your project, you will read quite a few papers.
In rare cases, you may read papers with incorrect information. Sometimes you may have to
critically examine the information you find.
Chapter 2 Before Starting Your Project 44

Literature Search
You will usually use the Internet for a literature search. There are several literature search sites.
One that is accessible from anywhere is Google Scholar [2]. For example, when the keywords
“DNA origami” (see section 18) are entered in the search window, the search results are like the
figure below. You can look at these titles and abstracts to find papers of interest.

Google Scholar

External link (publisher etc.)


Partial contents (abstract)
Other links providing the article (If you don’t like the above)
Related articles (according to Google)
Articles citing this article
If you are not getting good search results, try modifying search keywords as follows.
- Try synonyms and different expressions to prevent drop-out.
- Use adequately defined keywords to reduce noise in search results.
- Use search operators to narrow down search conditions.

Use operators to narrow down search conditions

AND: Insert space DNA origami


Chapter 2 Before Starting Your Project 45

OR: Insert “OR” or “|” DNA OR origami, DNA |origami


NOT: Put “-“ before the word DNA -origami
Phrase: Include the phrase between quotations "DNA origami"
Operators may differ among search engines. Once you have found a paper of interest, you can
check references listed in/papers citing that paper, to find related information.

Make Use of A Library


How should you access these textbooks and papers? Of course you may buy them, but textbooks
are fairly expensive and financial resources are always limited. Because you are a university
student, the cleverest way is to make use of the university library. If you have ever visited it, you
know how large a collection of scientific textbooks and journals it has. In most universities, the
library HP provides a service by which you can search the library catalog based on book title,
author name, journal name, and so on. So you can check if a book of interest is in the library
in advance. Often journals are not allowed to be taken out, but you can still copy the paper of
interest at the library. Note that a journal or book is copyrighted, and thus its copy is restricted
to “no more than half of the total pages” [3]. Even if the journal of interest is not in the library
of your university, you can use the copying service system to order it from other universities.
Please refer to a detailed user guide provided on each university’s HP.

Electronic Journals
In recent years, most papers are also published electronically on the Internet, so that you
don’t have to go to the library to read them. Papers are sometimes open access (free access),
but most charge a fee. However, you can access any paper from the university’s network and
read/download it for free, as long as your university is under contract with the journal. As
mentioned above, a paper is still cpyrighted, and thus should not be modified or redistributed
without permission. In addition, massive download in a short time might result in university-
wide suspension of service.

Let’s Read a Paper


Many of you will be reading papers for the first time. In your everyday life, you read a book or
newspaper from the front page to the back page. However, reading a paper in this manner is
inefficient. You should skip through it once to get the outline, and then read it again in detail.
This will help you to understand the content more easily.
A typical paper includes the following sections. These sections or their order may differ between
papers, but, for any paper, what you see first is the title. The title expresses the content of the
paper explicitly, such as “Folding DNA to create nanoscale shapes and patterns” [4]. Then, read
the abstract and figures to get the outline of the paper. To that end, you should also refer to
the Introduction (i.e., brief description of the aim and results of the study) and Conclusions as
Chapter 2 Before Starting Your Project 46

appropriate. Then read through the paper once. You don’t need to understand everything in it
on the first read. Read it again, referring to textbooks and references to address any confusion.
It will be fun to discuss the paper with your BIOMOD team members and upper-class students.

Sections of a Paper

Read as many papers as you can. Practice makes perfect. In doing so, you should also record
these papers in a file of reference management software or any other tools, which will help you
make create the references for the Wiki (see section 8). You can try Mendeley, ReadCube, JabRef,
and various other tools. Please visit the web site for details.
—Fumi Takabatake (Tohoku University)

16. More About Papers


In BIOMOD, reading papers is essential for training yourselves as well as for choosing the subject
matter. Papers are published reports of research results. This section describes the process of
publishing a paper, its contents, and how to read it.

Introduction
Scientific research consists of an continuous cycle of investigating a phenomenon, summa-
rizing the results, and disseminating the information to the community. Through this cycle,
researchers enrich the world little by little. This section focuses on papers which are a very
important part of the process of disseminating research findings.
Many procedures and norms in the scientific community were established before the advent
of the modern computing and the Internet. While the core research cycle is likely to persist in
some form, the approaches used to measure, summarize, and disseminate in research findings
are gradually evolving in response to technological innovation.

About Papers
Papers report on a specific subject matter, and are published in a scientific journal (figures
and videos included). Because a paper reports a new discovery by the researcher (or team),
Chapter 2 Before Starting Your Project 47

it is called a primary source. A textbook summarizes the essence of the results of papers into
a learner-friendly form, which is called secondary source. There are fee-charging papers and
free ones. A university is a public institute that provides access to primary information, i.e.,
papers, which allows you to do a wide variety of in-depth research. For example, access to
the latest issue of Nature if you were in your home and not logged on to a university would
cost you 3300 yen per letter. There are peer-reviewed papers and non peer-reviewed ones. In
addition to journal articles, there are papers published in international conference proceedings
(“international conference papers”). In some fields, papers of this type are peer-reviewed more
strictly than journal articles, and thus more appreciated.

Scientific Journals
A publication that includes more than one paper is called a journal, or proceedings, Since 1996,
many journals have gone to electronic publication.

Peer Review
A paper sent from the author or researcher to a scientific journal will not be published as
received. The received paper is first read by the editor, and then passed on to reviewers if it is
of interest and describing sufficiently high quality research. Usually, more than one reviewer in
the relevant field reviews the paper and returns comments to the editor, and the editor decides
whether to publish it. In most cases, the comments are fed back to the author, and the author
can revise the paper based on the comments. After one or more cycles of this process, the editor
decides to accept or reject the paper. This is called the peer-review system.

Pre-prints
Preprints are a means to rapidly share scientific manuscripts with the scientific community
prior to completion of peer review. Preprints are rapidly gaining adoption. More information is
available at YouTube: What are preprints?¹⁰ and Ten simple rules to consider regarding preprint
submission¹¹.

Types of Papers
The types of original papers for a typical journal include “communication” (about 2 pages),
“letter” (about 4 pages), and “article” (about 5 pages or more). Articles are the most common
type of publication. A review paper summarizes the state of the art and remaining challenges
in a specific field, citing related papers in the field. Remember that authors of reviews are still
humans and not free of bias when they lists references.
¹⁰http://bit.ly/2rMUyme
¹¹http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005473
Chapter 2 Before Starting Your Project 48

Sections of a Paper
Below are the sections of a typical full paper.
- Title: Important. It sometimes shows the author’s struggle to make it witty.
- Author(s): The first author is usually the one who actually wrote the paper. The corresponding
author(s) assume overall responsibility for the manuscript, including compliance with journal
policies, vouching for data veracity, and handling any questions that arise during and after
publication.
- Affiliation: Tells you were the authors work and do their research.
- Abstract: Very Important. If the content is not what you expected, look for other papers.
- Introduction: Describes the background of the paper. You can skip it if it’s familiar to you.
- Materials & Methods: Important. Describes details of the experimental methods used to
evaluate the hypothesis of the paper.
- Figure(s): Very important. Presents data.
- Results: Describes experimental results. As you get used to reading papers, sometimes you
can understand a paper by only looking at figures.
- Discussion: Important. The results of the paper are discussed. This often occupies a relatively
large portion in a paper written by a native English speaker.
- Conclusion: Usually no tricks here.
- Acknowledgment: Appreciation for those who provided advice, technical support, samples,
and financial support for the research.
- References: Very important. Papers cited here have been chosen because they are worth
discussing; i.e. the number of citations is sometimes more important than the number of
accepted papers.
- Supplementary Information: Important. Because experimental procedures nowadays are
getting more and more complicated, some results and details are too lengthy to be described
in the article body, and thus are provided separately as supplementary information. Sequences
for DNA origami are usually found here. Most videos are found here too. This section may even
include something like an essay on twists and turns of the study.

Let’s Read a Paper


Find a paper that relates to the subject matter of interest (see section 15). You can ask the upper-
class students and teachers. Read through it once as written, then in depth with a critical eye.
Particularly in an experimental paper, the focus is on whether the experiment described really
supports the author’s conclusion. Again, criticism is important. If you have any doubts, check
related references. You should always read with skepticism.. A paper with an exciting conclusion
and very little room for criticism is a good one.
Chapter 2 Before Starting Your Project 49

Value of a Paper
Writing a paper is fun. Publishing it is fun. Having other people citing it is fun. Researcher
productivity is measured by number of publications that person has. As mentioned above, a
paper’s value is measured by how much that paper enriches the world in terms of its originality,
reproducibility, and wide applicability. These criteria can be judged based on the number of
citations, impact factor (a measure of the relative importance of a journal), and so on. The
simplest way to collect information for your project is to investigate a specific subject matter.
On the other hand, because both research and publications are the result of human activity, you
may have fun tracking a favorite author or the author of a favorite paper. Every paper includes
the e-mail address of the corresponding author. If the paper interests you, you can ask for advice
from her/him via e-mail, which should be welcomed.

Let’s Write a Paper


A paper is an outcome, a place for discussion, and a launch pad for the next generation. After
reading it, you should have some kind of feeling. Age and gender should not be relevant in
writing a paper. If you have collected a lot of data in the course of your BIOMOD project,
go ahead and write a paper. A good starting point is GM Whitesides Writing a Paper¹² (doi
10.1002/adma.200400767).
—Shin-ichiro M. Nomura (Tohoku University)
¹²http://onlinelibrary.wiley.com/doi/10.1002/adma.200400767/abstract
Chapter 3 What Does Design of DNA
Molecules Make Possible?
17. Structural DNA Nanotechnology: Overview
Structural DNA nanotechnology can produce various shapes using DNA molecules as material.
This section outlines the molecular nature of DNA, basic structures that can be assembled from
it, and three approaches currently used to assemble such structures.

DNA Molecule
When asked what is DNA, most of you will answer, “genes.” This is a correct answer in a biology
class. However, have you ever thought of DNA simply as a “molecule” with a defined shape?
Chemically, DNA is just a molecule made of a long string (“polymer”) of “nucleotide” units, each
of which consists of a sugar, base, and phosphate. DNA in this form is sometimes called “single-
stranded DNA”.

There are 4 kinds of bases: A (adenine), G (guanine), C (cytosine), and T (thymine). A base
sequence is obtained by reading out bases in a single strand of DNA from one end to the other.
DNA has a directionality, with one of its ends called the 5’ terminus and the other the 3’ terminus
(based on the positions of two phosphates attached to a sugar), wherein its sequence is read
from the 5’ terminus (as seen from the figure, the direction is often indicated by an arrow).
“DNA=genes” because organisms recognize DNA base sequences as information that guides a
variety of biological activities including protein synthesis. Although DNA itself is just a molecule,
organisms use it as a “data storage medium” to record/read out a set of information that matters
to them.
As you know, the structure of DNA is a double helix. Two single strands of DNA that run in
opposite directions stick to each other along their length to create this form, in which the
Chapter 3 What Does Design of DNA Molecules Make Possible? 51

bases also play an essential role. Each base has a property of “complementarity”, so that only
a specific partner can stick to it through hydrogen bonding, i.e. A to T and C to G. Two single
DNA strands with mutually complementary sequences stick (hybridize) to each other along their
length through hydrogen bonding between bases, forming this double helix structure of double
strands. Strictly speaking, hydrophobic interactions between adjacent bases in the same strand
are also involved in the double helix structure formation, but we will not discuss that here.
In terms of engineering, you can expect that, even when a large number of DNA molecules
with different sequences are present in an aqueous solution, any of them will only stick to
a complementary DNA molecule to form a double strand, i.e. the double helix structure. See
sections 41 to 44 for more details.

Components of DNA Nanostructure


As discussed above, from an engineering point of view, you can regard DNA as a material with
an interesting property: i.e., a string-shaped molecule that can stick to another one or more to
spontaneously form a double helix, The process of matching one string to another is guided by
internal base sequences. As described in section 45, today we can chemically synthesize a DNA
molecule of any sequence. As a result, a pair of complementary sequences in a given site can be
designed to be as dissimilar to those in other sites as possible (these are called orthonormal
sequences), so that double helix-forming sites are appointed unequivocally. Because of our
ability to create these structures, researchers initially investigated branched structures and
sticky ends of DNA . These two features also serve as key components in the design of more
complex DNA nanostructures, as discussed below.

Branched Structure

As an example, in the left Figure, the regions (a) and (c) in the first DNA have, by design, a
sequence complementary to a region of the fourth DNA and to a region in the second DNA,
respectively. Four different DNA molecules thus make this four-arm branched structure because
Chapter 3 What Does Design of DNA Molecules Make Possible? 52

only mutually complementary regions can hybridize to form a double helix. By dividing each
DNA molecule into multiple regions and designing each region to stick to a specific region in
the same or another DNA molecule, you can assemble not only a straight chain of double helix,
but also more complex two-dimensional structures. At present, the most commonly used DNA
nanostructure components include four-armed branches as described above (one without any
single strand region b is sometimes called a crossover junction) and similarly designed three-
armed branches.

Sticky Ends

Another important component of creating DNA nanostructures is sticky ends. Imagine two
double strand DNA molecules (red and blue in the figure), each provided with a single strand
overhang (a or b in the figure) in its terminus. If the base sequences a and b are mutually
complementary, the two double helices are joined together through the hybridization of a and
b. Such single stranded (i.e. a and b) regions are called “sticky ends”. This technique is frequently
used to join units into a large structure by the “glue”-like function that allows selective binding
between their termini.

Three Approaches
How should you apply these concepts, i.e. branches and sticky ends, to assemble a DNA
structure? Currently there are three major approaches to the assembly of structures using DNA.
1. Folding (e.g. DNA origami and tiles)
2. Network formation (e.g. DNA hydrogel)
3. Connector-like function (e.g. SNA)
These approaches can be distinguished by the different ways they direct a DNA molecule. In
the folding approach, DNA double helices are bundled to form a desired nanostructure. Each
double helix is regarded as a “tube”, and, in principle, the structure is designed to form by
joining the tubes. Techniques that rely primarily on folding include DNA origami, DNA tiles
composed of a variety of DNA motifs, and recently reported DNA bricks. DNA hydrogel is a
relatively new assembly approach, which considers a long double helix made of a plurality
of joined DNA molecules as a “flexible macromolecule structure”, not as a rigid tube. DNA
hydrogel combines such structures to form a network, or gel, on the macro scale. In SNA, which
is described in section 21, the selective binding ability intrinsic to DNA is more important than
Chapter 3 What Does Design of DNA Molecules Make Possible? 53

its structure, and DNA is used as “a glue with a length and a selective action”. In this approach,
DNA molecules are attached to gold nanoparticles to control the interaction between them and
thus the crystallization of the nanoparticles.
These approaches will be discussed in detail in the following sections.
—Shogo Hamada (Cornell University)

18. Two-Dimensional DNA Origami


DNA origami structures, now a prominent field of study in DNA nanotechnology and the most
popular subject of BIOMOD projects, started with a DNA smiley face published in Nature maga-
zine in 2006. That article outlined the principles underlying the design of their various elaborate
structures. We explain a few principles here.

Structural Basis of DNA Origami


Every DNA Origami structure is made of only two kinds of components, i.e., a long single-
stranded DNA chain designated as “scaffold chain” and multiple short single-stranded DNA
chains designated as “staple chains”. The scaffold chain serves as a scaffold to assemble the
structure. Usually the chain is M13mp18 ssDNA, a circular single-stranded DNA of 7,249 bases.
This is the genome of the M13 bacteriophage which infects E. coli and can be prepared from the
organism in a highly homogeneous manner. Any single strand DNA can be prepared using this
bacteriophage, and it had been widely used as a DNA sequence analysis tool before modern
DNA sequencers were developed. As a result, it is still available from biology reagent suppliers
at a reasonable price. M13mp18 ssDNA is generally used in its circular form. It may also be
used after linearization by enzymatic cleavage, which is, however, not recommended because
the assembly of the origami structure will be incomplete when the cleavage does not occur
quantitatively. Similarly, in order to assemble larger DNA origami structures, scaffold chains
other than M13mp18 ssDNA (e.g., a fragment of the lambda phage genome) have been tested
and reported, but seem to achieve less consistent assembly than M13mp18 ssDNA.
The single scaffold chain is folded like the weft thread in a fabric and formed into a desired
planar structure by the action of the staple chains, which are generally 32 bases long. In order
to assemble a DNA origami structure, DNA double helices have to be bundled like a wooden raft,
similar to those in DX tiles. This can be achieved by introducing a crossover between the second
helix and the third helix at a position that is 1.5 helical turns away from a crossover between
the first helix and the second helix. This corresponds to 10.5 x 1.5 = 15.75 bases because the
periodicity of a B-type DNA double helix is 10.5 base pairs. However, this is usually rounded to
16 bases because the decimal fractions in the number of bases are somewhat difficult to handle.
That is, in a DNA origami structure, a staple chain bridges helices every 16 bases. Sixteen bases
in the 32 bases correspond to this “region complementary to the second helix”, while the staple
chain also has to bind to the first and third helices. Thus, 8 bases complementary to the first
Chapter 3 What Does Design of DNA Molecules Make Possible? 54

helix and 8 bases complementary to the third helix are added to the aforementioned 16 bases,
resulting in 32 bases total. As a result, in a DNA origami structure, the staple chains are bended
in an S shape and/or a Z shape to bind to the scaffold chain. This might seem unimportant, but
every helix has its handedness. Therefore, it is important to note that whether a staple chain is
arranged in an S shape or a Z shape can make a large difference. That is, when a staple chain
is arranged in an S shape, the termini of the staple chain will be the rear most objects in the
design. When it is arranged in a Z shape, they will be the fore most objects in the design. This
should be taken into consideration when the staple chain is to have an additional DNA chain
or the like hanging from it. In order to fold the 7,249-base scaffold chain by means of 32-base
staple chains, 7249/32 = 226.5 so 226.5 staple chains are required in the simplest calculation.
Because the M13mp18 genome sequence can be regarded basically as random, two hundreds
and several tens of DNA chains with mutually different sequences are required. Preparing their
mixture might feel like a hard job, but the beauty of the resulting structure will reward you for
all your trouble.

Assembly and Observation of DNA Origami Structures


Once you have obtained the appropriate design and corresponding staple chains, a DNA origami
structure can be assembled by a very simple procedure. By simply dissolving the scaffold chain
and an excess amount (generally about 5 equivalents) of the staple chains in 1 x TAE/Mg²+ buffer
(40 mM Tris, 20 mM acetic acid, 2 mM EDTA, and 12.5 mM Mg²+ included), which is commonly
used in DNA nanotechnology, and slowly cooling it from 90˚C to 25˚C (this is referred to as
annealing), each staple chain will find their target regions complementary in the scaffold chain,
folding the scaffold chain into the desired structure. You must always include a divalent cation
(generally Mg²+) in the solution. Each DNA chain carries a high net negative charge due to its
phosphodiester bonds, and thus cannot by itself form a dense structure of helices such as are
required in DNA Origami, due to static repulsion. Thanks to the divalent cation, this repulsion
can be neutralized and the helices bridged.
A divalent cation is also indispensable in observing a DNA origami structure. A two-dimensional,
thin and planar DNA origami structure is usually observed under an atomic force microscope
(AFM). When the DNA origami solution is added dropwise onto the highly planar structure of
freshly cleaved mica, the negative charge on the mica surface and the negative charge of the DNA
origami structure are bridged and neutralized by the divalent cation. As a result, the DNA origami
structure beautifully adheres to the mica surface. This adhesion is so strong that even tapping
by the AFM probe will not easily peel off the DNA origami structure. Thus, if your assembly has
gone as planned, you will observe the desired Origami structure with a thickness of 2 nm (i.e.,
that of a DNA double helix).
Chapter 3 What Does Design of DNA Molecules Make Possible? 55

Adapted by permission from Macmillan Publishers Ltd: Nature (doi: 10.1038/nature04586),


copyright 2005
Schematic Diagram of DNA Origami. The folded single black line is the scaffold chain, and the
colored lines are the staple chains. In this figure, the staple chains are arranged in a Z shape.
Reprinted with permission from Nature 2006.
Chapter 3 What Does Design of DNA Molecules Make Possible? 56

Design and AFM image of Smiley DNA Origami Structure. Reprinted with permission from
Nature, 2006.

Column: Flipbook Cartoon Made of DNA Origami

In BIOMOD 2013, Team Kansai from Kansai University reported a “DNA Origami flipbook
cartoon”, composed of a total of 10 AFM images, of “the Rabbit and the Turtle”, and won the
gold prize. Their idea of minimizing the number of the staples by connecting the ground, turtle,
and rabbit through linkers was admired. The team also assembled a background layer of a
Chapter 3 What Does Design of DNA Molecules Make Possible? 57

roadside tree passing by via DNA origami, and got to the final in the Intercollegiate Science
Competition sponsored by MEXT.

—Akinori Kuzuya (Kansai University)

19. Three-Dimensional DNA Origami


“Origami” models well known to Japanese researchers include “crane” and other three-dimensional
structures as well as two-dimensional structures such as “samurai helmet” and “yakko-san”
(samurai’s attendant). Several techniques for three-dimensional structure assembly have also
been devised using nanoscale DNA origami.
In principle, the three-dimensional DNA origami structures reported so far can be divided into
two classes.

Three-Dimensional Hollow DNA Origami


The earliest three-dimensional DNA origami structures were hollow polyhedrons, each assem-
bled from separate planar parts, which were reported by four groups almost at the same time in
2009 [1-4]. Two of them were box-shaped and designed to be intentionally “opened and closed”
for their future use as nanocontainers (the other two were a regular tetrahedron and prism,
respectively). In the structure made by Anderson et al., the faces are joined in a manner that
allows only one face of the cube (about 40 nm square) to open/close like a lid [1]. The lid is kept
closed by two different pairs of one DNA strand and another partially complementary strand.
Addition of completely complementary DNA strands can selectively open the lid. The author’s
group calls the DNA strand pairs that keep the lid closed “locks” and the DNA strands that open
the lid “keys”. Afterward, Anderson et al. also made a smaller box-shaped structure that operates
by the same mechanism [5].
The earliest “nanomechanical DNA origami device” independently made by the author’s group
is also a cubic, box-shaped DNA origami structure (about 40 nm square), but its operating
mechanism somewhat differs from that of the above example [2]. In the first step to make this
structure, two units, each composed of three mutually perpendicular faces, are joined through a
hinge to form one “open structure”. Then the open structure is turned into a box-shaped “closed
structure” by the addition of a trigger DNA strand. Recently, it has been successfully used as a
nanocarrier to selectively contain a single gold nanoparticle 10 nm in diameter [6]. Selective
uptake of a gold nanoparticle into a similar lidded, box-shaped DNA origami structure has also
been reported by Sugiyama et al. [7]. This system employs an artificial photoreactive base to
assemble a DNA origami structure, which allows a light-triggered structural change.
Chapter 3 What Does Design of DNA Molecules Make Possible? 58

Exemplary Hollow Three-Dimensional DNA Origami Structures. (a) Box-shaped DNA origami
structure [1]. Reprinted with permission from Nature 2009. (b) Regular tetrahedron DNA origami
structure [3]. Reprinted with permission from Nano Lett. 2009. (c) DNA origami structure from
our group [2].
Once the box-shaped structures as described above are opened, there is now way to keep them
in the open state. Therefore, their free motion in an aqueous solution provides a driving force
to open these structures. Firrao et al. reported an exemplary DNA origami structure that can
be opened/closed by a more active driving force. They created a tubular DNA origami structure
with a similar hatch that is driven by double strand DNA formation, and showed that it actually
worked.
Hollow three-dimensional DNA origami structures can include curved surfaces, by virtue of the
ability to bend DNA strands in a DNA origami [8]. For example, Yan et al. have successfully
assembled flask- and sphere-shaped DNA origami structures.
Chapter 3 What Does Design of DNA Molecules Make Possible? 59

A schematic based on [8].


Honeycomb Three-Dimensional DNA Origami
Another category of three-dimensional DNA origami structures is the honeycomb three-dimensional
DNA origami [9].

Reprinted with permission from Nature 2009.


These structures will be more familiar to BIOMOD participants who use caDNAno. In actual
situations, a hollow DNA origami structure in an aqueous solution will be subject to substantial
fluctuation, because DNA itself is very flexible. On the other hand, such fluctutations will not
Chapter 3 What Does Design of DNA Molecules Make Possible? 60

occur in a honeycombed three-dimensional DNA origami, which is a solid three-dimensional


structure formed by folding one origami sheet in a multilayer geometry. This technique can
even introduce bends and twists into any three-dimensional structure, by carefully choosing
the length of the DNA components and stacking differently sized layers [10]. Today, a plurality
of such curved structures can be combined to make previously impossible complex three-
dimensional nanostructures, such as a huge bird cage-like structure. Recently, a structure with
an ion channel-like function that can pierce through a liposome has been created. [11].
—Akinori Kuzuya (Kansai University)

20. DNA Tiles


Among DNA nanostructure assembly techniques, the “DNA tile” technique is used to make
patterned two-dimensional structures and three-dimensional crystals. This section describes
DNA tile basics, taking two-dimensional DNA motifs as an example. We also introduce the basic
idea of a so-called “algorithmic self-assembly” technique.
DNA tiles = motifs + sticky ends
DNA tile is a representative assembly technique to assemble a nanostructure by DNA folding. It
is like a jigsaw puzzle, which is assembled by joining small pieces one by one, at the molecular
scale.

Adapted with permission from [1].


Here, a structural unit composed of more than one DNA double helix (“motif”) corresponds
to a jigsaw puzzle piece. In the case of a jigsaw puzzle, pieces have different shaped tabs and
slots on their edges, which determine how the pieces are joined. In the case of DNA tiles, sticky
ends play this role. DNA tiles can be joined together only when their sticky ends have mutually
complementary sequences. In an actual jigsaw puzzle, you have to join pieces by hand. In DNA
tiles, structures form spontaneously in an aqueous solution through Brownian motion and
hybridization. This is called self-assembly. Specifically, slow cooling or “annealing” (see section
76) is employed to achieve a thermodynamically stable state (i.e. “a state that provides a long
Chapter 3 What Does Design of DNA Molecules Make Possible? 61

double helix”). During the cooling, each set of DNA molecules in the solution forms a tile, after
which the tiles self-assemble to form a two/three-dimensional structure (i.e., in the case of DNA,
puzzle pieces themselves are also spontaneously formed in the solution). Motifs that make up
DNA tiles and an “algorithmic self-assembly” technique to form DNA tile-based patterns are
outlined below.
DNA Motifs
A “repeat unit” structure made of a combination of branched DNA structures is called a DNA
motif. The most representative motif used for DNA tiles is the double crossover (DX) molecule
structure. In an aqueous solution that contain magnesium ions or the like to suppress the
electrostatic repulsion between DNA molecules, a four-arm branched (crossover) structure can
have a closed conformation as shown in the left or right in panel a. However, in this state,
the conformation cannot be unequivocally specified because the branch still can move freely
like a hinge. In panel b, two branches are provided between two double helices to fix their
conformation, which is called a double crossover.

Adapted with permission from [1].


In double crossover (DX) tiles, the dimensions and sticky ends are carefully chosen so that
double helices from adjacent tiles will be in phase, as shown in c [2]. These sticky ends can be
interlocked to form a planar structure of two dimensionally joined tiles. This structure looks like
a bundle of double helices that are arranged parallel to each other.
Similarly, triple crossover (TX) tiles have been reported, which are bundles of three double
helices, (a) [3], n-point star with rotational symmetry (b) [4], cross-tile, which is a four-arm
branch motif (c) [5], and single-stranded tile (SST), composed of a single strand which becomes
double helical only when it is incorporated into the final assembly (d) [6].
Chapter 3 What Does Design of DNA Molecules Make Possible? 62

Reprinted with permission from [1].


The authors have developed the T-motif [7], which is composed of T-branched structures instead
of crossover branches. This can produce structures on mica or other substrates, serving as a
platform to form a desired pattern of nanoparticles and proteins over a considerably large area.

Reprinted with permission from Angew. Chem. Int. Ed. ©2009. Reprinted with permission from
[7].
Chapter 3 What Does Design of DNA Molecules Make Possible? 63

Algorithmic Self-Assembly
When DNA tiles have only one kind of sticky end, you will get a uniform two-dimensional planar
structure made of identical tiles. What if more than one kind of sticky end is present: e.g., two
sticky end pairs 1 and 0? When the right side of a tile A can be joined to the left side of a tile B
through 1 and the right side of B to the left side of A through 0, a structure composed of alternate
repetitions of A and B will form (panel. a).

Reprinted with permission from [1].


One such structure was actually made and showed a striped pattern of alternating protrusion-
containing A tiles and protrusion-less B tiles (b) under atomic force microscopy (AFM).
Similarly, the use of more and more sets of sticky ends (and thus more and more different tiles)
can provide more complex patterns. For example, assume four different tiles that each carry a
distinct combination of sticky ends 1 and 0, as shown in panel a below.

Reprinted with permission from Natural Computing ©2008. Reprinted with permission from [8].
Chapter 3 What Does Design of DNA Molecules Make Possible? 64

When the DNA tiles self-assemble on the initial structure (b), which includes only one site 1, a
complex pattern made of triangles (c) will form [9]. This is called a Sierpinski triangle. Likewise,
various different patterns are possible by adopting different “rules” of tiles. This process is the
same as a one-dimensional cellular automaton. The lower sides of each tile correspond to an
input, and the upper sides to an output. The right and left lower sides are designed to define
the output from the right and left upper sides (the rules in this figure correspond to the XOR
operation). The self-assembly of such DNA tiles, which can perform computation based on sticky
ends and output various patterns, is called “algorithmic self-assembly”.
Further, it is possible to have every connection differ in stick- end sequence from all the rest,
so that the positional relationship between every two tiles (thus the final structure) is unequiv-
ocally defined. This can be implemented in single-stranded tile (SST) to make any two/three-
dimensional structure, which is called a “DNA brick” (see column 1).

Column 1: DNA Bricks


DNA bricks are two/three-dimensional structures made of a category of DNA tiles called single-
stranded tile (SST) [14]. In most cases, a “motif” that defines the shape and “sticky ends” that
define the connection are separately designed for a DNA tile. However, in SST, the double
helix formed by a sticky end pair also specifies the shape of the tile. Each single strand DNA
is composed of four domains (panel a), and each portion is connected to an adjacent SST,
resulting in a structure made of multiple double helices bridged by multiple instances of half
crossover branches (a structure with a hinge formed by a single DNA strand). This structure will
allow more compact and simple sequence design than currently popular DX tiles, because each
DNA strand itself serves as a tile, avoiding the need for a further internal structure to define the
shape.
Thanks to such a structural advantage of SST, DNA bricks allow you to design any structure
at a higher resolution (8 bp in the case of three-dimensional structures) than DNA origami.
For example, imagine diversifying tiles in a given structure as much as possible. If a sticky end
sequence in a given tile is designed to only fit a sticky end sequence in a particular different
tile, every tile in the structure will be different from all the rest: i.e., the position of a given tile
in the final structure is predetermined unequivocally. This strategy does not seem feasible for
DX tiles and other tiles that require more than one DNA strand, because you will have to take
sequence orthogonality into account for the internal structure of each tile. However, you can
use this strategy in SST, where there is no internal shape and only sticky end sequence matters.
In other words, in DNA origami, it is the sequence of scaffold DNA that determines positional
relationships. In DNA bricks, it is the sequence of domains inside each tile and their combina-
tion.
In order to assemble an actual structure, as shown in panel b, prepare a set of SST sequences
that constitute a square, cubic, and other structures, which serves as a “canvas”. Then select,
mix, and anneal a subset that will form the desired structure. In this manner, you can obtain
any shape [15, 16]. Similarly designed, mutually connectable unit structures have successfully
provided a crystal structure [17].
Chapter 3 What Does Design of DNA Molecules Make Possible? 65

Two-dimensional DNA bricks and reported examples. Reprinted with permission from Nature
©2012. [15]

Crystal structure made of DNA bricks. Reprinted with permission from Nature ©2014. [17].

Column 2: “Closed” DNA Nanostructure


Chapter 3 What Does Design of DNA Molecules Make Possible? 66

In addition to DNA tiles, origami, and bricks, which are two/three-dimensional structures made
of double helices, it is possible to obtain a DNA-based “closed structure”.
(a) A cubic structure [10] can be made by ligating a plurality of circular DNA molecules of
catenane topology. (b) An octahedron structure [11] can be obtained by folding a long single
strand DNA made of chemically synthesized multiple DNA molecules, in the presence of other
short DNA strands. (c) A tetrahedron structure [12] can be formed from four single DNA strands
each of which constitutes three of the tetrahedron edges and is complementary to one of
the other strands along each edge. (d) A spherical structure can also be made using the T-
branch motif as described above [13]. By carefully choosing the concentration for annealing,
tetrahedron, dodecahedron, and buckyball-like structures were obtained.

(a) to (d) drawn based on [14], [11], [12], and [13], respectively.

—Shogo Hamada (Cornell University)

21. DNA-Modified Particles (SNA)


Among three techniques involving DNA nanostructures, SNA (Spherical Nucleic Acids) forms
a structure using gold nanoparticles or the like as a core and attaching DNA to it for selective
binding. This section outlines its concept.

SNA (Spherical Nucleic Acids)


Unlike “folding” and “hydrogel” approaches, which make structures solely from DNA, SNA uses
colloidal particles as a structural core and attaches DNA molecules to them, which then interact
to produce a crystal structure. This concept was devised by Mirkin et al. [1] and Alivisatos et
al. [2] in 1996 (reported in the same issue of Nature). The generic name SNA was proposed
by Mirkin, based on the appearance of the DNA molecules placed over the spherical particle
surface. In the case of a DNA tile, its motif structure determines the stiffness. By analogy, in the
Chapter 3 What Does Design of DNA Molecules Make Possible? 67

case of an SNA, the spherical (or geometrically anisotropic) particle determines the stiffness.
However, in an SNA, DNA molecules with sticky ends are attached to the entire particle surface.
By virtue of interactions between this DNA population, the concept of “valence” or “bond” at
the atomic level can be directly applied to the reactions between the colloidal particles. As
a result of using the highly designable hybridization between DNA molecules in inter-particle
interactions, programmable colloidal particles with “atom-like behavior” can be designed [3],
which will allow for control of the crystal structure itself.

Modification of Nanoparticles and Structure Assembly


At present, because of its relatively easy preparation, gold nanoparticles are usually chosen as a
core. In a typical procedure, chemically synthesized DNA, modified with a thiol group, is reacted
with and attached to the gold nanoparticle surface in a buffer that has an ion concentration
adjusted to suppress the repulsion between DNA molecules. For example, the resulting particles
can be connected via these DNA linkers to provide a face-centered cubic (f.c.c.) or body-centered
cubic (b.c.c.) crystal structure (a) [5,6]. Recently, a “Wulff polyhedron” monocrystal structure
was also obtained on the micro-scale through annealing [7,8] (b). This was very impressive
in that a nano-scale interaction of DNA produced a macroscopic structure that is known as
equilibrium shape in the field of crystal growth.

—Shogo Hamada (Cornell University)

22. DNA Hydrogels


DNA hydrogel is the newest among the three approaches in structural DNA nanotechnology.
This section describes technique to make a network of combined DNA molecules, and resulting
Chapter 3 What Does Design of DNA Molecules Make Possible? 68

exemplary gel structures, methods to characterize individual DNA hydrogel products, and some
applications.

Introduction
Among DNA-based structure assembly techniques, DNA hydrogel seeks to make macro-scale
(i.e. micro-to-centimeter order) structures by forming a network in a solution. This section
outlines previously reported DNA hydrogels, and their manufacture, characterization, and ap-
plications.
Chemical and Physical Gels
In general, a gel is classified into one of two kinds depending on the crosslink mechanism of its
network.

One class is a chemical gel. A chemical gel is formed by covalent crosslinking. As a result, as
long as its component molecules are unbroken, the whole network will remain intact (will not
Chapter 3 What Does Design of DNA Molecules Make Possible? 69

melt). A well-known example of chemical gel solely made of DNA has a motif-based design as
described above [1]. Specifically, sticky ends of X, Y, T, or other branched structures are joined
by an enzymatic reaction using ligase. This allows a robust network of double helices to be
produced.

Another class is the physical gel. A physical gel is formed by a non-covalent crosslink mechanism
(e.g., hydrogen bond or physical entanglement). The earliest examples of physical gel prepared
from chemically synthesized DNA include one designed by Liu et al. in a motif-based manner
[2], which has been formed by connecting Y branches using a double strand linker, both
ends of which are sticky ends. In another example, a gel has been obtained using a non-DNA
macromolecule as the network backbone in combination with the specific connection property
of DNA hybridization [3]. Recently, 3D printing was applied to such a physical DNA hydrogel to
obtain desired two/three-dimensional shapes at the macro scale [4].
Another technique to produce a DNA physical gel depends on DNA polymerase enzyme [5] (DNA
polymerase extends a DNA strand complementary to a template, with a primer being the starting
point), which is summarized below. First, a circular template DNA is prepared, to which a primer
DNA (primer 1) complementary to a portion of the template sequence is hybridized, and then
Chapter 3 What Does Design of DNA Molecules Make Possible? 70

DNA polymerase starts the extension reaction of primer 1. This polymerase has strong strand
displacement activity, resulting in a long single strand DNA product that is composed of multiple
repeats of the sequence complementary to the template. This process is called RCA (Rolling
Circle Amplification). This first reaction is allowed to proceed for a sufficiently long time, and
then primer 2 (which is complementary to a portion of the product of primer 1), and primer
3 (which has a sequence identical to primer ), are added to the solution. Primer 2 starts an
extension reaction using the extension product of primer 1 as a template. Primer 3 hybridizes to
the initial circular template DNA or the product of primer 2 to start another extension reaction.
The resulting DNA molecules become highly entangled with each other forming a gel network.
Chapter 3 What Does Design of DNA Molecules Make Possible? 71

Characterization
Is the structure obtained as above truly a gel? In most cases, the answer to this question is
determined by dynamic viscoelasticity analysis using a rheometer. In dynamic viscoelasticity
analysis, a sinusoidal stress is applied to a given sample and the strain response is measured.
Based on their phase difference and amplitude, the storage modulus G’ (the elastic portion; the
degree of solid-like behavior of the sample) and the loss modulus G” (the viscous portion; the
degree of liquid-like behavior of the sample) are calculated. G’ > G” means that the sample is a
gel.

Applications
One of the most promising applications of DNA hydrogel is the incorporation of a gene-containing
DNA such as a plasmid into a gel network to promote cell-free protein synthesis inside the gel [6].
This system has proved to be capable of synthesizing protein about 300 times more efficiently
than a solution system. This might be due to the protection of the molecular structure of the
Chapter 3 What Does Design of DNA Molecules Make Possible? 72

gene by the DNA hydrogel, a higher concentration of the gene in the gel compared to a solu-
tion system, more efficient turnover of the enzymatic reactions (thus improved transcription
efficiency), etc.
—Shogo Hamada (Cornell University)

23. DNA Nanomachines and Nanorobots: Overview


DNA cannot only make a “shape”, but also a “motion”. How can we produce motion using DNA?

Molecular Machines
A molecular machine is defined as “a molecule that goes through some directional movement
in response to an input stimulus”. In general, it has a plurality of states and repeats the transition
between them.
An organism is made of innumerable, highly sophisticated molecular machines. A wide variety
of protein-based molecular machines are at work in every cell. For example, molecular motors
essential for bacterial flagellum rotation are driven by the proton concentration gradient across
the cell membrane. Muscle contraction is the result of the sliding of actin and myosin protein
filaments, which depends on the energy of ATP hydrolysis. As another example, F-type ATP
synthase, which is responsible for ATP synthesis in most organisms, itself rotates to carry out
the synthesis reaction. These natural biological molecular machines are highly efficient.
On the other hand, artificial molecular machines employ a catenane (multiple ring-shaped
molecules mutually interlocked like a chain), a rotaxane (a dumbbell-shaped molecule threaded
through a macrocycle, interlocked by the ends of the dumbbell), and the like that function as
joints, bearings, and other movable molecular elements. These machines include molecular
shuttles, molecular tweezers, and nanocars. A nanocar has four wheels made of fullerene, and
each axle can endlessly rotate on a covalent carbon-carbon bond. A nanocar has been reported
to be driven straight over a gold surface by thermal fluctuation [1].
Chapter 3 What Does Design of DNA Molecules Make Possible? 73

Nanocar. Reprinted from Nano Letter ©2005 ACS.


These artificial molecular machines are remarkably large for synthetic organic macromolecules,
but only capable of relatively primitive motions such as shuttling and rotary motions.

DNA Nanomachines
DNA allows more complex motions to be achieved, and even for these motions to be controlled.
Driving mechanisms for DNA nanomachines include the following.
- Driving Mechanisms for DNA Nanomachine:
- External
- Strand displacement (DNA input)*
- B-Z transition (salt concentration change)
- Light-responsive base (light input)
- Autonomous
- Enzymatic reaction (DNAzyme, restriction enzyme)*
- Strand displacement (with autonomous DNA input)*
*sequence-specific (only driven by a particular sequence).
In an externally driven system, a nanomachine moves only one step every time it receives a
certain input signal provided externally. In an autonomous system, movement happens au-
tonomously without such external signals (or it appropriately generates the required molecular
signal).
Chapter 3 What Does Design of DNA Molecules Make Possible? 74

The most common external driving mechanism is strand displacement (see section 42). In
this case, different DNA single strands are added to the solution by pipetting, one at a time.
Each DNA single strand added displaces a part of the DNA nanomachine structure, causing the
nanomachine to change its shape (this constitutes one motion). A representative example is
molecular tweezers (see section 24). One advantage of the strand displacement-based driving
method is that you can choose the site of displacement. Other variations include walker (see
section 25) and pliers (see section 26). At present, the most complex nanomachine driven by
strand displacement is the nanofactory developed by Seeman’s group (see section 26).
Chapter 3 What Does Design of DNA Molecules Make Possible? 75

Nanofactory. Reprinted with permission from Nature, 2010 [3]


Chapter 3 What Does Design of DNA Molecules Make Possible? 76

The B-Z transition driving method relies on the fact that a DNA molecule with the base sequence
CGCGCG… flips from the right-handed B form to the left-handed Z form and vice versa, depend-
ing on surrounding ion concentration. You can change the ion concentration of the solution by
repeatedly pipetting so that the machine repeats the flipping motion. Another nanomachine
depends on a light-responsive artificial base such as azobenzene (see section 48). Because the
azobenzene molecule changes its form upon light irradiation, UV irradiation can control the
association/dissociation of an azobenzene-containing double stranded DNA. The B-Z transition
and light-responsive base driving methods are not suitable for complex motions, because a
single input causes all actuators to move at the same time.
There are relatively few examples of autonomous molecular machines. One example is the
autonomous walker (see section 25). The walker achieves a walking motion by binding to an
array of track DNA molecules provided on a two-dimensional DNA origami (see section 18), and
then cleaving the bound track DNA by means of a DNAzyme sequence contained in each leg (see
section 30). The walker does not require any external input because it autonomously repeats the
cycle of taking one step and binding the next track DNA upon the cleavage of one track DNA.

Autonomous walker based on a DNAzyme sequence [4].

Slime Robot
Any of the molecular machines described above works as a single molecule in a solution, by
changing its shape and/or binding state stepwise. A more elaborate system, the Molecular
Robotics Project (see column 1) seeks to achieve a slime-like motion by enclosing a DNA
computing circuit and a molecular actuator in a reactor such as a liposome (see section 89).
The behavior of microtubules can be controlled by attaching a DNA tag to them, to which an
autonomous DNA reaction system (see section 27) will be applied in the future [2].
Chapter 3 What Does Design of DNA Molecules Make Possible? 77

Slime Robots [2]. Reprinted from Accounts on Chem Res. ©2014 ACS.
—Satoshi Murata (Tohoku University)

24. DNA Tweezers


A DNA nanomachine can perform a mechanical task by changing its structure in a sequence-
specific, reversible manner, in the absence of any change in the temperature and pH of the
environment. The strand displacement reaction, which was created to drive DNA tweezers, is
now commonly used to drive many DNA nanomachines and DNA logic gates.
DNA Nanomachines
How can we make and drive a machine of a nanometer size, which is far smaller than a
micrometer? A DNA strand can hybridize to a complementary strand in a sequence-specific
manner, to change from a random coil-like single strand state to a relatively rigid, rod-like
Chapter 3 What Does Design of DNA Molecules Make Possible? 78

double strand state (see section 40). This self-assembling property can produce two/three-
dimensional nanostructures (see section 17). If such a DNA nanostructure can change its
structure repetitively, it means that a nano-sized machine can be achieved.
For example, a change in the pH and ionic strength of the environment can cause structural
change in a molecule. However, it will not be so easy to obtain a set of different molecules that
will exhibit different structure changes in response to a given environmental change and com-
bine them to perform a complex mechanical task. It is also difficult to have one environmental
change control only one of the different molecular machines or drive all of them cooperatively.
Is it possible to reversibly change a structure assembled by means of the sequence-specific
hybridization of a DNA molecule?
Thermodynamically, the self-assembly of DNA into a nanostructure proceeds toward a more
favorable state in terms of free energy. By appropriately arranging complementary sequences
in the design, a variety of structures can form spontaneously upon just mixing DNA molecules.
Theoretically, after a structure has been obtained, another DNA strand capable of binding to a
DNA strand in the structure in an energetically more favorable state can be externally added to
cause a structural change. However, for the structural change to be reversible, the base-paired
strands have to dissociate again and form base pairs with another strand, which involves an
unfavorable state in terms of free energy and will take an unimaginably long time. Thus, we
used to think it was impossible to operate such a machine on a practical time scale. However,
the “DNA tweezers” built by Yurke et al. [1] accomplished this.
Chapter 3 What Does Design of DNA Molecules Make Possible? 79

Operating Mechanism of DNA Tweezers

Reprinted with permission from C. M. Niemeyer, M. Adler, Angew. Chem. Int. Ed. 2002, 41, 3779.
DNA tweezers are driven by the sequential addition of two DNA strands serving as a signal that
directs the structural change and works as a fuel to create a simple structure made of three DNA
strands. The tweezers-like structure, two rigid rods connected by a hinge, is assembled by the
hybridization of one DNA strand (Ta) and two other DNA strands (Tb and Tc) each of which carries
a sequence complementary to the 5’ or 3’ end of Ta (top left in the figure). The two rods each
have a single strand overhang. When a DNA strand (Fc) that is complementary to both overhangs
Chapter 3 What Does Design of DNA Molecules Make Possible? 80

is added, it hybridizes to the rods to turn the tweezers into a closed structure. The added DNA
can be regarded as a “signal” to direct the structural change in a sequence-specific manner and
a “fuel” to drive the structural change toward a favorable state in terms of free energy.
If the bound Fc strand is capable of being dissociated from the structure by means of another
signal/fuel DNA strand, it means that a reversible mechanical task can be achieved. In the
structure shown in the bottom of the figure, a single strand overhang in the closed structure,
“toe-hold”, is bound by a DNA strand (Fo) that includes a sequence complementary to it. Fo
also carries a sequence complementary to a sequence in the double strand formed between
Fc and the tweezers, causing base pair displacement. Base pair displacement is a process with
no substantial change in free energy, which proceeds in a random-walk manner. When all the
base pairs in Fc have been displaced, the double strand structure formed between Fc and Fo is
released as waste, and the DNA tweezers return to the original open structure. Such a “strand
displacement reaction” is now employed as a driving mechanism in many DNA nanomachines
and enzyme-free DNA logic gates.

Performance Test of DNA Tweezers


Compared to a typical DNA origami structure of several thousand base pairs, DNA tweezers,
composed of less than 100 base pairs, are so small that their structure cannot be observed
even by AFM. Whether they are actually working is usually examined by observing changes
in the fluorescence that reflects the motion of a large population of DNA tweezers in a bulk
solution. To do this, the termini of the DNA strand Ta are modified with a fluorescent group
and a quencher group, respectively (indicated by small balls in the figure). When the Fc strand
is added and the DNA tweezers are closed, the fluorescent group and the quencher group come
into proximity so that the fluorescence decreases, demonstrating the formation of the closed
structure. If an increase in the fluorescence is observed when the Fo strand is added, it means
that the DNA tweezers have opened again through the reverse motion. The force caused by the
open/closed motion of DNA tweezers has been thermodynamically calculated at about 15 pN,
which is comparable to that caused by the natural protein molecular motors kinesin and myosin.

Recent Progress
DNA tweezers need the addition of a DNA solution to perform each open/close motion. When
the reaction solution is a closed system, this step also results in dilution of DNA tweezers; i.e.,
repeated additions of the DNA solution causes a gradual decrease in operating efficiency. The
accumulation of wastes may also induce unexpected interactions. Asanuma et al. have reported
an azobenzene-based optical control of the hybridization that activates the open/close motion
of DNA tweezers by UV light, without any addition of DNA [2].
Yan et al. made a larger, tweezers-like structure from DNA and attached an enzyme and its
cofactor to its tips, to build a nanoreactor in which an enzymatic reaction was controlled by
Chapter 3 What Does Design of DNA Molecules Make Possible? 81

the open/close motion of the DNA tweezers [3]. When the DNA tweezers are closed and the de-
hydrogenase and NAD+ come into proximity, the enzymatic reaction is promoted by more than
5-fold compared to that in the open state, demonstrating the control of an enzymatic reaction
by DNA tweezers. Expected future applications of this include more complex mechanical tasks
and elaborate reaction control systems based on DNA tweezers.
—Ken Komiya (TITECH)

25. DNA Walker


Different types of walking mechanisms have been devised to drive DNA walkers, a simple DNA
structure that moves by repeated association/dissociation from a track made of DNA. A DNA-
responsive walker, which requires the addition of DNA for each step, and autonomous DNA
walkers, driven by enzymatic reaction and hairpin DNA hybridizations, are described below.

Walking Molecule
To operate the DNA tweezers described in the previous section, a DNA nanostructure repeats
an open/close motion. In the “DNA walker”, a nanostructure moves from one place to another.
What does “walking a molecule” mean? Under the earth’s gravity, our body stays in one place on
the ground unless something happens. However, in the world of nanometer-sized molecules, a
molecule in solution will be blown from place to place by Brownian motion, unless something
happens. A molecule thus needs to bind to a “scaffold” to walk. Organisms have networks of
scaffold for motor proteins to walk around, which is important for the transport of molecules
and the like. On the other hand, the scaffold for DNA walker is also made of DNA. A DNA walker
moves on such a DNA scaffold structure by repeating the cycle of binding to one site on the
scaffold, then binding to the next site, and detaching from the previous site.

DNA-Responsive DNA Walker


Pierce et al. constructed a DNA walker that was driven by strand displacement reaction, which
had been originally devised for DNA tweezers (Fig. 1) [1]. As shown at the top of each panel in
the figure, the DNA walker is made of two partially hybridized DNA strands, wherein the single
strand overhang serves as the “legs” of the walker. At the bottom, another structure made of
multiple DNA strands hybridized to each other also shows single-stranded protrusions. These
single-strand regions are called “stators”. Two DNA strands are added sequentially to this system
and each serves as both signal and fuel for the walking motion. A1 is added first, hybridizing to
both the walker and one of the stators to place one leg of the walker on the stator. Then A2
is added, causing another leg to be placed on a neighboring stator. At this point, a region of
the added DNA remains single-stranded, serving as the toe-hold for the strand displacement
reaction (see section 42). Then, D1 is added, which is fully complementary to A1, to induce
Chapter 3 What Does Design of DNA Molecules Make Possible? 82

a strand displacement reaction that releases A1 from the leg of the walker and the stator,
resulting in the walker again standing on one leg. This walking motion has been demonstrated
by modifying the toe and the stator with different fluorescent groups and observing it by
fluorescence spectrometry (see section 81).
Chapter 3 What Does Design of DNA Molecules Make Possible? 83
Chapter 3 What Does Design of DNA Molecules Make Possible? 84

Fig. 1. DNA strands A1, A2, and D1 are highlighted in yellow, pink, and grey, respectively.
Reprinted with permission from Reference [1]. Copyright ©2004, American Chemical Society.

Enzymatic Reaction-Based DNA Walker


A DNA walker is driven by the association/dissociation cycle of complementary sequences.
Directional walkers have been attempted by combining walkers with enzymatic reactions, such
as restriction enzyme/ligase or DNAzyme. The example below employs an irreversible DNA
cleavage reaction by nicking enzyme (Fig. 2) [2].

Fig. 2. Reprinted with permission from J. Bath, A. Turberfield, Nature Nanotech. 2007, 2, 275.
Dark green: DNA walker; light green: stator DNA; Black arrowhead: cleavage site.
When this single-strand DNA walker hybridizes to a stator, it produces a nicking enzyme recogni-
tion site in a double strand state where the stator DNA can be cleaved by the nicking enzyme. The
Chapter 3 What Does Design of DNA Molecules Make Possible? 85

short DNA fragment derived from the cleavage has a low melting temperature, spontaneously
dissociating from the walker. Because all the stators have an identical sequence, the newly
exposed single strand region in the walker serves as a toe-hold to induce a strand displacement
reaction, moving the walker to a neighboring stator. Then the cleavage reaction occurs again,
causing the walker to proceed unidirectionally on the straight track of stators. This process has
been confirmed by fluorescence spectrometry, after which the walking motion was observed
by video recording under high-speed AFM for a single molecule [3].

Hairpin DNA-Based Walker


When mutually complementary sequences are present in one DNA molecule, they can hybridize
to each other and form a hairpin structure. Compared with inter-molecular hybridization to
a fully complementary DNA sequence, this hairpin structure is less stable in terms of free
energy, but can have its hybridization suppressed in a kinetic manner, allowing the researcher to
program a reaction cascade for a stepwise hybridization in the presence of a carefully designed
toe-hold sequence. Yin et al. used this technique to construct interesting reaction systems such
as catalytic branched structure formation and autocatalytic double strand DNA formation [4]. An
autonomous DNA walker system that is solely composed of DNA has also been proposed (Fig.
3). It is a structure made of two partially hybridized DNA strands, with a double strand region
flanked by single strand “leg” regions. The scaffold has stators that are each designed to form a
hairpin structure, so that the “fuel” hairpin DNA cannot bind to them. However, when the walker
binds to a stator, the hairpin structure of the stator is opened up, and the “fuel” hairpin DNA can
now bind to it, causing dissociation of the leg. As a result, the walking motion of the walker
occurs autonomously.
Chapter 3 What Does Design of DNA Molecules Make Possible? 86

Fig. 3. Reprinted with permission from Reference [4].


—Ken Komiya (TITECH)
Chapter 3 What Does Design of DNA Molecules Make Possible? 87

26. DNA Nanomachines


Because DNA tweezers and walkera are composed of only a small number of short DNA molecules,
they are not suitable for complex tasks. However, they can be incorporated into a DNA origami
structure to perform a stepwise reaction. In addition, molecular devices with a moving DNA
origami structure have also been developed.
DNA Molecular Machine Working on DNA Origami
The first example of a DNA molecular machine that worked on a DNA origami molecule came
from Seeman’s group. Seeman is a pioneer in DNA molecular machines. In 2009, they incorpo-
rated their rotary machine into a DNA origami, and successfully observed its structure change
by AFM [1]. They advanced this system to build a factory of nanometer size on a DNA origami
[2] structure. Specifically, they let a triangular DNA walker move on single strand DNA scaffolds
provided on a DNA origami. This walker possessed three rotatable arms on its sides and was
able to catch gold nanoparticles as it moved forward. This is a very complex system, but they
have shown by AFM and EM that the walker is actually loaded with desired particles.

Reprinted with permission from Nature, 2010.


Chapter 3 What Does Design of DNA Molecules Make Possible? 88

The “DNA spider” by Lund et al. is another example of a DNA walker moving on a DNA origami
[3]. The DNA spider is a streptavidin protein modified with three uniquely designed DNA legs
that have an RNA cleaving activity; it moves autonomously on multiple RNA scaffolds provided
on a DNA origami by degrading them one after another.
Turberfield et al. prepared a track of single strand DNA array on a DNA origami, on which
their enzymatic reaction-based DNA walker moved [4]. Using high-speed AFM (1 frame/sec),
they succeeded in recording the world’s first video of the motion of a DNA walker. They also
introduced a fork in the track, at which the moving direction of the DNA walker was controlled
by an input DNA strand [5].

Nanomechanical DNA Origami Device


We have developed the world’s first nanomechanical DNA origami device capable of a structural
change directly linked to a practically useful function, i.e. a pliers-like movable DNA origami
structure with two levers, each about 170 nm long, joined by a single joint (“DNA pliers”) [6-
9]. The levers, each formed from a bundle of six DNA double helices, can rotate freely about a
Holliday junction that serves as the joint, and will usually exist as an X-shaped structure with
an interior angle of 60˚ in an aqueous solution. When the DNA pliers are adsorbed onto mica
and observed by AFM, a large number of X-shaped structures are observed. On the other hand,
if the levers are chemically modified with something that interacts with the target molecule
to form a second joint between the levers, the X-shaped structure will be able to turn into a
=-shaped closed structure. For example, if you modify each lever with a biotin molecule, the
pliers can be closed upon picking just one streptavidin protein. This structure can be subjected
to single molecule imaging by AFM, demonstrating detection of a target protein at the single
molecule level. In other words, this system acts as a single molecule detection system based on
a movable DNA origami device that can simultaneously detect a wide range of targets, including
the hydrogen ion with the smallest molecular weight of 1 Da, small molecules of several 100 Da,
and even proteins of several 10 kDa.
Chapter 3 What Does Design of DNA Molecules Make Possible? 89

As another example of detecting a target based on the structural change of a DNA origami device
induced by interaction, Mao et al. reported a protein detection system based on optical tweezers
[10].

DNA Origami Robot


Douglas et al. reported a “DNA nanorobot” that recognizes and binds to a target cell in a
logical operation [11]. This DNA origami device, which has a barrel-like shape and a hexagonal
cross section of 35nm in diameter in its closed state, can open along the length (45 nm) like a
crocodile’s jaws. The crocodile’s jaws are kept closed by an aptamer that recognizes a secreted
molecule from a particular cell at the tip, which judges whether a given cell is the desired target
or not. If it recognizes the target, the aptamer-containing double helix dissociates to open the
jaws and expose an antibody fragment anchored in the interior of the barrel, resulting in the
DNA nanorobot binding to the target cell. Amir et al. even demonstrated the performance of a
similar DNA nanorobot by injecting it into a living cockroach [12].
—Akinori Kuzuya (Kansai University)
Chapter 3 What Does Design of DNA Molecules Make Possible? 90

27. Autonomous DNA Reaction Systems


The construction of autonomous DNA computers, autonomous DNA walkers, or any other
autonomous molecular robot, requires an autonomous DNA reaction system [1]. This section
summarizes autonomous DNA reaction systems. Similar systems have been applied to many
other systems, and readers are referred to the corresponding sections in this book.

Autonomous Reactions
First, we define the meaning of “autonomous” in this field, because it is a somewhat ambiguous
word. In general, in chemical synthesis, a non-autonomous reaction means a stepwise reaction
that requires external intervention at each step, e.g., “after one step, add reagents for the next
step to occur, after that, add reagents again…” In contrast, an autonomous reaction is a stepwise
reaction that, once reagents have been added to start the reaction, proceeds in accordance
with information programmed in a DNA base sequence or the like to achieve the desired result
(calculation result, synthesized molecule, structural change, motion, etc.). Below are exemplary
autonomous reaction systems.

Hybridization Chain Reaction (HCR)


Pierce et al. developed a DNA extension system called HCR (Hybridization Chain Reaction) (Fig.
1) [2]. It is composed of two hairpin DNA molecules (H1 and H2) and an initiator ssDNA (I)
that triggers the reaction, which depends solely on hybridization and proceeds autonomously
without enzymes. As shown in Fig. 1A, the presence of H1 and H2 is not sufficient for the reaction
to occur. When I is added, the sequence a* in I hybridizes the toe-hold sequence a in H1, causing
the hairpin structure of H1 to be opened by branch migration (Fig. 1B). As a result, the sequence
c in H1 hybridizes to the toe-hold sequence c* in H2, causing the hairpin structure of H2 to be
opened by branch migration. This exposes the sequence b*a* in H2, which is identical to one in
I (Fig. 1C), resulting in a chain reaction that generates a long DNA made of multiple repeats of a
double strand sequence. HCR is applicable to the detecting trace signal molecules, autonomous
DNA computing, and the like.
Chapter 3 What Does Design of DNA Molecules Make Possible? 91

Fig. 1. Mechanism of HCR.

Reverse-transcription and Transcription-based Autonomous


Computing System (RTRACS)
Suyama’s, Takinoue’s and Kiga’s groups, inspired by the process of viral infection/proliferation
in a host cell, developed a system called autonomous DNA computer RTRACS that was based on
reverse transcription (RT) and transcription reactions [3,4,5]. Fig. 2 shows the configuration of
an RTRACS-based AND gate [3]. In RTRACS, input/output molecules (data molecules) are made
of RNA and system components are made of DNA. The input RNA r21 hybridizes to the primer
DNA 1 included in the reaction system, initiating the DNA extension by RT. The RNA degrading
enzyme (RNaseH; RH) degrades the RNA, generating the single strand DNA 12. DNA 12 hybridizes
to another input RNA, r32, causing another RT extension. Then the RNA is degraded by RH,
providing the single strand DNA 123. When this strand hybridizes to the converter DNA included
in the system, DNA polymerase (DP) starts a DNA extension, which turns the T7 promoter
sequence included in the system into a double strand. T7 RNA polymerase (RP) binds to this
double strand promoter sequence to transcribe the downstream output RNA r45. Such a series
of reactions proceeds autonomously, allowing an operation through which two RNA inputs are
converted to one RNA output. Because it employs RNA as the interface, the system allows more
complex operations by combining multiple instances of the gate. For example, an oscillating
reaction based on a negative feedback loop may be possible [4].
Chapter 3 What Does Design of DNA Molecules Make Possible? 92
Chapter 3 What Does Design of DNA Molecules Make Possible? 93

Fig. 2. RTRACS-based AND gate.

Transcriptional Bistable Circuit and Oscillator


Winfree’s group has developed a bistable switch (bistable circuit) and an oscillating reaction
system (oscillator) based on transcription reactions [6,7]. It employs T7 RNA polymerase simi-
larly to RTRACS, but does not involve reverse transcriptase. See the original papers for reaction
details.

Other Autonomous DNA Reaction Systems


The autonomous reactions described above are just examples to show you how an autonomous
reaction can work. Below is a list of various autonomous DNA reaction systems discussed
elsewhere in this book.
- DNA walker (see section 25) - DNA logic gates (see section 29) - DNAzyme logic gate (see section
30) - Entropy-driven gate (see section 31) - Seesaw gate (see section 32) - DNA toolbox (see
section 33) - Whiplash PCR (DNA state transition machine) (see section 34) - DNA automata (DNA
state transition machine) (see section 34)
—Masahiro Takinoue (TITECH)

28. DNA Computing: Overview


Since Adleman succeeded in a computation using DNA molecules, many attempts have been
made to implement computation models that use DNA molecules. This section outlines DNA
reactions such as strand displacement reactions and enzymatic reactions, computation models
inspired by the reactions, and general computation models included in them.

DNA-Based Computation
Various kinds of molecule-based computation have been proposed and attempted for many
years. One is the reversible molecular Turing machine proposed by Bennett in 1982. Other well-
known examples include those by Conrad, Vaintsvaig, and Liberman, etc. Feynman, who won
the Nobel Prize in Physics, also pointed out the possibility of molecular machines in his speech
titled “There’s Plenty of Room at the Bottom” in 1959.
Adleman was the first to demonstrate the ability of molecules to perform an actual computa-
tion. In 1994, Adleman solved a directed Hamiltonian path problem by ingenious use of DNA
reactions, which launched the field of DNA computing. DNA computing in a narrow sense (i.e.
classical DNA computing) refers to the search and optimization techniques that are based on
Adleman’s solution method. However, the landscape of recent DNA computing research reveals
Chapter 3 What Does Design of DNA Molecules Make Possible? 94

that the implementation of various other computation models has also been attempted using
molecular reactions, especially of DNA. In general, DNA computing is a field that seeks to use
DNA reactions to implement computation models.
The diagram below includes a list of popular computation models that have sought to im-
plement DNA computing. In the course of attempts to implement these popular computation
models using DNA reactions, some have proposed general computation models inspired by DNA
reactions.
Adleman’s, and other similar solution methods, are each composed of multiple steps, and thus
involve many computation models. In particular, the step of generating paths in a digraph can
be regarded as a process of generating strings that represent the paths in accordance with
grammatical rules.

DNA Reactions
In order to implement computation models as described above, a set of information (data and
computation states) has to be represented by DNA molecules, and converted by DNA reactions.
The DNA molecules represent information primarily by their base sequence and/or secondary
structure, formed through hydrogen bonding. Then the represented information is converted
by a reaction that edits the base sequence and/or a reaction that alters the secondary structure
as follows.
Association (hybridization): formation of a double strand by mutually complementary single
strands.
Dissociation: reverse reaction of association; a double strand resolves into two single strands.
Restriction enzyme cleavage: cleaves a DNA double strand by a restriction enzyme at its target
site (i.e. recognition sequence). Some restriction enzymes cleave only one strand in a double
strand, producing a nick.
Ligation (ligase-mediated linkage): forms a covalent bond at a break or nick in a double strand.
Polymerase extension: extends the 3’ end of one strand in a double strand, using the other strand
as a template.
Strand displacement (branch migration): replaces one strand in a double strand.
Chapter 3 What Does Design of DNA Molecules Make Possible? 95
Chapter 3 What Does Design of DNA Molecules Make Possible? 96

—Masami Hagiya (Tokyo University)

29. DNA Logic Gates


There have been many attempts to implement a logic circuit using DNA molecules. This section
summarizes basic concepts of logic circuits, general principles in their implementation by
molecules, and examples of an enzyme-free logic gate (Seelig et al.) and a reversible logic circuit
(Genot et al.).

Logic Circuit
Logic circuit is one of the most basic computation models. Why? A bit is the most fundamental
data unit. Thus a function that receives a bit sequence as input and generates a bit as output
(Boolean function) provides the most basic operation. Such a function is implemented by logic
circuit. This type of logic circuit is called a “combinatorial circuit”. A sequential circuit is another
type of logic circuit, one that has a memory section. This section will focus on combinatorial
circuits
In mathematics, a logic circuit is a digraph without cycles. In general, the number of directed
edges entering a node in a digraph is called the indegree of the node, while the number leaving
it is called the outdegree. In a logic circuit, a node is classified as an input node or logic gate. The
indegree of an input node is 0. Basic logic gates include AND, OR, and NOT. AND and OR have an
indegree of 2, while NOT has an indegree of 1. One of the logic gates is defined as an output
node. The length of the longest path from the input node to the output node is called the depth
of the logic circuit.
From the viewpoint of computational complexity, the logic circuit as a computation model has
been extensively studied. For example, the “NC” class of computational complexity has been
defined. When a Boolean function family with an input of n bits belongs to NC, it means that
there is a logic circuit family to implement the function, with the number of gates being a
polynomial of n and the depth of the circuit being a polynomial of log n. Note that a Turing
machine with limited computational resources exists, and receives n as input and generates
(the representation of) a logic circuit for an input of n bits as output.

Implementation by Molecules
Implementing logic circuits by molecular reactions is the most basic subject in the field of
molecular computing. In order to design a molecular reaction-based logic circuit, you must
define how to represent the bits 0 and 1, first of all. Generally, a bit can be represented by
voltage, light intensity/wavelength, magnetic field direction, temperature, and many other
physical phenomena. You have to assign a specific molecule species to each gate in order to
Chapter 3 What Does Design of DNA Molecules Make Possible? 97

make a plurality of molecular reaction-based gates operate in a chain reaction, with an output
of one gate being an input to another gate.
Accordingly, in the most basic case, different molecular species are assigned to different gates.
For example, molecular species G is assigned to gate g. When the output of gate g is 0, it is
represented by the absence of molecular species G (concentration = 0), and when the output
of gate g is 1, it is represented by the presence of molecular species G (concentration ≥ a
predetermined threshold). Theoretically, an output of 1 can be represented by as little as one
molecule. Practically, a threshold concentration is defined so that an output is determined to
be 1 when the concentration of G is equal to or more than the threshold. It’s best to have the
concentration as close to 0 as possible when the output is 0. This representation method is
called single-rail representation in comparison with dual-rail representation, which is as follows.
In so-called dual-rail representation, two molecular species G and H are assigned to each
gate g. An output of g of 1 is represented by the presence of molecule G (concentration ≥
a predetermined threshold), whereas an output of g of 0 is represented by the presence of
molecule H (concentration ≥ a predetermined threshold).
Enzyme-Free Logic Circuits of Seelig
Below is a schematic of an enzyme-free logic gate by Seelig et al.

In this figure, an AND gate is implemented. Gin and Fin are inputs. First, Gin binds to the toe-
hold in the single strand G in the gate, which results in a double strand formation between Gin
and G by strand displacement reaction. Then the newly exposed toe-hold is bound by Fin, which
results in a double strand formation between F and Fin by strand displacement reaction. As a
Chapter 3 What Does Design of DNA Molecules Make Possible? 98

result, when the inputs Gin and Fin are both present, the output Eout is released as a single
strand.
The single stranded (or partially single-stranded) DNA molecules each represent a bit, and the
gate operates by generating new single strands depending on the combination of the inputs. The
strand displacement reaction to form a stable double strand is irreversible, i.e. proceeds only
unidirectionally. Therefore, this system cannot drive a reaction by the absence of a molecular
species, making it hard to implement a NOT gate in single-rail representation. When NOT is
present, dual-rail representation is required.
Reversible Logic Circuit of Genot et al.
Genot et al. has implemented a reversible logic gate as shown in the following figure, which is
also an AND gate.

Each of the two inputs (Fin and Gin) can alter the structure of the AND gate, but the single strand
region Eout as output is exposed only when both inputs are present. These reactions, though
with different reaction rates, each can proceed in either direction. Thus, when one of the inputs
is removed, the reaction proceeds in the reverse direction and the output is lost.
Therefore, a NOT gate is also possible in single-rail representation. A single strand G, which
represents the NOT gate, is put in the system in advance. An input to the NOT gate is a single
strand complementary to G. The input can form a double strand with G to remove it from the
system, meaning that G has been denied. Once G is removed, the gate reaction in the later step
proceeds reversibly, so that the logic circuit operates correctly.
—Masami Hagiya (Tokyo University)
Chapter 3 What Does Design of DNA Molecules Make Possible? 99

30. DNAzyme Logic Gates


There is a particular type of DNA sequence that has enzyme-like functions, called a “DNAzyme.”
This chapter introduces DNAzymes that can degrade a ribonucleotide backbone in a metal ion-
dependent manner. An example of DNAzyme-based logic gate is also introduced.
Metal Ion-Dependent DNAzyme
This section describes a DNAzyme with metal ion-dependent endonuclease activity. Fig. 1 is a
schematic of an Mg²+-dependent DNAzyme [1]. The red arrow represents the DNAzyme and the
blue arrow represents its substrate molecule. The blue molecule is a DNA/RNA chimera, which is
cleaved by the DNAzyme at the ribose in the backbone. In order to observe the reaction, its ends
are modified with fluorescent and quencher molecules. In the initial state, the fluorescence is
suppressed by FRET (see section 51).
Upon Mg²+ addition, the backbone of the blue substrate molecule is cleaved. As a result, the
distance between the fluorescent and quencher molecules increases and the FRET is lost,
emitting fluorescence. The curved portion in the red arrow is the essential catalytic domain. On
the other hand, virtually any complementary sequence pairs can be used for the hybridization
regions between the red DNA and the blue substrate, without affecting the activity.
In addition to the Mg²+-dependent sequence, known examples of DNAzyme with degradation
activity include Zn²+- and Pb²+-dependent sequences [2]. Apart from degradation activity, a
sequence with a ligase activity that connects backbone fragments and the like have also been
reported [3].

Schematic of DNAzyme

DNAzyme-Based Systems
A DNAzyme is not only applicable to a metal ion detection system, but also to more complex
systems that are driven by DNA inputs. Typical examples include walkers and logic gates. In a
Chapter 3 What Does Design of DNA Molecules Make Possible? 100

walker system, a DNAzyme repeats the degradation reaction, causing the site of degradation to
move around [4] (see section 25).

DNAzyme-based AND gate

A DNA logic gate has been implemented so that a substrate is cleaved only when a desired
hybridization has occurred [1]. Fig. 2 shows an AND gate. In the initial state, the hairpin struc-
tures in the top and bottom portions of the red DNA prevent its hybridization to the substrate.
Either input can open one of the hairpin structures, which is still not sufficient for the substrate
to be cleaved. Only in the presence of both inputs can the substrate be cleaved to emit the
fluorescence as output.
On the contrary, a system that is initially active but is inactivated by an input can be constructed
by placing a similar hairpin in the active site. In addition to AND, (non-implication) and XNOR
have also been reported [1].
Chapter 3 What Does Design of DNA Molecules Make Possible? 101

A cascade of such DNAzyme logic gates has also been attempted [5]. The system is schematically
shown in Fig. 3. Each cleaved product activates or inactivates the next cleavage reaction,
allowing a particular combination of inputs to generate a fluorescence as output.

Multilayered DNAzyme logic gate cascade

Fig. 3. Multilayered DNAzyme logic gate cascade [5]. Reprinted with permission from Nature
Publishing Group, 2010.

Tic-tac-toe and Display


Some examples of large-scale systems that combine DNAzyme logic gates have been reported.
One is a tic-tac-toe system [6] (Fig. 4, left). Instead of a grid, it contains wells that represent the
grid, and a human pipets DNA into the system and is represented by a particular square (input).
The system player in this game emits fluorescence from a well that represents another particular
square (output). The game proceeds by repeating the pipetting and fluorescence readout. The
game is programmed so that the system beats the human.
Another system is a display [7] (Fig. 4, right). In response to a combination of inputs, the display
outputs the result of decimal to binary conversion, four arithmetic operations, or the like as a
digit from 1 to 9 on an array of wells.
Chapter 3 What Does Design of DNA Molecules Make Possible? 102

DNAzyme-based tic-tac-toe

Reprinted with permission from American Chemical Society, 2006 and Angewandte Chemie
International Edition, 2014.
—Ibuki Kawamata (Tohoku University)

31. Entropy-Driven Gate


An entropy-driven gate is an example of a system based on DNA strand displacement reactions.
An input DNA is used as catalyst to activate an amplification reaction that generates an output
DNA. Comparison of free energy before and after the reaction reveals that this system is driven
by the entropy term rather than the enthalpy term.

Strand Displacement-Based Gate


In order to output a high concentration of DNA from a low concentration of DNA as an input,
some amplification mechanism is required. This section introduces an enzyme-free amplifica-
tion gate that is solely driven by a strand displacement reaction (see section 42) [1].
Fig. 1 summarizes the reaction mechanism. Each arrow represents a single strand DNA, and
each single letter represents a base sequence. Each pair of capital and small letters represents
mutually complementary sequences. The reaction starts at the top left and proceeds counter-
clockwise. Elementary reactions are hybridization (double helix formation), branch migration
by replacing hydrogen bonds, and denaturation (dissociation).
The gate is composed of a substrate made of three DNA strands (top left) and a fuel molecule
(bottom right). When there is no input of the catalyst (top), the reaction does not occur and the
output molecule (right) remains bound to the substrate.
When the catalyst as input is present, it hybridizes to the domain E, initiating the reaction.
Branch migration and denaturation follow to release the signal molecule (left), resulting in
Chapter 3 What Does Design of DNA Molecules Make Possible? 103

the state represented by Intermediate 1 (bottom). The domain C is now in a single strand
state, to which the fuel molecule hybridizes, inducing branch migration to form the structure
intermediate 2 (middle right), generating the output molecule. Then the denaturation reaction
releases the catalyst and a waste product (shown in purple and blue), and the latter cannot
participate in the reaction any more (top right).
The activity of the catalyst is not affected by the reaction. Once released after the reaction, it can
drive another cycle and function endlessly as long as the substrate is present.

Reaction mechanism of an entropy-driven gate

Experimental Evidence of Amplification Reaction


The output molecule is read by FRET (see section 51) as shown in Fig. 2. The x and y axes repre-
sent time and normalized fluorescence intensity, respectively. A higher fluorescence intensity
indicates a higher concentration of the output, wherein an intensity of 1 means the release of
all output molecules. The figure shows a simulation result based on the numerical analysis (see
sections 60 and 67) of a differential equation derived from a chemical reaction model [2]. The
amount of input DNA molecules relative to that of substrate molecules (in chemical equivalent)
is indicated on the right.
As seen from the graph, the concentration of the output is higher than that of the input, proving
the amplification. For example, the intensity of the line marked with 0.1x has exceeded 0.5 after
Chapter 3 What Does Design of DNA Molecules Make Possible? 104

2 hours. This means that an output of more than 50% was obtained from an input of 10%; i.e.
one input drove 5 cycles or more on average.
Some researchers have reported a two-level gate system such as an autocatalytic reaction
with the output and input having an identical sequence. Experimental and model analysis
results of the two-level system suggests that 12 hours of the reaction can even determine the
presence/absence of the input molecule at a concentration of 0.01% (in chemical equivalent).
However, the autocatalytic reaction generates some amount of the output also in the absence
of the input (leak reaction) as another consequence of the exponential amplification.

Results of a simulation of an entropy-driven gate

Entropy
What kind of energy is driving the catalytic amplification reaction system in Fig. 1? Let’s examine
the catalytic reaction step as shown in Fig. 3. Before the reaction, two molecules (i.e. substrate
and fuel) are present. The addition of the catalyst drives the reaction, which results in three
molecules (i.e. waste, signal, and output).
Chapter 3 What Does Design of DNA Molecules Make Possible? 105

When looking at the hybridized domains, all the domains B, C, and D are bound before and after
the reaction. In other words, the number of hydrogen bonds stays the same before and after the
reaction. Thus, the number of hydrogen bonds, which affects the enthalpy term (see section 37)
of free energy (see section 38), does not matter much in this reaction.
What does not stay the same after the reaction is the number of molecules (from 2 molecules
to 3 molecules, as described above); i.e., the degree of freedom of molecules becomes higher,
increasing the configuration entropy (see section 37). An experiment with a shorter fuel is
described in the paper, to show that it is really entropy-driven.
Because it is an entropy-driven system, the reaction is relatively resistant to the influence
of temperature and salt concentration on DNA hybridization. Thus, various applications are
conceivable besides amplification.

The catalytic reaction step

—Ibuki Kawamata (Tohoku University)

32. Seesaw Gate


A DNA reaction system called seesaw gate is based on the ingenious use of strand displacement
reaction of DNA (see section 42). Most importantly, multiple seesaw gates can be appropriately
combined in a circuit design to achieve a logic circuit, relay, and analog circuit.

Seesaw Gate as Basic Element (Basic Mechanism)


Assume a logic element as shown in the following figure. Each input signal to the element
is compared to a predetermined threshold, and signals below the threshold are discarded as
noise. In contrast, each input above the threshold generates an output signal. Here, the output
generation requires a fuel. As described below, seesaw gate is the implementation of this logic
element by DNA reactions.
Chapter 3 What Does Design of DNA Molecules Make Possible? 106

AND Gate by Combination


In the next figure, four basic elements are combined to construct an AND gate. Assume that the
signal level of input 1 is higher than the threshold (hereafter, ON indicates an input above the
threshold and OFF indicates an input below the threshold). Input 3 is an auxiliary input and
equals always ON. Although input1 is also ON, it requires a fuel to generate an output from gate
1. This fuel is supplied as an output from gate 3, which in turn requires a fuel to gate 3. When
input 2 is ON, gate 2 generates a fuel to gate 3 as output. Therefore, gate 1 generates an output
only when both input 1 and input2 are ON. When either or both of the inputs are OFF, gate 1
does not generate an output. This is an AND gate. Gate 4 is inserted to improve the reliability
of the AND gate. Qian et al. also have constructed an OR gate, Boolean network, comparator,
relay, and analog circuit, greatly facilitating the design of DNA reaction-based circuits [1]. In
this manner, you can construct a variety of circuits by carefully combining seesaw gates and
introducing measures to improve their functional reliability.
Chapter 3 What Does Design of DNA Molecules Make Possible? 107

DNA Reactions in Seesaw Gate


DNA reactions that constitute the seesaw gate are summarized below.
1) Threshold Processing
An input molecule is a single strand DNA with the sequence S₁-T-S₂, and a molecule that defines
the threshold is a double strand DNA that carries the toe-hold sequence T’ (see section 42).
When the input DNA meets the threshold DNA, it starts strand displacement from the toe-hold
sequence and no longer remains as a single strand. Therefore, in order to drive the operation in
the gate, a concentration of the input DNA higher than that of the threshold DNA is required.
Chapter 3 What Does Design of DNA Molecules Make Possible? 108

Reprinted from the Journal of the Royal Society Interface, 2011 [4] (open access).
2) Output
The following figure shows a chain reaction of a strand displacement reaction and is the origin of
the name seesaw gate. In the figure, reactions indicated by solid lines are predominant, whereas
those indicated by dotted lines are reverse reactions. First, when the input DNA meets a double
strand DNA included in the gate (gate:output), it undergoes a strand displacement reaction to
generate a single strand output DNA and a double strand DNA (gate:input). Gate:input exposes
a toe-hold sequence T’, to which a fuel DNA binds to undergo another strand displacement
reaction, with gate:input, converted into the input and a double strand DNA (gate:fuel). The
first strand displacement reaction is a material flow from the top to the bottom of the figure,
and the second reaction is from the bottom to the top…, constituting a “seesaw”-like reaction
cycle. Most importantly, the input is consumed in the first strand displacement reaction but
supplied in the second strand displacement reaction. Generally, in a circuit design, you must
take care that individual circuits will not only work correctly alone but also in combination
Chapter 3 What Does Design of DNA Molecules Make Possible? 109

when connected to each other. Such independence of performance of individual circuits from
connection is called modularity [2].

Reprinted from the Journal of the Royal Society Interface, 2011 [4] (open access).
Seesaw gates are highly modular, because they can generate output DNA without consuming
too much input DNA in the reaction cycle. Look at a circuit of a larger scale achieved by
combining the logic gates constructed in [1] for a comparison.

Column: More Complex Computing Circuits


Further elaborate circuits are also possible. For example, a seesaw gate-based neural network
composed of four neurons has been reported in [3].
Chapter 3 What Does Design of DNA Molecules Make Possible? 110

Reprinted with permission from Nature, 2011 [3].

—Takashi Nakakuki (Kyushu Institute of Technology)

33. DNA Toolbox


Unlike conventional DNA computing systems, a DNA toolbox developed by Montagne et al. is a
platform that combines DNA-DNA reactions and enzymatic reactions. By combining these two
reactions, a dynamic equilibrium system with a unique function can be designed.

Mimicry of Gene Regulatory Network


In living cells, multistep, complex information processing reactions of transcription from DNA
(gene) to mRNA and translation from mRNA to protein are continuously taking place. The
Chapter 3 What Does Design of DNA Molecules Make Possible? 111

produced proteins, in turn, can activate other genes or alternatively suppress them. Multiple
genes influence each other in a complex manner, creating a Gene Regulatory Network (GRN).
Complex behaviors of GRN such as oscillation [1] and memory [2] can be reproduced by
engineered systems. However, the process of DNA -> RNA -> protein includes multiple steps so
that the reaction may take a long time and may be difficult to control. Montagne et al. proposed
the following strategy to build a similarly complex system by combining DNA and enzymes [3].
A long “template” DNA, about 25 bases long, serves as a gene and synthesizes a short DNA by an
enzymatic reaction; the short signal DNA hybridizes to its target, i.e. the template, to activate or
suppress its function; the signal DNA is constantly degraded by exonuclease, so that the system
is kept dynamic.

Action on Template DNA


The template DNA is composed of two (i.e. input and output) regions, which plays a central role
in the DNA toolbox. When the signal DNA hybridizes to the input region, polymerases start the
extension process. As a result, the template DNA is turned into a complete double strand. In
order to separate the input and output, the template DNA has a nickase target sequence. The
input and output are separated by nickase and dissociate (Fig. 1).

Fig. 1: Template functions. 1. Hybridization, 2. extension reaction, 3. separation, and 4. dissoci-


ation.
Upon its dissociation from the template, the output becomes the signal DNA, which is an
important feature of a DNA toolbox. By linking multiple template DNA molecules through such
signal DNA molecules, a system like a GRN can be constructed.
As already described, the single-strand signal DNA is degraded by an exonuclease. On the other
hand, the template DNA must be modified at the 5’ terminus of its backbone and thus protected
from such a degradation reaction.
Chapter 3 What Does Design of DNA Molecules Make Possible? 112

DNA Toolbox Modules


DNA toolboxes include three basic modules of activation, autocatalysis, and inhibition. Fig 2
shows their mechanism, behavior, and graph representation. In the graph representation, a
circular node represents signal, an arrow represents template, and the T-shaped line represents
inhibition.

DNA toolbox modules

The third module for inhibition is driven by a type of signal DNA called an “inhibitor”. The
inhibitor can bind to more than one site in the target template, but does not elicit polymerase
or nickase reactions (due to sequence mismatch and incomplete recognition sequence, respec-
tively). Until dissociation of the inhibitor, the template is kept inactive. Note that the inhibitor
cannot be used as an input DNA.
Based on this platform, Montagne et al. developed the oscillator shown in Fig. 3 [3], and Padirac
et al. developed a two-state circuit and two-state switch [4]. The performance of their systems
has been experimentally verified. Theoretically, more complex systems have been proposed [5].
Chapter 3 What Does Design of DNA Molecules Make Possible? 113

Fig. 3: Oscillator and experimental results by Montagne et al. Reprinted from Molecular Systems
Biology, 2011 [3] (open access).

Simple Model
Because a DNA toolbox includes a non-linear reaction, it is hard to completely predict the
behavior of the system. For minimal refinement, the template is regarded as a black box as
shown in Fig. 4. The black box is a simple converter based on the reaction rate of Michaelis-
Menten. This model assumes that hybridization and denaturation (association and dissociation
between complementary DNA molecules, respectively) reach the equilibrium much faster than
the enzymatic reaction does. As a result, the equation in Fig. 4b is obtained for each module.
Based on the graph representation, an equation that represents the entire system can be de-
rived easily by adding reactions of templates that correspond to different signal DNA molecules
together. The degradation by the exonuclease is incorporated as an approximation by the first-
order reaction shown in Fig. 4d.

Fig. 4: Representation and Equation of a DNA Toolbox as a Black Box. A vertical line indicates
restriction, [ ] indicates concentration, and � and � are parameters; exo means exonuclease
Chapter 3 What Does Design of DNA Molecules Make Possible? 114

reaction.
Such modeling of DNA toolboxes was proposed by Padirac et al. for the first time [4]. The
equations are simple enough to allow a large system to be simulated in a very short time. This
advantage has been helpful in the evolution of a complex DNA toolbox system, which requires
several thousand cycles of evaluation. In the course of its evolution, a system with an interesting
pattern was algorithmically found [7].

Refined Model by DACCAD


Disadvantageously, the simple model does not explicitly describe the template DNA. Also, it
does not take into account the protection due to the binding of the signal DNA to the template
DNA from the degradation reaction of the exonuclease. Because the substrate concentration is
unknown, the saturation of polymerase or nickase cannot be calculated. The time lag due to
hybridization-extension-separation has been lost too.
To address these issues, Padirac et al. proposed a refined model that can determine the region
in the template the signal hybridizes to [4]. The model was then further improved by Aubert et
al. [5]. However, so many reactions are combined in the model that simulation by handwriting
is not applicable. To make a simplified simulation process, Aubert et al. developed DACCAD with
a graphical interface [5]. The software is freely available on the Internet.

Implementation for Experimentation


Once you have successfully simulated a system, you can start designing the actual DNA se-
quence. This process is described by Baccouche et al. in detail [6], including basic experimental
procedures and solutions to frequently encountered problems.
—Nathanael Aubert-Kato (Ochanomizu University)
—Translated by Ibuki Kawamata (Tohoku University)

34. DNA-Based Finite State Machine


Autonomous DNA computing, which performs staged computation by means of a series of reac-
tions that proceed autonomously, will be a key technology to achieve advanced information
processing based on molecular reactions. This section outlines two representative reaction
systems that implemented a DNA computer with internal states.

Second Generation DNA Computing


Reactions based on sequence-specific DNA hybridization can be regarded as symbol manip-
ulation based on string-encoded rules, i.e. “computation”. Adleman demonstrated that DNA
Chapter 3 What Does Design of DNA Molecules Make Possible? 115

reactions could solve mathematical problems (see section 36). Here, a DNA molecule is regarded
as a “recording tape” that contains a string of symbols, and each computation step is driven
manually. This type of DNA computing is called “classical” or “first generation”. On the other
hand, “second generation” DNA computing, or “molecular programming”, executes staged
computation by a series of reactions that proceed autonomously in a solution, for which several
computation models have been proposed.
When a DNA nanostructure is altered stepwise in a series of reactions, these structures can be
regarded as different “states” in computation. If a given structure induces a particular reaction
to change itself into a different structure, it can be regarded as the nanostructure referring to its
internal state to execute a computation. Recently, much work has been done on DNA logic gates
that is based on the strand displacement reaction of DNA (see section 29). This section will focus
on two enzymatic reaction systems that implement a “finite state machine” with internal states:
i.e., a finite automaton based on restriction enzymes, proposed early in the DNA computing field,
and a finite state machine that executes computation by repeating an extension reaction of DNA
polymerase.

DNA Automaton
“Finite automaton (FA)” is a virtual computation model that has internal states and a state
transition function. When FA receives a symbol string as input, it reads out the symbols se-
quentially to execute state transitions by referring to its state. When the final state is an accept
state, the symbol string is “accepted”. For example, in the following figure, upon input of the
symbol string “baba”, FA reads out the leftmost b, and a transition from the initial state S₀ to S₁
occurs according to the state transition function. Then the second a (from the left) is read, and
a transition from the state S₁ to S₁ occurs. When the third b is read, a transition from the state
S₁ to S₀ occurs, and when the fourth a is read, a transition from the state S₀ to S₀ occurs. In this
example, the state S₀ is an accept state, and FA accepts the input baba.

FA that accepts a symbol string containing an even number of bs. Each circled letter indicates
a state, and the double circle indicates an accept state. Each small letter above an arrow
represents a state transition function.
Shapiro et al. implemented FA in a reaction system that was composed of DNA and Fok I, a type
IIs restriction enzyme [1]. Type II restriction enzymes recognize a specific sequence in a double
Chapter 3 What Does Design of DNA Molecules Make Possible? 116

strand DNA, and cutss both DNA strands at a specific position (see section 47). Among type II
restriction enzymes, type II enzymes cut DNA outside their recognition sequences. In the figure
below, the “input DNA” encodes an input symbol string and has a terminal overhang sequence,
to which a complementary terminal overhang sequence in the “state transition function DNA”
can hybridize to cause the cleavage of the input DNA by Fok I. This corresponds to reading one
symbol to execute a state transition. The cleavage produces a new terminal overhang in the
input DNA, the next symbol is read out like before, and another state transition occurs. A 1-
to-3 base spacer sequence that is immediately to the right of the Fok I recognition sequence
beautifully encodes multiple states. For details, please refer to the original paper [1] and a review
in Japanese [2].

State transition reaction in a DNA automaton. N can be any base and each vertical line
indicates a base pair. A Fok I recognition sequence is boxed. Reprinted from Fig. 5.9 in [2].

Whiplash PCR
Hagiya et al. employed an extension reaction by a DNA polymerase that occurs when the 3’
terminus of a DNA hybridizes to a complementary sequence in the same molecule and serves as
primer, to devise a finite state machine that encoded stepwise state transition rules in a single
DNA molecule [3]. Usually, a DNA extension reaction happens in the presence of dNTP of all four
Chapter 3 What Does Design of DNA Molecules Make Possible? 117

bases in a reaction solution. If one of the bases has been depleted, the extension reaction halts
at a base complementary to it. This “polymerization stop” allows a base length that encodes a
state to undergo the extension reaction at a time.

Polymerization Stop. The reaction solution is depleted of dTTP. The extension reaction halts at
a base A in the template DNA.
The 3’ terminal sequence represents the “current state” of the finite state machine. The addition
of a new sequence to the 3’ terminus by the extension reaction corresponds to a state tran-
sition. When the added sequence hybridizes to another complementary sequence elsewhere
in the same molecule, it causes another state transition, executing a stepwise state transition
autonomously. This looks like whiplashing, hence the name “whiplash PCR”. In this reaction sys-
tem, the hybridization between complementary DNA sequences is an intramolecular reaction,
the reaction rate of which does not depend on DNA concentration, unlike in an intermolecular
hybridization reaction. Also, a plurality of finite state machines with different sequences can
execute a staged computation in parallel in a single reaction vessel. Komiya et al. have shown
that a benchmark problem of a small scale like Adleman’s can be solved using Whiplash PCR
with a limited number of cycles of the experimental procedure [4].

Challenges with DNA-Based Finite State Machines


Reaction systems described in this section have some disadvantages for using them for prac-
tical computation. DNA automatons cannot encode many states, and the hybridization rate is
concentration-dependent, raising the question of how to process a lot of data by the system.
On the other hand, Whiplash PCR, which repeats secondary structure formation and melting
of DNA, requires a high reaction temperature, and the input step is not yet optimal. In order to
implement advanced information processing and learning by DNA computing, a computation
mechanism with internal states is essential. Extensive discussion on a wide range of computa-
tion models and reaction systems to implement them will be required to propose novel reaction
systems to overcome these issues.
—Ken Komiya (TITECH)

35. Traveling Salesman Problem


This section reviews Adleman’s DNA-based solution method for directed Hamiltonian path
problems and then discusses the traveling salesman problem. Evolution computations such as
Chapter 3 What Does Design of DNA Molecules Make Possible? 118

genetic algorithms, as well as in-vitro selection, under the title of “From Search to Optimization”,
are also discussed briefly.

Adleman’s Idea and Experimentation


In 1994, the computer scientist Adleman succeeded in using DNA to solve a mathematical
problem called the “directed Hamiltonian path problem”, which can be regarded as the start
of the field of DNA computing.

For a digraph (a graph with directed edges) as shown above, the directed Hamiltonian path
problem asks to find a path that travels from a given start node to a given goal node, visiting
every node exactly once (“Hamiltonian path”).
In general, a problem that asks a solution that meets given requirements from a huge number
of candidate solutions is most often classified as an NP-complete problem. In an NP-complete
problem, in short, upon an increase in the problem size (the number of nodes and edges in the
case of a graph), the number of candidate solutions exponentially increase, so that the solution
search becomes extremely difficult. Of course, the directed Hamiltonian path problem is NP-
complete.
In classical DNA computing, in order to solve a search problem, individual candidate solutions
are represented by DNA molecules of different sequences, and all candidate solutions are
generated as a library of DNA molecules. Specifically, a large DNA library of different sequences
is prepared in a test tube. The entire library is then subjected to a series of selection steps to
identify a DNA molecule that meets the requirements of the solution, thus solving the search
problem.
Chapter 3 What Does Design of DNA Molecules Make Possible? 119

According to Adleman’s idea, each node in a graph is represented by a single-stranded DNA


molecule. Needless to say, different nodes have different sequences assigned. Each directed
edge between nodes (e.g. (3) -> (4) in the figure) is represented by the Watson-Crick complemen-
tary sequence joining a half of its tail (3) node sequence and a half of its head (4) node sequence.

By chemically synthesizing node DNA molecules and directed edge DNA molecules and anneal-
ing them in a test tube, a variety of double strand DNA molecules representing paths in a graph
can be obtained, as shown in this figure. This process corresponds to random generation of
paths in the graph. Here, neighboring DNA molecules can be covalently joined by means of
ligases.
Assume that you have obtained a sufficient number of DNA molecules, so that all paths in a
graph, especially those as long as (or shorter than) the Hamiltonian path, have been generated.
In this case, you will only have to choose a path that meets the requirements of the solution
(Hamiltonian path) from the paths on the graph that have been generated as DNA molecules.
In order to solve a Hamiltonian path problem, Aldeman performed a selection based on PCR,
electrophoresis, and magnetic beads. He used PCR primers that represented the start and the
goal, respectively, to select/amplify only paths that led from the start to the goal. From these,
he selected paths of the same length as that of the Hamiltonian path by electrophoresis. Finally,
using magnetic beads, paths that contained all nodes except the start and goal were selected.
This operation is equally applied to all DNA molecules present in the test tube. Because each
DNA molecule represents a path in a graph, such an operation is called “data parallelism”,
meaning that it is applied equally to all data. Thus, Adleman’s method can be regarded as
Chapter 3 What Does Design of DNA Molecules Make Possible? 120

parallel computing by data-parallelism.


Similarly, solution methods for other NP-complete problems have been proposed, some of
which have been experimentally investigated on a DNA basis. Particularly, the Boolean satis-
fiability problem is the most basic among NP-complete problems, for which Adleman, Lipton,
Hagiya, and others have proposed DNA-based solution methods and tested them in vitro.

From Search to Optimization


Classical DNA computing stochastically generates candidate solutions by DNA molecules. This
process seeks to prepare all candidate solutions, but it might be impossible for a problem of
larger size, even if each molecule corresponds to a specific candidate solution. In addition,
stochastic bias might result in only some of possible candidate solutions being generated.
In the real world, each solution often has a defined fitness value, wherein a solution with the
greatest (or smallest) fitness value, or at least a value as great (or small) as possible, is sought.
Such a problem is called an optimization problem.
The traveling salesman problem is a typical optimization problem and is somewhat similar to a
directed Hamiltonian path problem, but the graph has a defined edge length. In addition, it asks
for a cycle that starts and finishes at the same node: i.e., the shortest cycle that visits each node
exactly once, instead of a simple path. A problem that asks the shortest cycle is NP-complete,
and this problem can be regarded as an optimization problem to find the possible shortest cycle.
The traveling salesman problem has been extensively studied, and a wide variety of solution
methods have been proposed, providing a benchmark for other optimization problems.
A category of optimization methods applicable to various problems, called metaheuristics,
has been proposed. Major metaheuristics include annealing, Hopfield network, Boltzmann
machine, tabu search, genetic algorithm, group optimization, ant colony optimization, etc.
Among them, a genetic algorithm is a methodology to find a candidate solution with a greater (or
smaller) fitness value by subjecting a population of candidate solutions to artificial evolution.
Adleman’s method generates candidate solutions only once at the start, while genetic algorithm
generates new candidate solutions through mutation and crossover. Basically, however, both
of them depend on solution generation and selection. In addition, the mutation and crossover
procedures can be performed in a data-parallel manner. Therefore, it is quite natural to apply
Adleman’s method to genetic algorithm and other evolution computations.
On the other hand, there is an established method to search for a molecule with a particular
function (e.g., affinity to a target molecule) by subjecting a library of polymeric molecules such
as DNA, RNA, and proteins for artificial evolution. This is called artificial evolution of molecules
or in-vitro selection. In other words, this is a solution method for optimization problems about
molecules.
—Masami Hagiya (Tokyo University)
Chapter 4 Basic Textbook Knowledge
36. Importance of Multidisciplinary Knowledge:
Overview
To win BIOMOD, you have to educate yourself in molecular chemistry/physics, programming,
mechanical engineering, biology, and computer science. You can also organize a multidisci-
plinary team with students from other departments to invent a biomolecular system of great
originality that can surprise the world.

Designing Molecular Systems


In BIOMOD, the word design has a somewhat different meaning from its conventional meaning,
which is “design of a molecule”. In the field of chemistry, to design a molecule means “designing
a chemical structure with desired physical properties, based on the notion that each molecule
has distinct physical properties that depend on its chemical structure”. In BIOMOD, the meaning
of the word focuses on the programmability of DNA. DNA molecules with different sequences
are different from each other in chemical structure, but virtually the same in their physical
properties and ability to hybridize to complementary sequences. In other words, the base
sequence of a DNA molecule, rather than its physical properties, determines its interaction
partner and the strength of the interaction. By carefully combining a large amount of mutually
complementary sequences, you can construct a reaction system that behaves entirely in a
programmed fashion; i.e. you can “design the behavior of a system made of multiple molecules”.
What kind of knowledge is required to design such a molecular system?

Structural DNA Nanotechnology


“DNA nanotechnology” is the field of research that investigates the programmability of DNA.
Initially, DNA nanaotech focused on the use of sequence-specific hybridization of DNA to assem-
ble nanostructures [1]. Such study has recently been termed “structural DNA nanotechnology”,
which began with the finding by the chemist Seeman that a DNA molecule can create crossovers
with two or more other DNA molecules that are partially complementary to it. This prompted
computer scientists to systematically design DNA sequences, resulting in the development
of DNA origami and other related techniques (see section 18). In addition, we can think of
such systematic self-assembly of molecules as an information-processing step. We can use
these investigations to develop new computation models that are different from electronic
computing ones.
Chapter 4 Basic Textbook Knowledge 122

Dynamic DNA Nanotechnology


You, too, can assemble a DNA nanomachine, which is a DNA nanostructure capable of me-
chanical motion (see section 24). This technique is based on the strand displacement reaction
of DNA, identified by the physicist Yurke and colleagues [2]. The use of this reaction to build
logic gates has been extensively studied (see section 29). These reactions are generically called
“dynamical DNA nanotechnology”. To design such a dynamic system, you need to understand
thermodynamics. This chapter discusses some of relevant thermodynamic theories. We pre-
dict that, in the near future, research measuring the force generated by nanomachines will
become important; understanding some mechanical engineering will be key in this field of
study. In this field of mechanics you may have to learn about molecules, solution systems,
and other soft materials, as well as the so-called “four mechanics” (mechanics of materials,
fluid dynamics, thermodynamics, and dynamics of machinery), because the dimensions and
principles of molecular nanomachines are totally different from those that dictate the working
of conventional macro-scale mechatronics.

DNA Computing
In computer science, the field of research to develop novel computation models inspired by
natural phenomena is called “natural computing”. “DNA computing” started when the computer
scientist Adleman, the inventor of the RSA cipher, solved a traveling salesman problem using
DNA reactions (see section 35)[3]. After having succeeded in assembling DNA nanostructures
(structural DNA nanotechnology) and moving them (dynamical DNA nanotechnology), scientists
are aiming at constructing intelligent molecular systems (molecular robotics). In BIOMOD,
you don’t have to limit your project to DNA nanostructures. Try to exploit the potential of
biomolecules as much as possible to design an elaborate system that may not exist, or is not
yet known, in natural biological systems.

Transdisciplinary Team
As described above, successful BIOMOD projects require a transdisciplinary team. Because DNA
and protein are biological macromolecules, you need knowledge in chemistry and macromolec-
ular physics to experiment with and analyze them in depth. If you want to design a molecular
robot, you will need some knowledge in mechanical engineering and programming. If you are
familiar with electronics, you might be able to develop a novel measurement technique. In
addition, a broad range of knowledge in biology, medicine, and computer science will help you
identify an exciting project. However, it is impossible to learn all these things in such a short
period, which is why you need a diverse team. Team members must communicate to support
each other and work in an organized manner, putting the right people in the right places. Last
but not least, most Japanese students seem to have some trouble with their English skills. Do
not forget to give yourself language education, so you can work worldwide as a researcher or
engineer.
Chapter 4 Basic Textbook Knowledge 123

—Ken Komiya (TITECH)

37. Thermodynamics
This section introduces internal energy, enthalpy, entropy, and other concepts that are required
to understand the thermodynamic properties of DNA.

Basic Concepts: System and Surroundings


The universe is divided into two parts: a system and its surroundings. A system is what can be
analyzed, and all the rest is the surroundings. Every thermodynamic system can be classified
into one of three kinds: an open system, where transfer of matter and energy between the
system and the surroundings can occur, a closed system, where only energy can be transferred,
and an isolated system, where there is no transfer of energy or matter.

The energy of a system means the ability of the system to perform work. Work is the energy
required to move something against a resisting force. For example, when a gas has expanded, it
has exerted work on its surroundings. In thermodynamics, the energy contained in the system
is called the internal energy. It has been experimentally demonstrated that work and/or heat
from the surroundings cause a change in internal energy.

The First Law of Thermodynamics


The internal energy U is a state function that is defined solely by the system’s current state
(temperature, volume, etc.) and is independent of the path between states. Therefore, when
a system is changed from its initial state to another state, the sum of work and heat exerted
on the system is always the same and path-independent. We also know empirically that the
Chapter 4 Basic Textbook Knowledge 124

internal energy of an isolated system is constant. This is the first law of thermodynamics (the
law of conservation of energy), and expressed as follows.

(w: work exerted on the system, Q: heat transfer to the system, ∆U: change in the internal energy)

Enthalpy
When the volume of a system undergoes an infinitesimal volume change dV under a pressure
p, infinitesimal work d’w from the surroundings can be expressed as follows (pressure-volume
work).

When the pressure-volume work is the only work done on the system, the infinitesimal change
in internal energy dU can be written as follows, from the equations (1) and (2).

(d′Q: infinitesimal heat, d′w: infinitesimal work)


When the internal energy is changed at a constant volume, then dV = 0. Therefore, the change
in internal energy is equal to the heat transferred to the system. When the internal energy is
changed under constant pressure, the heat transferred is not equal to the change in the internal
energy, because the volume of the system can change. Therefore, a new state variable, enthalpy,
is introduced as follows.

An infinitesimal change in enthalpy can be written as follows.

Thus, when only pressure-volume work is exerted on the system under a constant pressure, from
the equation (3),
Chapter 4 Basic Textbook Knowledge 125

That is, the change in the enthalpy is equal to the heat transferred. In other words, enthalpy is
the energy contained in a system under a constant pressure.

The Second Law of Thermodynamics


In our daily life, some phenomena can occur spontaneously and others not. For example, boiled
water in a kettle spontaneously cools down to room temperature with time. On the contrary,
water at room temperature never boils spontaneously. The difference between spontaneous
and non-spontaneous processes can be summarized by the second law of thermodynamics (see
section 38 [2]). Clausius expressed this law as: heat transfer from a warmer to a colder body is an
irreversible process. In other words, heat can never pass from a colder to a warmer body without
some other change occurring at the same time.
“Irreversible” means that the system cannot return to the original state without some external
work exerted on the system by its surroundings. On the contrary, in a reversible process, it is
possible to return the system to its original state, without changing surroundings at all.

Inequality of Clausius
Imagine a heat engine that receives heat Qi from
Chapter 4 Basic Textbook Knowledge 126

n heat sources to perform work w in a cyclic process, then returns to the original state. With the
temperature of each heat source being T i,

holds*. This is a mathematical representation of the second law of thermodynamics, called the
inequality of Clausius. When n → ∞,

holds for any heat engine that continuously changes its state. In the inequalities (7) and (8), the
left and right hand sides are equal when the cycle is a reversible process.
Chapter 4 Basic Textbook Knowledge 127

Entropy
Entropy S is defined as follows.

The left-hand side represents a reversible change from state A to a state B. In a cycle in which
the system is changed from state A to B through an irreversible process, and then returned to A
through a reversible process,

based on the inequality (8). Therefore,

based on the equation (9). Thus, for any change of state in general,

holds. For an infinitesimal change, it can be written as: dS ≥ dQ/T. In an adiabatic process,
Chapter 4 Basic Textbook Knowledge 128

because d′Q = 0. This is called the law of entropy increase, meaning that any spontaneous
change in an isolated system does not cause a decrease in the entropy.

Entropy in Statistical Mechanics


In statistical mechanics, entropy can be expressed as follows (Boltzmann’s Principle).

W is the number of possible microscopic states of the system in a given macroscopic state. For
example, for a given volume, the position of molecules in the system cannot be determined
unequivocally. When a space of a volume V is divided into infinitesimal spaces of a volume δV
and n molecules are placed therein, the number of possible states is: W = (V⁄ V)ⁿ. Therefore,
most simply, an increase in the number of molecules and/or the volume results in an increase
in the entropy.
The higher the entropy is, the larger W is, i.e., the more disordered the system is. Thus, the law
of entropy increase means that, in a spontaneous process, the system moves toward a more
random, disordered state.
*For detailed derivation, see section 38 [2] and [3].
—Fumi Takabatake (Tohoku University)

38. Free Energy and Chemical Equilibrium


Free energy is an important thermodynamic state variable that can serve as a measure of
spontaneous process and equilibrium state. Gibbs energy is most frequently studied at constant
temperature and pressure, which is also closely related to equilibrium constant.

Gibbs Free Energy


When a state changes under a constant pressure and there is no work except pressure-volume
work, then dH = d′Q (see section 37). Because dS ≥ d′Q⁄T from entropy equations, dH -TdS ≤ 0.
Gibbs energy, a new thermodynamic state variable, is defined as follows.

An infinitesimal change can be written as dG = dH - TdS - SdT. In an isothermal process,


Chapter 4 Basic Textbook Knowledge 129

from the above-mentioned inequality. In other words, in a system at constant pressure and
temperature, a spontaneous change proceeds toward a minimum of the Gibbs energy. This
principle is used to measure the favorability of a reaction in a typical chemical experiment,
which is usually focused on a chemical reaction under a constant pressure.

Pressure Dependence of Gibbs Energy


A reversible change in a closed system where the work is only pressure-volume work is discussed
below. An infinitesimal change in three state variables (internal energy U, enthalpy H, and Gibbs
energy G) can be rewritten as follows,

wherein each depends on infinitesimal changes in two other state variables. Then, when G
is regarded as a function of temperature and pressure and totally differentiated, it can be
expressed as follows.

The subscripts in the derivative terms indicate the partial differentiation of G by T and p, with p
and T being constant, respectively. By comparing the equations (5) and (6),

Because S and V always have positive values, Gibbs energy is decreased by an increase in
temperature and increased by an increase in pressure.
Chapter 4 Basic Textbook Knowledge 130

When the pressure is changed from p₁ to p₂ at a constant temperature (dT = 0), the change in G
is as follows based on the equation (5).

In the case of ideal gas, wherein pV = nRT (the combined gas law),

Based on the Gibbs energy G⁰ for the state of a pure form (standard conditions) under the
standard pressure (p⁰ = 1 bar), the Gibbs energy under pressure p can be expressed as follows.

Alternatively, it can be written as follows by choosing a system of measurement in which p⁰ = 1.

Chemical Equilibrium
In a reversible reaction, when the rate of the forward reaction and the rate of the reverse reaction
balance each other and the concentration of the substances does not change macroscopically,
this state is called chemical equilibrium.
Look at the reversible reaction below.

The upper-case letters A, B, Y, and Z represent molecular species. Each vi is a stoichiometric


coefficient for the corresponding species (the relative molar amount used in the reaction). When
dnA mol of A and dnB mol of B have reacted in the forward reaction to produce dnY mol of Y and
dnZ mol of Z,
Chapter 4 Basic Textbook Knowledge 131

Therefore,

is the advancement (see section 39).


At a constant temperature and pressure, the total differentiation of the Gibbs energy in this
chemical reaction system is expressed as follows.

is a state variable called “chemical potential”, and this can be regarded as the Gibbs energy
per amount of substance of the component i. From the equation (12), the equation (13) can be
rewritten as follows.

Here, reaction Gibbs energy ΔrG is introduced as the change in the Gibbs energy due to the
advancement of the reaction. Based on the equation (15),

Thus, ΔrG can be expressed as the difference in chemical potential between the reactants and
the products in the reaction mixture composition. Because a spontaneous reaction proceeds
toward a decrease in Gibbs energy, the forward reaction will dominate when ΔrG < 0, whereas
the reverse reaction will dominate when ΔrG > 0. When ΔrG = 0, the reaction no longer proceeds
in either direction, resulting in an equilibrium state.
Chapter 4 Basic Textbook Knowledge 132

Equilibrium Constant and Gibbs Energy


In the case of an ideal gas, based on the equations (11) and (14),

pi is the partial pressure of substance . Therefore, using the standard Gibbs energy change,

the reaction Gibbs energy can be written as follows.

Because ΔrG = 0 in an equilibrium state,


Chapter 4 Basic Textbook Knowledge 133

based on the partial pressure piᵉq of each substance in the equilibrium state. When the ratio of
the partial pressure of the reactants and products in the equilibrium state is represented by Kp,

Kp is called a pressure equilibrium constant.


If we consider a typical solution instead of an ideal gas, we use a physical quantity ai, called
activity, instead of partial pressure, to express the following relationship:

The activity is usually a complex function of temperature, pressure, and amount of substance.
The equation is similar to that of a molar fraction or any other concentration. In particular, in
a dilute solution, it is equal to molar fraction = n�/n [1]. An equilibrium constant based on
activity

is called the “thermodynamic equilibrium constant”, based on which ΔrG⁰ = −RT lnK in a typical
solution.
In a dilute solution, the molar fraction of the solvent is substantially proportional to the molar
concentration. Therefore, by approximation, the activity is often replaced with the concentra-
tion and the following equation based on the concentration equilibrium constant Kc is used [2].

—Fumi Takabatake (Tohoku University)


Chapter 4 Basic Textbook Knowledge 134

39. Reaction Kinetics


Reaction kinetics is deals with the temporal advancement of a reaction (reaction rate). In addi-
tion to typical chemical reactions, the reaction mechanisms of DNA hybridization, denaturation,
and strand displacement have been investigated in terms of their reaction kinetics. This section
outlines some basics of reaction kinetics.

Reaction Rate Equation and Order of Reaction


Look at a chemical reaction below.

The advancement ξ, which describes the extent of the advancement of the reaction from the
initial state, is used to express the molar amount of each component at time t as follows.

Thus, the infinitesimal change in molar amount can be written as follows, relative to time.
Chapter 4 Basic Textbook Knowledge 135

When the volume V of the system is constant, the temporal change in the concentration of each
component ([A] = nA⁄V, …) can be written as follows.

v is called the reaction rate. An experimentally obtained reaction rate usually depends on the
reactant concentration, which is expressed by the following equation using a reaction rate
constant k.

This type of equation is called a “reaction rate equation”, and the sum of the exponents n = � +
ß +… is called the “overall reaction order”. The exponent of each molecular species is called the
order of reaction for that molecular species. In this case, the reaction is said to be �th in A and
ßth in B.

Elementary Reaction and Complex Reaction


A typical chemical reaction involves reaction intermediates, where the reaction proceeds step-
wise as follows.
Reactants –> Reaction intermediate –> … –> Product
A single-step reaction with no intermediate is called an “elementary reaction”, while a reaction
composed of more than one elementary reaction (i.e. it proceeds stepwise) is called a “complex
reaction”. In its reaction mechanism, the complex reaction is expressed by the order of elemen-
tary reactions.
In a typical chemical reaction, the order of reaction does not correlate with the stoichiometric
coefficients of the reactant. In an elementary reaction, they are equal to each other. For example,
an elementary reaction:
2A + B -> product
is a third-order reaction that is second order in A and first order in B. Therefore the rate equation
can be written as follows.
v(t) = k[A]²[B]
Chapter 4 Basic Textbook Knowledge 136

Integrated First-Order and Second-Order Reaction Equations


By integrating a rate equation, you can obtain the reactant and product concentration as a
function of time, which is called an “integrated rate equation”.
In a zero-order reaction, the reaction rate equation is written as v(t) = k, where the reaction
proceeds at a constant rate in a concentration-independent manner. Therefore, the reactant
concentration decreases linearly with time from the initial concentration [A]₀, like [A] = [A]₀ − kt.
What about first-order and second-order reactions? Consider the first-order reaction below.
A -> B
The reaction rate equation is as follows.

This differential equation can be solved easily to provide the following integrated rate equation,
with the initial concentration of A being [A]₀.

Thus, in the first-order reaction, the reactant concentration decreases exponentially with time
from the initial concentration. This equation can be rewritten as follows.

This means that if a chemical reaction has been observed experimentally or otherwise to be a
first-order reaction, the logarithm of the concentration will make a linear graph when plotted
against time, and we can obtain reaction rate constant from the slope.
Below is an example of a second-order reaction. When there is only one reactant, the reaction
rate equation is

and the integrated rate equation can be written as


Chapter 4 Basic Textbook Knowledge 137

or

When it includes two reactants, the reaction rate equation can be written as follows.

When the two reactants have the same concentration ([A] = [B]), the equation is the same as
the equation for a single reactant. When they have different concentrations, an integrated rate
equation can be written as follows.

Reversible Reaction and Concentration Equilibrium Constant


Here we illustrate how to express a complex reaction using a reaction rate equation, with a
simple reversible reaction as an example.

This reversible reaction is composed of the following two elementary reactions (forward (->) and
reverse (<-)).
(1)
Chapter 4 Basic Textbook Knowledge 138

(2)

The molecule A is consumed in the reaction (1) at the rate of k₁[A][B] and produced in the
reaction (2) at the rate of k -₁[Y][Z]. Thus, the apparent rate equation of the entire reaction can
be written as follows.

This can be solved either analytically or numerically to provide a concentration equation relative
to time.
At equilibrium, where the reaction rates of the forward and reverse reactions are equal, the
following equation holds.

This means that the concentration equilibrium constant Kc and the reaction rate constant have
the following relation.

DNA Reactions and Reaction Kinetics


By approximation, a DNA reaction can also be described by a similar chemical reaction rate
equation. We assume a two-state model (see section 41), in which a DNA molecule can only
take a single-stranded form or a completely double-stranded form at a single point in time.
For example, the reaction shown in the figure below can be regarded as a second-order elemen-
tary reaction as follows.
Chapter 4 Basic Textbook Knowledge 139

Therefore, you can experimentally observe the temporal change in fluorescence intensity (change
in the concentration of QL(overlined)) by fluorometry (see section 81) and fit it to the integrated
rate equation of a second-order reaction

to estimate the reaction rate constant k [1].

A schematic of the reaction and a graph of the corresponding fluorescence intensity [1]. Reprinted
with permission from PRL, 2003.
—Fumi Takabatake (Tohoku University)

40. Molecular Structure of DNA


DNA, or deoxyribonucleic acid, is a macromolecule that carries genetic information in the
nucleus or nucleoid of a cell; i.e., it stores the “program” essential to direct a biological system.
This section describes the molecular structure of DNA.

Double Helix
The double helix structure of DNA was reported by Watson and Crick in 1953 (Fig. 1)[1]. The
diameter of the double helix is about 2 nm, and its length is about 3.4 nm per turn. The double
helix contains two kinds of grooves that are different in width, with the wider one referred to
as major groove and the narrower one as minor groove. When two DNA molecules wind around
each other to form a double helix, we call this double-stranded DNA or a DNA double strand.
When they are separated from each other, each we call them single-stranded DNA or a DNA
Chapter 4 Basic Textbook Knowledge 140

single strand. A double helix usually exists as a “B form” structure, and as “A form” and “Z
form” structures only under particular conditions. These forms have different geometries and
dimensions (see the table below).

Fig. 1. DNA double helix.

Form Chirality Diameter Bases per turn Rise between


neighboring bases
A Right- 2.3 nm 11 0.26 nm
handed
B Right- 2 nm 10.5 0.34 nm
handed
Z Left-handed 1.8 nm 12 0.37 nm

Bases
DNA is composed of four distinct bases, i.e., adenine (A), guanine (G), cytosine (C), and thymine
(T) (Fig. 2). Each base, together with a sugar and a phosphate, forms a “nucleotide” unit (Fig. 2).
A compound made of a base bound to the 1’ carbon atom in the sugar is called a nucleoside, and
an ester compound made of a phosphate bound to the 5’ position of the nucleoside is called a
nucleotide [2]. In a single strand DNA, its terminus at a 5’ carbon atom is called a 5’ end, and that
at a 3’ carbon atom is called a 3’ end. The strand is written in the 5’ to 3’ direction (see Fig. 2).
Two DNA strands in a double strand DNA run in opposite directions (antiparallel).
Chapter 4 Basic Textbook Knowledge 141

Fig. 2.** Nucleotide and bases.

Base Pairing and Complementary Strands


In a DNA double helix, bases in the DNA form Watson-Crick base pairs through hydrogen
bonding. A Watson-Crick base pair can be formed only between A and T or between G and C (Fig.
3). An A/T pair is formed by two hydrogen bonds, while a G/C pair is formed by three hydrogen
bonds, providing a stronger interaction. As a result, a double helix with a higher GC content
is more stable [2]. For example, the sequence 5’-ATGCGTA-3’ can form a stable double helix
only with 5’-TACGCAT-3’. This property of bases is called complementarity, and these sequences
are called mutually complementary sequences or complementary strands. In addition, when
multiple base pairs are formed consecutively in a helix, they stack up to stabilize each other.
This interaction between base pairs is called stacking. In this manner, the DNA double helix
is stabilized both through the hydrogen bonding of Watson-Crick base pairs and the stacking
between base pairs. The length of DNA is expressed in the number of nucleotides (bases) (nt)
when it is in a single-strand state, and in the number of base pairs (bp) when in a double-strand
state. For example, a single strand DNA of 20 bases is 20 nt long, a double strand DNA made of
two completely complementary 20 nt-long strands is 20 bp long.
Chapter 4 Basic Textbook Knowledge 142
Chapter 4 Basic Textbook Knowledge 143

Fig. 3.** Base pairing between DNA strands.

Melting and Hybridization of DNA


The double helix structure of DNA resolves into single strands under heat, which is called
denaturation or melting. The reverse reaction (double strand formation) is called hybridization.
DNA is stable from 0˚C to 100˚C, so that the melting and hybridization can be repeated again
and again by multiple heating/cooling cycles [2].

Column: RNA
RNA (ribonucleic acid) is a macromolecule that is very similar to DNA. There are two major
differences between them. First, the nucleotide sugar in RNA is a ribose with a hydroxyl group
(-OH) present on the 2’ position. Second, RNA uses a base U (uracil), instead of T (thymine) in
DNA. U and T have the same structure except for one methyl group (-CH₃) that is present only
in T.

—Masamune Morita and Masahiro Takinoue (TITECH)

41. Thermodynamics of DNA Hybridization


DNA forms base pairs between A and T or G and C (see section 40). The association of two
mutually complementary DNA strands through base pairing is called hybridization. The stability
of hybridization can be estimated thermodynamically.

DNA Hybridization: Two-State Model


In a short DNA strand of about 100 bases or less, the hybridization can be modeled as the
chemical equilibrium of a reaction that involves only two states: a single-strand state before and
a double-strand state after hybridization. This is called a two-state model. This model may not
be applicable to a longer DNA of kilobases or more, because partial dissociation of base pairs
can occur.

DNA Hybridization and Melting Temperature


Let a single strand DNA be S, a complementary single strand DNA be S(overlined), and a double
strand DNA formed by them be D. In the two-state model, the chemical equilibrium is as follows.
Chapter 4 Basic Textbook Knowledge 144

With the initial DNA concentration (concentration immediately after mixing) being [S] = [S(overlined)]
= C and the hybridized fraction in the equilibrium state being �, the equilibrium constant K is as
follows.

Thermodynamically (see sections 37 and 38), with regard to free energy ∆G, enthalpy ∆H, and
entropy ∆S,
∆G = -RT lnK [3]
∆G = ∆H - T∆S [4]
holds (T: absolute temperature; R =1.987 x 10-³ [kcal mol-¹ K-¹]: gas constant; ln: loge ). From this,

Here, the temperature at which the ratio of hybridized DNA molecules to single-stranded DNA
molecules is 1:1 is called the melting temperature (Tm). Because � = 0.5 and T = Tm, the equation
[5] can be transformed into

.
If the initial concentration C and the thermodynamic parameters ∆H and ∆S are known, Tm
can be calculated. Tm is a widely used measure of stability of DNA hybridization. A higher Tm
indicates a higher stability of hybridization.

Intramolecular Reaction and Melting Temperature of DNA


The stability of a DNA structure with intramolecular base pairing, such as a hairpin, can also
be estimated from its melting temperature. Such an intramolecular reaction is not called
Chapter 4 Basic Textbook Knowledge 145

hybridization, but still can be handled similarly. Imagine a single strand DNA S and its hairpin
structure state H. In the two-state model,

{width=”3.75625in” height=”0.5399617235345582in”}
With the initial DNA concentration being [S] = C and the fraction of the hairpin form in an
equilibrium state being �,

{width=”3.75625in” height=”0.5465037182852144in”}
Similarly,

Tm does not depend on C, unlike in a hybridization reaction.

Calculation of Free Energy ∆G and Prediction of Tm Using the


Nearest-Neighbor Method
Now you can calculate the Tm, i.e. the stability of DNA, once you know its thermodynamic
parameters ∆H and ∆S. This is called the Nearest-Neighbor (NN) method. The NN method
calculates ∆H and ∆S, and is introduced using the DNA in FIG. 1 as an example. In this model,
the stacking interaction between neighboring base pairs, rather than the energy of hydrogen
bonding in each base pair, is considered to dominate the stability of hybridization. In the case
of the DNA in FIG. 1, each base pair is paired with a neighboring base pair, from the left to the
right, as follows.
Chapter 4 Basic Textbook Knowledge 146

∆H and ∆S for each pair (Table 1) are summed up to obtain the ∆H and ∆S of the entire DNA
hybridized. The effect of strand termini can be calculated using GC-term and AT-term in Table 1.
Thus,

Similarly,
∆S = -0.4827 [kcal x mol-¹ x K-¹]
When the initial concentration C = 0.5 mM,
Tm = 345.48 [K] = 72.3 [°C]
from the equation [6]. Note that this is a value in 1 M Na+. The salt concentration dependence
of Tm is due to the salt concentration dependence of ∆S. Empirically,
∆S([Na+ ]) = ∆S(1 M)+ 0.368N ln([Na+ ] M) [10]
where N is the number of stacks (base length: -1) [2]. For example, in 0.1 M Na+,
Tm = 332.80 [K] = 59.6 [°C]
because
∆S(0.1 M) = ∆S(1 M)+ 0.368 × (23-1) × ln(0.1)
= -0.50225 [kcal mol-¹ K-¹]
That is, a lower salt concentration results in a lower Tm.
Last, we discuss a structure with an unpaired base (mismatch) such as a hairpin loop and a
bulge loop. In this case, the NN method is not enough; we need a parameter to account for the
mismatch. For example, for a loop made of n bases, the following holds (Jacobson-Stockmayer
equation).
∆H = 0
Chapter 4 Basic Textbook Knowledge 147

∆S (n > 9) = DS(9)-1.75R ln(n/9)


∆S for n ≤ 9 is shown in Table 2.

Fig. 1. Hybridized DNA. Each vertical line indicates base pair formation.
Table 1. Thermodynamic parameters (1 M Na+)[2].
Sequence ∆H (kcal•mol-¹) ∆S (cal•mol-¹•K-¹)
GC/CG -9.8 -24.4
CG/GC -10.6 -27.2
GG/CC CC/GG -8.0 -19.9
CA/GT TG/AC -8.5 -22.7
GT/CA AC/TG -8.4 -22.4
GA/CT TC/AG -8.2 -22.2
CT/GA AG/TC -7.8 -21.0
AA/TT TT/AA -7.9 -22.2
AT/TA -7.2 -20.4
TA/AT -7.2 -21.3
GC-term 0.1 -2.8
AT-term 2.3 4.1
Symmetry correction 0 -1.4

Table 2. Thermodynamic parameters for a loop (1M Na+) [3]


Loop length (n) ∆S [cal mol-¹ K-¹]
3 -18
4 -18
5 -18
6 -17
7 -19
8 -18
9 -21
Chapter 4 Basic Textbook Knowledge 148

Sequence ∆H (kcal•mol-¹) ∆S (cal•mol-¹•K-¹)


GC/CG -9.8 -24.4
CG/GC -10.6 -27.2
GG/CC CC/GG -8.0 -19.9
CA/GT TG/AC -8.5 -22.7
GT/CA AC/TG -8.4 -22.4
GA/CT TC/AG -8.2 -22.2
CT/GA AG/TC -7.8 -21.0
AA/TT TT/AA -7.9 -22.2
AT/TA -7.2 -20.4
TA/AT -7.2 -21.3
GC-term 0.1 -2.8
AT-term 2.3 4.1
Symmetry correction 0 -1.4

Table 1. Thermodynamic parameters (1 M Na+)[2].


Loop length (n) ∆S [cal mol-¹ K-¹]
3 -18
4 -18
5 -18
6 -17
7 -19
8 -18
9 -21

Table 2. Thermodynamic parameters for a loop (1M Na+) [3].


—Masahiro Takinoue (TITECH)

42. Strand Displacement


A basic technique to implement a dynamic system in the field of DNA nanotechnology is a DNA
strand displacement reaction. This reaction replaces a strand in a double helix with a third
strand. This reaction has much fewer restrictions on buffer, temperature, and other conditions
than an enzymatic reaction, and only involves DNA. It is employed in the transformation of DNA
nanostructures and the conversion of DNA sequences.

Replacement of a Strand in a Double Helix


Techniques to assemble static nanostructures from double helical DNA units have significantly
progressed (see section 17). One advancement is a technique that solely relies on DNA to achieve
Chapter 4 Basic Textbook Knowledge 149

a system capable of dynamic changes [1] (see section 23); this technique is based on the DNA
strand displacement reaction (DSD) shown in Fig. 1. It is also called a “toehold-mediated strand
displacement”.
In the initial state, the green and red DNA strands form a double helix. When the blue single
strand DNA is inputted, it forms a double helix with the red DNA, releasing the green DNA as a
single strand. The number of hydrogen bonds between complementary bases is greater after
the reaction, indicating that the post-reaction state is more favorable in terms of free energy
(see section 38).

Fig. 1. Strand displacement reaction.

Reaction Proceeds through Branch Migration


This reaction proceeds from the left to the right through more than one intermediate state.
Some intermediate states are shown in Fig. 2. At first, the blue DNA hybridizes to the single-
stranded, leftmost region in the red DNA. Then, the blue and green DNA strands, which are
partially identical to each other, compete for hydrogen bonding to the red DNA strand.
The site of the competition is called a “branching point”, which wanders to the left and the right.
This reaction is called “branch migration”. In the end, as shown in the right in Fig. 1, the blue DNA
completely replaces the green DNA to form a stable double helix with the red DNA, releasing the
green DNA as a single strand.
Chapter 4 Basic Textbook Knowledge 150

Fig. 2. Intermediate states in the strand displacement reaction with the branching point wan-
dering to the left and the right, which is called branch migration.

Fuel Drives Reaction


Usually, the reaction is schematically depicted by omitting the double helix structure, and
representing each DNA by an arrow from the 5’ end to the 3’ end (see Fig. 3). The sequence is
divided into two or more reaction units, each labeled with a letter. In this figure, each uppercase
letter and the corresponding lower case letter represent mutually complementary sequences.
When the green single strand DNA is regarded as an output, the blue DNA is a “fuel” to drive
the reaction, and the blue/red complete double helix after the reaction is a waste product. The
region a in the red DNA, which is initially single-stranded, is called a “toehold” to initiate the
reaction.

Fig. 3.** Schematically depicted strand displacement reaction


and components.

Toehold Sequence Determines Reaction Rate


The reaction rate depends on the length of the toehold sequence and the DNA concentration.
To understand the reaction rate, we can consider Fig. 3 to be a two-molecule reaction that can
Chapter 4 Basic Textbook Knowledge 151

be expressed by a chemical reaction equation. The reaction rate constant (k) can be changed by
manipulating the length (n) of the toehold sequence a. According to [2], when the sequence b
to be displaced is 20 base long in 11.5 mM MgCl₂ at 25˚C, the following equation can estimate
an approximate value of k.

As understood from the equation, when the toehold sequence is lengthened by 1, 2, …, 6 bases,
the value of the rate constant increases exponentially, by 10, 100, …, 1000000-fold. Above 6
bases, it reaches a plateau. For example, when the initial concentration of two molecules in
the left hand side is 10 nM, the time necessary to consume 50% of the starting material can be
adjusted by orders of magnitude, eg. 3 minutes for 6 bases, 30 minutes for 5 bases, 5 hours for
4 bases…. This property is important when applying the strand displacement reaction to more
complex systems (see section 28).
For more detailed analysis, the reaction can be divided into three steps: hybridization (two-
molecule reaction), denaturation (single-molecule reaction), and branch migration [2]. For
computer-aided analysis and design of the reaction, an online simulator is available (see section
67). To experimentally determine the exact rate constant, the reaction can be subjected to
fluorometric time-lapse observation in a test tube (see sections 51 and 81).

Variations of Strand Displacement Reaction


The above is a basic strand displacement reaction. Many variations have been reported that in-
corporate ingenious sequence combinations. Fig. 4 schematically shows a “toehold exchange”[2],
which employs two toehold sequences to allow both forward and reverse reactions, a “remote
toehold”[3], which controls the reaction rate by placing a linker between the toehold sequence
and the sequence to be displaced, and an “associative toehold activation” [4], which is based
on a fuel composed of multiple molecules.
Chapter 4 Basic Textbook Knowledge 152

Fig. 4. Variations of strand displacement.


—Ibuki Kawamata (Tohoku University)
Chapter 4 Basic Textbook Knowledge 153

43. Secondary Structures of DNA


The secondary structure of a DNA molecule is defined by the set of base pairs formed in the DNA
molecule. In order to construct a DNA-based molecular robot, you have to predict the behavior
of a given DNA molecule, and design a base sequence that will work exactly as expected. To
prepare for that, this section discusses secondary DNA structures.

Secondary Structure of DNA


In solution, DNA can form a variety of structures. This is because it contains a multiple base
pairs made through hydrogen bonding between mutually complementary bases (see section
41). Here, the structure of a DNA described by the identity of base pairs along the base sequence
is called the secondary structure of DNA (see below).
Chapter 4 Basic Textbook Knowledge 154

A secondary structure may be composed of a single DNA molecule or a plurality of DNA


molecules, but with the proviso that every base pair is only formed between two bases. Actually,
three bases can form a triple-stranded structure through non-canonical hydrogen bonding,
or two or more base pairs can interact with each other, resulting in higher DNA structures.
However, in most practical cases, you will just have to consider the secondary structure.
Consideration of any higher structures comes later (see Column: Protein Structures).

Different Secondary Structures


There are various secondary structures, which are named after their shapes (see below).
Chapter 4 Basic Textbook Knowledge 155

-Dangling end: a single-stranded region at the end of a DNA strand, with no hydrogen bond.
-Hairpin loop: a hairpin-like loop structure. Includes only one base pair.
-Stacking pair: consecutive base pairs.
-Bulge loop: a structure that contains a loop in only one of the two DNA strands. Includes two
base pairs.
-Internal loop: a structure that contains a loop in each of the two DNA strands. Includes two
base pairs.
-Multibranch loop: a loop from which three or more helices exit. Includes three or more base
pairs.
Although not depicted in the figure, DNA can also form a “pseudoknot” structure. For details of
these structures, see [1] and other texts.

Secondary Structure Prediction and Design


In order to manufacture parts of a molecular robot from DNA and control them, you need two
techniques (see below).
Chapter 4 Basic Textbook Knowledge 156

One is to predict the structure of a given base sequence in a solution and its stability, which
is called “secondary structure prediction” of DNA (see section 63). Another is to find a DNA
sequence that will form a desired structure, which is called “sequence design” of DNA (see
section 64). Both techniques are basic to and essential for controlling chemical reactions of DNA.

Column: Protein Structures


Like DNA, proteins make various structures in vivo, but in a far more complex manner. Pro-
teins have tertiary and quaternary structures that result from complex combinations of ionic
bonding, hydrophobic interactions, and the like as well as hydrogen bonding. For a typical DNA
Chapter 4 Basic Textbook Knowledge 157

assembly, you will only have to consider its secondary structure. For most proteins, however,
you will have to take their tertiary and/or higher structures into account.
By analogy with parts of a robot, you can understand why most people use DNA to make
programmable biological parts, instead of proteins.
Parts for a robot or a similar system must be designed to work exactly as expected, but the
structure of a protein is very difficult to predict, seriously hindering designing one.
In the future, however, a day might come when we can overcome this technological limitation
and design any desired protein and accurately predict and control its behaviors. If so, it will
lead to extremely useful biological robots, because proteins have diverse functions that are
not possible with DNA.

—Fumiaki Tanaka (AIST)

44. DNA Synthesis


In the field of life sciences, chemically synthesized DNA plays an essential role in genome
sequencing, DNA profiling in criminal investigations, viral genotyping in the medical field, and
almost all processes in biotechnology that have something to do with genes.

Phosphoramidite Method
The most common method to chemically synthesize DNA is the solid-phase phosphoramidite
method. In this method, the starting material is the first base of a desired DNA that is linked to
a solid phase carrier (controlled pore glass column) through an ester bond. To its 5’ end, then,
nucleic acid monomers are coupled one base at a time. These “phosphoramidite” monomers,
named after a phosphate diester precursor moiety contained in their chemical structure, have
protecting groups introduced on the base moiety, the 5’ hydroxyl group of the deoxyribose, and
the phosphoramidite moiety.
The phosphoramidite method is composed of a total of six reaction steps.
Chapter 4 Basic Textbook Knowledge 158

Step (1): detritylation


Both the first base on the CPG column and the phosphoramidite monomer are protected with
a dimethoxytrityl (DMTr) group at the 5’ hydroxyl group. In the reaction scheme, the 5’ end of
the DNA molecule being synthesized is always protected by the DMTr group, except during the
coupling. This step removes the DMTr group using acid (3% TCA/dichloromethane) to expose
the 5’-hydroxyl group, which subsequently serves as a reaction site.

Step (2): activation


The phosphoramidite moiety is activated to allow its coupling to the 5’-hydroxyl group produced
in the detritylation step. Specifically, an activating agent such as 1-tetrazole is mixed with the
phosphoramidite monomer in a solution.
Chapter 4 Basic Textbook Knowledge 159

Step (3): coupling


The activated phosphoramidite monomer is then introduced into the CPG column and coupled
to the 5’-hydroxyl group.

Step (4): capping


When the reaction is successful, about 99% of the starting material is now one base longer
(i.e., its 5’ end has the DMTr group derived from the phosphoramidite monomer). However,
the remaining 1% still has the hydroxyl group at the 5’ end unreacted, which will react with
a phosphoramidite monomer in the subsequent cycle, resulting in an unwanted side-product
that is one base shorter. To avoid this, the unreacted 5’-hydroxyl group is destroyed using acetic
anhydride.

Step (5): oxidation


The phosphorus from the phosphoramidite monomer is in a trivalent form. It is oxidized into a
pentavalent phosphate diester.
Steps (1) to (5) are repeated the number of times that is equal to the number of bases required,
before step (6).

Step (6): detachment/deprotection


The CPG column is then charged with a concentrated aqueous ammonia solution and left at
room temperature for 1 hour. This cleaves the ester bond between the solid phase carrier and
the first base, releasing the DNA product into the concentrated aqueous ammonia solution.
Because all the protecting groups present on the phosphoramidite monomer, except DMTr, are
designed to be removable under basic conditions, you can remove them by directly heating this
concentrated aqueous ammonia solution at 55 ˚C for 16 hours.
After that, the sample is subjected to DMTr removal and desalting through a DNA purification
column (see section 78).

Column: Avoid Moisture


In the phosphoramidite method, you must note that the steps (2) and (3) are extremely
sensitive to moisture, because water competitively inhibits the coupling of the 5’ hydroxyl
group to the phosphoramidite moiety. Because reagents for automated synthesizer easily
absorb moisture, you must not let them contact air for too long. Some laboratories have a
saying a rainy day is not for DNA synthesis. Many providers of custom DNA synthesis services
are located in Hokkaido, where there is no east Asian rainy season.
Chapter 4 Basic Textbook Knowledge 160

Automated DNA Synthesizer


Conveniently, this whole process can be done automatically by an automated DNA synthesizer.
Automated DNA synthesizers from ABI, which were once employed in many laboratories, are
no longer produced. Repair parts are not readily available either. If you have a chance to use
one, take good care of it. At present, equivalents are available from Nihon Techno Service
(http://www.ntsbio.com). There is also a foreign supplier of instruments that can synthesize
up to 192 different DNA samples simultaneously (http://bioautomation.com), without which
the DNA origami technique based on multiple staple structures would never have come into
existence.
—Akinori Kuzuya (Kansai University)
Chapter 5 Techniques/Materials to
Boost DNA
45. Enzymes
Most enzymes are proteins and can catalyze a variety of chemical reactions with remarkable
efficiency. Some of them are driven by a nucleotide energy source such as ATP or GTP, while
others are nucleotide-independent. This section summarizes the most basic enzymes that can
manipulate DNA.

How Enzymes Behave


Let’s discuss a reaction between reactants A and B to form a product C that proceeds through
a transition state AB‡. An enzyme can promote the reaction by stabilizing (lowering the energy
required for) the transition state.
Chapter 5 Techniques/Materials to Boost DNA 162

Michaelis-Menten Equation

The reactants A and B are called substrates (S) of the enzyme (E). With the product C being P, the
rate equation of the reaction can be as follows.

equation 2

If the first step (E + S <-> ES) in the reaction readily reaches chemical equilibrium, the second
step (ES -> E + P) is the rate-limiting step in the reaction. When the dissociation constant Ks =
[E][S]/[ES] (equation 1) and the total enzyme concentration [E]₀ = [E]+[ES] (equation 2), equation
2 can be substituted into equation 1 to provide the following equation.
Chapter 5 Techniques/Materials to Boost DNA 163

equation 3

The rate v to yield the product can be written as follows.

equation 4

Equation 3 can be substituted into equation 4, with Vmax = k+₂[E]₀, to provide the well-known
Michaelis-Menten equation, as follows.
Chapter 5 Techniques/Materials to Boost DNA 164

equation 5

Equation 5 holds true when the second step (ES -> E+P) is the rate-limiting step (i.e., k-₁ >> k+₂).
You can derive a similar equation assuming a steady state.. In that case, Ks in the equation 5 is
substituted by Km as follows.

equation 6

If k-₁ >> k+₂ in the equation 6, then k+₂ can be ignored so that Km\= Ks.

Kinetic Parameters and Thermodynamics


The corresponding changes in the energy can be schematized as follows based on E, S, and P.
Chapter 5 Techniques/Materials to Boost DNA 165

The energy change dG⁰ in the first step (E+S <-> ES) takes a negative value, and is related to
Km (see above). After that, the energy change dG‡ in ES -> ES‡ takes a positive value, and is
related to k+₂ (see above). An enzyme can stabilize the transition state ES‡ (i.e., lower the energy)
to promote the reaction. As a result, the sum of the two energy changes (dGT‡ = dG⁰ + dG‡) is
important for the overall reaction rate, and its value can vary greatly among different enzymatic
reactions.

Enzymes Relevant to BIOMOD


In BIOMOD, you probably will use nucleic acids as building blocks, and thus will need enzymes
that act on nucleic acids. Below is a brief summary of enzymes that can amplify nucleic acid
production (polymerases), cleavage (restriction enzyme/nucleases), and connection (ligases).
See the following sections for details.

Polymerases

This family of enzymes synthesizes a nucleic acid macromolecule, such as DNA or RNA, using
DNA or RNA as a template.
DNA polymerase, which synthesizes DNA, can add a nucleotide to the OH group (hydroxyl group)
at the 3’ end of a DNA strand (i.e., the nascent chain extends in the 5’ to 3’ direction). It requires
a “primer” sequence to initiate synthesis.
RNA polymerases, which synthesize RNA, include those which use DNA as a template, those
which use RNA as a template, and those which do not require any template (e.g., poly(A)
polymerase, which add a polyA sequence to the 3’ end of mRNA). Some RNA polymerases can
synthesize an artificial nucleic acid, and some can even synthesize a natural nucleic acid using
an artificial nucleic acid as template.
Chapter 5 Techniques/Materials to Boost DNA 166

Restriction Enzymes

A restriction enzyme can recognize a specific sequence to cleave it. Each cleaved fragment ends
up with either a blunt end, in which the ends of both strands are cut evenly, or a sticky end, in
which one strand overhangs. The most popular ones are so-called “type II” restriction enzymes,
each of which recognizes a palindrome sequence (i.e., a sequence that reads the same on one
strand in the 5’-3’ direction and on its complementary sequence in the 5’-3’ direction).

Nucleases

This family of enzymes hydrolyzes the phosphodiester bond between the sugar and the phos-
phate. An endonuclease cleaves a nucleic acid sequence in the middle (endo) of it, whereas an
exonuclease cleaves it from its end (exo).

Ligases

A ligase connects the 5’ end of a nucleic acid molecule to the 3’ end of the same or another
molecule through a phosphate diester bond. These enzymes can cut and sew polynucleic acids.
Hisashi Tadakuma (Kyoto University)

46. Polymerase Reactions


Polymerases are among the most commonly used enzymes in DNA computing and assembly.
This section outlines polymerase as a replication tool, and PCR as an example of replication
using a polymerase.

DNA Polymerase
Enzymatic reactions are important to build for nucleic acid building blocks because they can di-
rectly manipulate/process DNA/RNA strands of interest. There are many examples of enzymatic
reactions producing a desired design, e.g. in the implementation of a DNA computing reaction
or the preparation of a DNA hydrogel. Polymerase enzymes are in charge of “copying”, which is
particularly important for bioengineering. The structures and mechanisms of polymerases have
been extensively studied from a purely scientific viewpoint, but are not discussed in this section.
By definition, DNA polymerase is an enzyme that extends (polymerizes) DNA. Along a template
DNA, it synthesizes the complementary DNA sequence in the 5’-3’ direction, which starts from
a primer DNA that has been hybridized to part of the template. During this process, one of
the dNTPs (deoxynucleoside triphosphates) in the solution, i.e. dATP, dTTP, dCTP, or dGTP, is
consumed as a substrate per corresponding base.
Chapter 5 Techniques/Materials to Boost DNA 167

Figure based on [2].


There are many types of DNA polymerase. For example, a major manufacturer, NEB (New
England Biolabs), now lists as many as 25 different DNA polymerases in its catalogue [1], all of
which have individual characteristics. Several parameters useful to compare them are described
below.
3’->5’ Exonuclease activity: An exonuclease activity is the ability to degrade DNA from its end,
base by base. In this case, the degradation occurs in the 3’ to 5’ direction. It may seem somewhat
contradictory to have both extension and degradation functions, but both are required for
replication error correction. If an incorrect base is inserted, this function can remove the base
and repeat synthesis at that position.
5’->3’ Exonuclease activity: Likewise, this can degrade DNA in the 5’-3’ direction.
Strand displacement activity: When the extension proceeds in the 5’-3’ direction and finds any
DNA already hybridized to the template, this function removes it to continue the synthesis.
Thermostability: At high temperature, most proteins are denatured and lose their function.
Thermostable DNA polymerases that remain active even at high temperatures exist and are very
useful for facilitating the PCR technique as described below. For example, DNA polymerases that
appear in this book have the following characteristics according to NEB [1].
Bst DNA polymerase, large fragment (used for DNA Toolbox)
3’->5’ Exonuclease: -
5’->3’ Exonuclease: -
Chapter 5 Techniques/Materials to Boost DNA 168

Strand displacement: ++++


Thermal stability: +
Phi29 DNA polymerase (used to form DNA physical hydrogel)
3’->5’ Exonuclease: ++++
5’->3’ Exonuclease: -
Strand displacement: +++++
Thermal stability: -
Taq DNA polymerase with standard Taq buffer (used for PCR)
3’->5’ Exonuclease: -
5’->3’ Exonuclease: +
Strand displacement: +*
Thermal stability: ++
Note: - indicates the absence of the corresponding activity, and the number of + indicates the
activity level. * The displaced DNA strand is degraded.

Column: PCR
The below PCR technique is among the most popular DNA polymerase-based reactions, (see
the figure below); it can amplify any desired portion of a double strand DNA. First, the double
strand DNA is heated in an aqueous solution into single strand DNAs. The solution includes
a large amount of primers that specify the portion to be amplified (green and red arrows in
the figure) in advance, which undergo hybridization as a result of annealing. In this state, DNA
polymerase starts extension to synthesize mutually complementary DNA strands (light green
and light red). After that, it is heated back into the single strand DNA state and subjected to
annealing again, which allows the primers to hybridize to the synthesized DNA, to initiate
another extension reaction. By repeating this cycle, a double strand DNA solely composed of
the desired portion is replicated exponentially, as indicated by yellow in the figure. Thanks to
thermostable DNA polymerase, you no longer have to add enzyme after each cycle.
Chapter 5 Techniques/Materials to Boost DNA 169

Figure based on [2].

RNA Polymerase
RNA polymerase is an important enzyme that serves as a central component of transcription
machinery in vivo. In particular, DNA-dependent RNA polymerase uses a DNA template to
synthesize its complementary RNA strand.
One representative RNA polymerase is so-called T7 RNA polymerase. As shown in the figure
below, T7 RNA polymerase uses NTPs (nucleoside triphosphates) as substrate to synthesize RNA
from a double stranded DNA template, proceeding from a specific “initiation sequence” called
T7 promoter sequence toward the 3’ direction. This enzyme is commonly employed in the field
of DNA/RNA nanotechnology and DNA computing, e.g. in reaction systems such as RNA tiles [3,
4] and RTRACS [4].

—Shogo Hamada (Cornell University)

47. Restriction Enzymes


A restriction enzyme can recognize a specific sequence to cleave it or a nearby sequence. In
BIOMOD, this kind of enzyme is employed not only as a processor of DNA materials, but also as
a system component for signal generation etc.
Chapter 5 Techniques/Materials to Boost DNA 170

History of Restriction Enzymes


Restriction enzyme research started after WW II, and great progress was made in the 1970s
(Smith, Arber, and Nathans won the Nobel Prize in Physiology or Medicine 1978).
In 1952, Luria et al. (University of Illinois; developed LB medium for E. coli) found that a
phage grown in a bacterial strain somehow became unable to infect another host strain. This
phenomenon of “host-controlled restriction” suggested that certain bacterial strains might have
a defense mechanism that could recognize and degrade a foreign DNA such as that of phage (but
not their own DNA).
In 1968, Yuan et al. (University of Cambridge) hypothesized that E. coli K12 should have a
“restriction” enzyme for specific DNA degradation; they tried to purify it, and discovered type
I restriction enzymes. However, type I restriction enzymes are no longer used as a genetic
engineering tool because they can only randomly cleave at sites that are at least 1 kbp away
from a recognition site.
Type II restriction enzymes, now commonly used, were proposed and discovered by Werner
Arber (University of Geneva) and Hamilton Othanel Smith et al. (Johns Hopkins University) in
1968.

Restriction Enzymes for DNA Manipulation


Each cleaved fragment has either a blunt end, in which the ends of both strands are cut even,
or a sticky end, in which one strand overhangs. The most popular ones are so-called type II
restriction enzymes, each of which recognizes a palindrome sequence. In both of the examples
shown below, it reads the same on one strand in the 5’-3’ direction and on its complementary
sequence in the 5’-3’ direction. HindIII produces sticky ends (left), while EcoRV produces blunt
ends (right).

When you use a restriction enzyme, you must pay attention to the salt concentration. The
activity of a typical restriction enzyme is dependent on monovalent ion (K+, Na+, Cl-, OAc, etc.),
divalent ion (Mg²+ etc.), and/or pH. From optimized buffer solutions available from enzyme
manufacturers, you can choose one appropriate for the enzyme to be used. When you have to
use more than one enzyme in combination, you can refer to buffer compatibility charts and apps
Chapter 5 Techniques/Materials to Boost DNA 171

provided by the same manufacturers. Recombinantly improved enzymes that can function in a
single buffer solution have also been developed.
Even when following the manufacturers’ instructions, you need to monitor the enzymes care-
fully. Each enzyme is shipped in a 50% glycerol solution for storage purposes, and too much
glycerol impairs the recognition specificity of the enzyme, inducing off-target cleavage (“star
activity”). There are commercially available recombinant enzymes that have less star activity.

Restriction Enzyme as Signal Generator


For BIOMOD, you might want to construct a multi-component system. For this purpose, a
restriction enzyme that only cleaves a specific recognition sequence can serve as a useful tool in
information processing. For example, in a process that receives an external signal and generates
an output signal, the restriction enzyme can be kept in an inactive state by any means (e.g., a
steric effect such as caging) until the signal input, and upon the stimulus input can be turned into
an active state to cleave a DNA substrate, releasing an amplified secondary signal. Alternatively,
it can be used as an external trigger signal. For these purposes, you should make sure that the
enzyme maintains the recognition/cleavage activity in a buffer for a different reaction system
(e.g., a DNA computing circuit). Alternatively, you can choose a restriction enzyme that seems
compatible with that buffer.

Column: Gibson Assembly


This technique was developed by Daniel Gibson at the J. Craig Venter Institute. Advanta-
geously, compared to the classical procedure based on restriction enzymes and DNA ligase,
it can join multiple DNA fragments more efficiently, in a shorter time and with fewer steps, and
does not leave additional sequences derived from restriction sites (because such additional
sequences may affect the activity, their absence is particularly important in protein expres-
sion). Specifically, it involves three enzymes as listed below (also available as a kit from NEB:
E2611S or L). Although it employs the same basic strategy (i.e. annealing of complementary
sequences and filling of any inter-strand gap) as the procedure solely based on restriction
enzymes and DNA ligase, the combination of the three enzymes allows an efficient, one-pot
reaction. Because it also allows the use of synthetic DNA fragments as starting material, this
methods is getting more popular.
T5 exonuclease: Degrades the 5’ end of the starting material (double strand DNA) to expose
shared single-stranded sequences for annealing.
Thermostable DNA polymerase (Phusion polymerase etc.): Fills a gap between annealed DNA
strands by synthesizing a complementary strand along the single-stranded region.
Taq ligase: Sews up the nick that remains after the polymerase has filled the gap between
annealed DNA strands.
Chapter 5 Techniques/Materials to Boost DNA 172

Column: Total Gene Synthesis


Thanks to recent advances in DNA synthesis techniques, high-fidelity DNA synthesis has be-
come much less expensive. Now it is practical to synthesize previously impossible long strands.
Specifically, organisms can synthesize DNA at a fidelity of 10-⁶ to 10-⁷, but chemical synthesis
also has reached a fidelity of 10-² to 10-³. For example, commercial DNA synthesis services for
up to about 200 bp are available (IDT etc.). In addition, because these chemically synthesized
DNA strands can be joined together or combined with natural enzymatic reactions, gene
synthesis services for more than 10 kbp have become popular. For these services, you send
sequence information through a website form or e-mail and get DNA of any desired sequence in
several days to weeks. As a result, instead of gene manipulation based on restriction enzymes
and ligase, you can obtain any desired DNA by just placing an order. This may be good news
for non-biologist BIOMODers.

—Hisashi Tadakuma (Kyoto University)

48. Artificial Bases


Researchers conducting chemical synthesis of DNA (see section 44) made remarkable progress
in 1980s. Researchers all over the world have been competing to develop artificial nucleic acid
bases with useful functions that are never found in nature. Japanese researchers also have
contributed much to this field. This section introduces commercially available artificial bases
with unique functions that might be applicable to your BIOMOD project.

]
Azobenzene Photoswitch [1]
This artificial base, developed by Hiroyuki Asanuma (Nagoya University) and Xingguo Liang
(Ocean University of China) et al., can photo-switch DNA double helix formation/dissociation.
Chapter 5 Techniques/Materials to Boost DNA 173

When this azobenzene residue absorbs UV light with a long wavelength (300 to 400 nm), it can
photo-isomerize from the trans form (top) to the cis form (bottom). Upon absorption of visible
light of >400 nm, it can photo-isomerize from the cis form to the trans form. The standard trans
form has a planar structure, and thus stabilizes the DNA double helix when inserted between
base pairs in DNA. The cis form has a slightly twisted, bulky structure due to steric repulsion
between the benzene rings, and thus greatly destabilizes the DNA double helix. This difference
allows the selective dissociation of a DNA double helix controlled by UV irradiation and its
Chapter 5 Techniques/Materials to Boost DNA 174

selective formation by visible light irradiation. Because the isomerization efficiency is below
100%, it may not be effective to insert only one moiety. Amazingly, multiple moieties, with one
inserted after every two bases, greatly stabilize the double helix in the trans form, and greatly
destabilize it when isomerized to the cis form. If financially feasible, you should include multiple
moieties.

Photo-Crosslinkable Artificial Base CNVK [2]


When exposed to light, this artificial base, developed by Kenzo Fujimoto et al. (JAIST), can fix a
DNA double helix through covalent bonding.

When it is incorporated into one of the strands, it can be covalently bonded to T in the comple-
mentary strand upon one second of 366 nm UV irradiation. In order to separate them, you only
Chapter 5 Techniques/Materials to Boost DNA 175

have to expose them to 312 nm light. To make the connection more efficient, the base T in the
complementary strand should be in the position facing CNVK or in a position next to that.

]
Photocleavable PC Linker [3]
Another photo-controllable artificial base is the photocleavable (PC) linker, which allows you to
cleave DNA by means of light.

p117-3.gif

When this linker is inserted in the backbone of DNA, you can readily cleave the backbone by
just exposing it to long wavelength UV (> 300 nm) from a transilluminator or portable UV lamp.
After the reaction, the ends of both fragments, i.e. the fragments 3’ and 5’ to the PC linker, have
a phosphate monoester. Accordingly, you cannot ligate the 5’ fragment to any other fragment
unless you dephosphorylate it.

]
Universal Bases dK and dP [4]
Chapter 5 Techniques/Materials to Boost DNA 176

These artificial bases have been developed to take advantage of the tautomerism of nucleic acid
bases to allow base pairing with more than one type of base.

p118-1.gif

dK is an analog of A and G, and can form a stable base pair with either of T and C. Likewise, dP
is capable of base pairing with either of A and G.

]
Artificial Base Pair isoG-isoC [5]
This world’s first artificial base pair was developed by Benner et al. (formerly of Florida Univer-
sity) as a third base pair to add to A-T and G-C pairs.
Chapter 5 Techniques/Materials to Boost DNA 177

p118-2.gif

Compared to naturally occurring G and C, the positions of the amino and carbonyl substituents
are all inverted. They are not recognized well by enzymes such as polymerases, but will suffice
to serve as a third base pair in ordinary DNA computing systems.

]
Artificial Base Pair Ds-Pa [5]
Below is another third type of base pair developed by Ichiro Hirao et al. (RIKEN).
Chapter 5 Techniques/Materials to Boost DNA 178

p118-3.gif

Surprisingly, this only relies on shape complementarity and not on hydrogen bond in base
pairing, and can still be recognized by enzymes and correctly replicated.

Column: Biotin-Streptavidin Interaction


Streptavidin is one of the proteins most commonly used in the field of biochemistry. This
protein, made of four identical subunits associated with each other, can bind to a com-
pound called biotin (vitamin B7) with extreme strength (highest in biological systems), and
is employed in many applications, e.g. in the immobilization of an enzyme. Biotin amidite,
applicable to DNA synthesis, is also commercially available and can be introduced anywhere
in a DNA strand, i.e. in the 5’ end, the 3’ end, or even the middle of it.
Chapter 5 Techniques/Materials to Boost DNA 179

{width=”1.3452088801399824in” height=”0.7768755468066492in”}
Chemical structure of biotin. It can be conjugated to a different molecule through the carboxy-
late side chain.
Biotin has many merits in DNA nanotechnology. For example, it is free of nonspecific adsorp-
tion onto mica, and can be observed as a bright 5 nm dot, which is taller than DNA. Our
laboratory is so fond of this protein that there is a proverb: “Call streptavidin if you need help”.
Chapter 5 Techniques/Materials to Boost DNA 180

{width=”0.9441666666666667in” height=”0.8616666666666667in”}
Streptavidin (bright dot) caught by closed DNA pliers.

—Akinori Kuzuya (Kansai University)

49. Non-Watson-Crick Structure


A-T and G-C pairs, which are the source of DNA complementary, are called Watson-Crick base
pairs after their discoverers. Most of naturally occurring DNA molecules exist as a double helix
formed through these Watson-Crick base pairs, but some have been reported to form a triplex
or even a quadruplex.
In Watson-Crick base pairs, the imino proton (the 1 position) of a purine base participates in base
Chapter 5 Techniques/Materials to Boost DNA 181

pairing. In “Hoogsteen” base pairs, the 2 and 7 positions participate in base pairing, wherein T
binds to an A-T pair and protonated C+ binds to a G-C pair.
Chapter 5 Techniques/Materials to Boost DNA 182

DNA Triplex
Chapter 5 Techniques/Materials to Boost DNA 183

As seen from the figure above, only pyrimidines are capable of Hoogsteen bonding with normal
Watson-Crick base (purine-pyrimidine) pairs. In line with this, naturally occurring DNA triplexes
have been discovered that are each formed by Hoogsteen bonding between a homopurine-
homopyrimidine region in a double helix (a sequence of consecutive purines or pyrimidines
in one strand, such as AAAAAAAA/TTTTTTTT) and a third homopyrimidine strand (TTTTTTTT),
which have been implicated in the regulation of gene expression.

Of the two kinds of triplets (triplet = a combination of three bases), the T-A-T triplet can form
under any conditions. The C+-G-C triplet can only form under acidic conditions, because it
requires protonated C+.
In general, the binding of the third strand is less stable than the formation of the duplex. In the
UV melting curve of such a triplex (see section 82), two-step melting is observed.
Because a DNA triplex is not very stable, you may not be willing to use it in your BIOMOD project.
Anyway, a sequence that includes consecutive purines or pyrimidines is not recommended in
DNA sequence design.

G quadruplex
Naturally occurring DNA quadruplexes [2], composed of even more strands, have also been
discovered. The most famous one is the guanine quadruplex.
Chapter 5 Techniques/Materials to Boost DNA 184

This is derived from a G quartet structure that is formed by Hoogsteen bonding among four
guanine bases surrounding a coordinated metal ion. The metal ion trapped in the center is
usually Na+ or K+ (K+ provides a stronger bond).
In general, a sequence with three or more consecutive G bases tends to make this structure.
In vivo, especially the telomere region at each end of a chromosome is enriched in G-repeat
sequences (in the case of human genome, TTAGGG is repeated over and over). It has been
proposed that possible quadruplex structure formation in this terminal single-stranded region
may contribute to the stability of the chromosome itself.
In principle, a typical guanine quadruplex is more stable than a DNA double helix. In sequence
Chapter 5 Techniques/Materials to Boost DNA 185

design, you should avoid three or more consecutive G bases. If a single strand has too many
G-repeat sequences, it might be folded in a complex manner into a stable intramolecular
quadruplex. For example, a strand with four repeats of a sequence, (TTAGGG)₄, is well known
to be folded in diverse fashions to form an intramolecular quadruplex.
Conversely, you can make use of a guanine quadruplex to make a DNA-based molecular ma-
chine that can be triggered by Na+ and/or K+. For example, it’s possible to make a single
molecule metal ion sensor, made of DNA pliers that detect Na+ and/or K+, close by introducing
a plurality of (TTAGGG)₂ into the levers in the pliers (see section 26).

i-Motif
The so-called i-motif[3] structure is another DNA quadruplex.. This structure does not involve
Hoogsteen bonding, but is derived from mismatched base pairing between C and protonated
C+.
Chapter 5 Techniques/Materials to Boost DNA 186
Chapter 5 Techniques/Materials to Boost DNA 187

One C-C+ base pair only results in a double strand. The i-motif is formed as a very unique
structure; i.e., two double strands each formed by the C-C+ base-pairing align with each other
in a head-to-tail orientation, through mutual intercalation of the hydrogen bonds in the two
double strands.
Chapter 5 Techniques/Materials to Boost DNA 188
Chapter 5 Techniques/Materials to Boost DNA 189

Like the C+-G-C triplet, this requires the protonation of C, which means that an i-motif can be
formed only under acidic conditions. Accordingly, several i-motif-based, pH-responsive DNA
molecular machines have been reported.
—Akinori Kuzuya (Kansai University)

50. Protein Motors


Protein motors are a class of proteins that can convert the chemical energy of the hydrolysis
reaction of adenosine triphosphate (ATP), which is an energy currency in vivo, to a kinetic
energy. These include myosin, which moves along actin and is involved in muscle contraction,
and kinesin, which moves along microtubules and is involved in the intracellular transport of
transport vesicles and organelles.

Myosin Structure and Function


The most important function of muscle is to generate a force to move the body. The principal
source of its contraction force is two different proteins, myosin and actin.
Myosin has a molecular weight of 520,000, which contains a head for actin binding and ATP
hydrolysis and a tail for self-assembly into a filament (FIG. 1(A)). Actin filament is a filamentous
polymer of a right-handed double helix structure, with about 13 monomers of globular (G) actin
per turn, which has a molecular weight of about 42,000 (FIG. 1(B)). G-actin is composed of four
subdomains that are different in size, conferring a polarity to actin filament. Myosin moves actin
toward the minus end to cause muscle contraction.

Actin (A) and myosin (B) filaments


Chapter 5 Techniques/Materials to Boost DNA 190

Molecular Mechanism of Muscle Contraction


The swinging crossbridge hypothesis has been proposed to explain contraction by myosin and
actin; in this hypothesis, the myosin head binds to actin and changes its angle to generate
a force. This hypothesis has been examined using X-ray crystallography, optical tweezers,
etc. After measurement with optical tweezers, the step length of myosin proved to be 9 to
13 nm. A glass needle measuring the displacement of myosin revealed that myosin moves
through successive sub-steps of about 5.3 nm, which is comparable to the diameter of the actin
monomer. Moreover, a single-molecule observation of myosin under a varying external load has
demonstrated that the step length of myosin decreases in response to an increase in the load;
i.e. it is load-dependent.

(A) The use of a biotin-streptavidin interaction to observe the displacement of a myosin head
Chapter 5 Techniques/Materials to Boost DNA 191

moving along an actin filament. (B) Step length observed by the needle. An average sub-step of
5.3 nm was observed. Reprinted with permission from Nature, 1999 [2].

Kinesin Structure and Function


Because a kinesin protein, which is necessary for in vitro motility assays, can be recombinantly
expressed in E. coli and purified, we can prepare either its pure form or a recombinant mutant.
Another characteristic of kinesin is the capability to move continuously. Once kinesin interacts
with a microtubule, it continuously moves about 100 steps. This is suitable for single-molecule
observation.
The structure of kinesin is a tetramer made of two heavy chains and two light chains. The
heavy chain has a motor domain (head) composed of an ATP-hydrolyzing subdomain and a
microtubule-binding subdomain, a stalk domain composed of coiled-coil regions, and a tail to
bind to a cargo. The rail for kinesin, i.e. microtubule, is a hollow tube made by the polymerization
of tubulin dimers, which is composed of α-tubulin and β-tubulin. A microtubule has polarity, i.e.
a plus end and a minus end defined by their polymerization rate.
Kinesin can bind a cargo to its tail, and repeat ATP hydrolysis and the association to and
dissociation from a microtubule through its motor domain to move toward the plus end,
resulting in the transport of transport vesicles and organelles.

Mechanism of Kinesin Motion


Characteristically, kinesin is capable of continuous motion, which has been demonstrated in in
vitro motility assays. Principally, there are two types of such assays. One works by immobilizing
kinesin on a glass surface and observing a microtubule moving over it (FIG. 3(A)); the other works
by immobilizing a microtubule on a glass surface and observing the movement of a kinesin-
adsorbed bead or fluorescently labeled kinesin along it (FIG. 3(B)). The latter assay has also been
combined with optical tweezers to demonstrate that kinesin moves along a microtubule with a
step length of 8 nm.
Chapter 5 Techniques/Materials to Boost DNA 192

In vitro motility assays

The “hand-over-hand” model has been proposed to explain the mechanism of kinesin moving
along a microtubule. In this model, kinesin alternately moves two heads to “walk” like a human
(FIG. 4). In the solution, each kinesin head has ADP bound to it but, upon microtubule binding,
one of the heads dissociates from ADP and binds to ATP. Then, ADP dissociation from another
head is promoted. In this manner, the two heads alternately repeat ADP dissociation, allowing
the stepwise movement of kinesin along a microtubule.
By attaching a fluorescent dye to one motor domain of kinesin and subjecting it to a single-
molecule measurement, researchers have demonstrated that the motor domain achieves an
average step length of about 17 nm and that kinesin moves hand-over-hand.
Chapter 5 Techniques/Materials to Boost DNA 193

Hand-over-hand model

—Ren Sasaki and Akira Kakugo (Hokkaido University)


Chapter 5 Techniques/Materials to Boost DNA 194

51. FRET and Molecular Beacons


Light can be useful in detecting an interaction, because it can pass through a wide range of
materials and detect the interaction state in a non-destructive manner. Fluorescence is a light
emitted by fluorescent dye, which only requires excitation light. You can choose an appropriate
one for your purpose from a variety of fluorescent molecules in the visible spectrum. A reaction
that emits light upon the interaction is useful for detecting an interaction. “FRET” and “molec-
ular beacon” techniques are often used in BIOMOD.

FRET and Molecular Beacons


FRET is short for fluorescence resonance energy transfer or Förster resonance energy transfer.
Only when the distance between two dyes is very short (about 10 nm) can the fluorescence can
be detected. Therefore, FRET is used to detect whether two substances are in proximity and/or
interacting with each other.
In principle, a molecular beacon is similar to FRET, except that one of two dyes is a quencher,
which only absorbs and does not emit light. By attaching a fluorescent dye to one end of a
DNA or RNA strand and a quencher to another end of the same strand, with the sequence
designed to be capable of intra-strand hybridization, the fluorescence is usually not observed
because the fluorescent dye and the quencher are positioned very close to each other so that the
fluorescence is quenched. However, once the molecular beacon binds to its target, the distance
between the fluorescent dye and the quencher gets longer so that the light emitted by the
fluorescent dye can be observed.

Fluorescence
Fluorescence is emitted during the process of a molecule being excited from its ground state
(S0) to an excited state (S1) and then returning to the ground state, releasing energy. This
process can be illustrated by a Jablonski diagram. The time scale of the entire phenomenon
is in nanoseconds, wherein the transition to the excited state occurs on a scale of femto (10-
¹⁵) seconds, and the emission of fluorescence occurs on a scale of nano (10-⁹) seconds. Due
to thermal vibration etc., the energy of the fluorescence is lower than that absorbed by the
fluorescent dye (i.e. the wavelength shifts to a longer wavelength), which is called a Stokes
shift. Because the properties of the fluorescence (wavelength and half-life) can be affected
by the motion of the fluorescent molecule and its interaction with its surroundings, various
fluorescence spectroscopy techniques have been developed.
Chapter 5 Techniques/Materials to Boost DNA 195

Principles of FRET
A fluorescent molecule (donor) emits energy as fluorescence and returns to the ground state.
When another fluorescent molecule (acceptor) that can receive the energy is present in close
proximity, a dipole-dipole interaction occurs between the transition dipoles, and the energy is
transferred. One result is that, even when the donor is excited, the energy is transferred to the
acceptor, which (instead of the donor) then emits light. The probability of the energy transfer
depends on the relative orientation of the donor and the acceptor. The efficiency of the transfer
is inversely proportional to the distance between them⁶. In commonly used organic fluorescent
dyes and fluorescent proteins such as GFP, energy transfer occurs at a distance of about 10 nm.
The energy transfer efficiency EFRET, which is a measure of the energy transfer, can be ex-
pressed as
Chapter 5 Techniques/Materials to Boost DNA 196

equation 1

with R being the distance between the two fluorescent dyes, ID being the fluorescence intensity
of the donor, and IA being the fluorescence intensity of the acceptor. R₀ is the distance at which
EFRET is 50% (Förster distance), calculated as follows.

equation 2

equation 3

εA(λ) : molar extinction coefficient of the acceptor at wavelength λ


FD(λ) : fluorescence intensity of the donor at wavelength λ
λ: wavelength
Chapter 5 Techniques/Materials to Boost DNA 197

κ²: orientation factor


ΦD: quantum efficiency of donor
n : refractive index of solvent
J represents the spectral overlap between donor emission and acceptor absorption, and has a
unit of [M-¹ cm³].

Principles of Molecular Beacons


In FRET, an acceptor that receives energy and converts it into heat, instead of fluorescence,
is called a dark quencher (quencher). No fluorescence is detected when the donor and the
acceptor are in close proximity to each other, but the donor fluorescence becomes detectable
when they move apart.
Usually, they are attached to both ends of a single-stranded nucleic acid (artificial nucleic acid)
of a stem loop structure. In the absence of target binding, the donor and the quencher are in
close proximity to each other, so that the donor fluorescence is not emitted. When the molecular
beacon binds to its target through a probe sequence included in the beacon (a sequence
complementary to the target sequence), the distance between the donor and the quencher
becomes longer as the hybridization proceeds, allowing the donor fluorescence to be observed.
This method can be used to detect an information state by means of fluorescence.
—Hisashi Tadakuma (Kyoto University)

52. Artificial Cell and Liposome (and Some Related


Topics)
Lipid bilayer membranes hold promise for use in DNA nanotechnology. The membrane’s closed
vesicle (microcapsule) provides a supramolecular structure called a liposome or a shell for
artificial cell [1]. In the future, molecular designs will increase in scale from single molecule to
supramolecule and even supramolecular complexes. Lipid vesicles may serve as a platform for
design, so we will outline some principles in this section.

Introduction
Unlike a programmable artificial life model in a computer, artificial life in the real world is
difficult to realize. Practically, all you can expect is to reconstitute parts of the various functions
of a cell. A simple set of molecules can be introduced into a microcapsule as a model of a cell,
to produce a so-called artificial cell [1]. A lipid membrane vesicle, i.e. liposome, is often used as
a carrier.
Chapter 5 Techniques/Materials to Boost DNA 198

Lipid Membrane
Lipid molecules are amphiphilic; each contains both hydrophilic and hydrophobic functional
groups. When dispersed in water, they assemble into a structure that hides the hydrophobic
portions inside. Different shapes of lipid molecules can produce different structures, including
a sphere (micelle), rod (micelle), film (membrane), and plate (membrane). The field of research
studying these assemblies of molecules based on non-covalent weak interactions is called
supramolecular science (chemistry). In addition to macromolecular chemistry, you should learn
about supramolecular chemistry if you want to win BIOMOD [3]. Well-known examples of
artificial lipid membranes include black film, LB membrane, water-in-oil (w/o) emulsion, and
liposomes (FIG. 1).

Liposome
A liposome is a closed vesicle made of a lipid bilayer membrane, first reported in 1964 by
Bangham. The lipid membrane is an excellent hydrophobic barrier to enclose and separate the
inside microvolume from the outside. It has a lipid bilayer structure similar to a cell membrane,
and thus its application to the biological membrane model, cosmetics, DDS (drug delivery
system [4]), and artificial cells has been extensively studied. A liposome can be classified as
SUV, LUV, MLV, or GUV depending on its size and membrane structure. There is also growing
interest in polymersomes, which are made of amphiphilic macromolecules (polymers), because
of their robustness and ease of functionalization. Since Hotani et al. succeeded in the direct
observation of GUV by dark field microscopy in 1980s, high-resolution observation techniques
using fluorescently labeled molecules to look at cell-sized liposomes have also been developed
[1]. There have been a number of attempts to install a biochemical reaction system inside a
liposome, and monitor/assess it. Thanks to advances in microprocessing technology and other
techniques, it has become much easier to prepare cell-sized liposomes of a uniform size (see
section 89).
Chapter 5 Techniques/Materials to Boost DNA 199

Artificial Cell
Attempts to assemble an imitation of a cell or even a true cell from substances, together with
questions about the origin of life and xenobiology, have produced a wide range of models,
the most famous of which are those by Traube and Oparin. Artificial cells can be used to solve
the mystery of the asymmetry of a cell or how cells accomplish one-way traffic at certain
times. It is possible to operationally clarify how the substances and their systems behave by
observing an artificial, “cell-like” environment instead of a cell. For example, researchers have
already achieved a simplified protein synthesis system [5], mimicry of Darwinian evolution by
incorporating spontaneous mutation into RNA replication [6], synthetic lipid-based coupling
of membrane replication with DNA replication [7], a mycoplasma cell with a fully synthetic
genome (dubbed as Synthia) [8], and protein synthesis by a liposome that contains E. coli
extract, which aims at manufacturing a whole cell [9]. In Japan, the Japanese Society for Cell
Synthesis Research was launched in 2005 [10].

“Synthetic Biology” in Short


Synthetic biology is a novel discipline that seeks to modify and extend the molecular system
of a cell. Although it partially overlaps with artificial cell/molecular robotics, which is usually
Chapter 5 Techniques/Materials to Boost DNA 200

focused on scratch building, it is thought to be important primarily in terms of applied science.


Synthetic biology has a large advantage in that it can rely on self-replication mechanisms of an
organism. An international student competition of synthetic biology, iGEM, is an elder sibling of
BIOMOD.

Artificial Cell and DNA Nanotechnology


A typical lipid membrane is a loose supramolecular structure in which each lipid molecule
continuously undergoes translational diffusion. If a DNA origami structure is put on the mem-
brane, it will show a two-dimensional diffusion movement similar to the movement of a raft
on the sea. The Tohoku University team won BIOMOD 2012 with a project called “Cell Gate”,
which used DNA origami to build a transmembrane channel for DNA transport from scratch
and introduced it into the cell membrane. One week after this group returned from BIOMOD,
Langecker et al. reported an origami channel in Science [10] . In 2013, the Dresden team won 2nd
place with a polymersome loaded with a prototype Cell Gate. In 2013 and 2014, the University
of Tokyo Kashiwa team reported the construction of an artificial origami receptor and discussed
its application to artificial cells. Non-student researchers have also published a series of related
papers, including a report on the implementation of DNA computing in an artificial cell system.
Artificial membrane/artificial cell systems are highly compatible with DNA nanotechnology in
nature. The next challenge will be to fuse them with living cells. A report published by S. Douglas
et al. in the first year of BIOMOD [11] has set a high hurdle. Maybe your team will be able to clear
it.

Concluding Remarks
DNA, lipid membranes, liposomes, cells, and organisms are all tangible, i.e. engineerable en-
tities, but “life” is just a word and a concept. What kind of system is needed to produce an
artificial cell that surpasses life? This is an interesting question to think about. It might be that
the engineering technology of mankind leaves much to be desired as long as artifacts belong to
a subset of life. Feynman asked how to put a toothed gear meshing with another into words.
Let’s try to create something hard to express in words, taking advantage of this engineering
opportunity [12].
—Shinichiro M. Nomura (Tohoku University)
Chapter 6 Software Techniques
53. Basic Software Programs: Overview
Because “design” of a biomolecule, the subject matter of the BIOMOD competition, is invisible
to the naked eye, you must demonstrate the results of your project in a readily understandable
manner. Your scores, therefore, are based on your Wiki, YouTube, and presentation. Your docu-
ments, tables, graphs, and presentation slides, including animations and videos, should be high
quality and effective.

Office Suite
You can use Microsoft Office to prepare all your materials (see section 3). Microsoft Office is an
office suite (a collection of business software) provided by Microsoft; the regular suite contains
Word (document preparation), Excel (spreadsheet), and PowerPoint (presentation). Another
example of an office suite is iWork, provided by Apple Inc., which includes Pages, Numbers, and
Keynote.

Necessity of Different Software Applications


However, without exception, previous winners of BIOMOD Project Award Gold (see section 5)
employed not only an office suite like Microsoft Office but also other diverse applications to
prepare their Project Website, YouTube Video, and Presentations. This is because “intuitively
understandable”, “persuasive”, and “enjoyable” contents have to include images, videos, and
2D/3D animations, and dedicated software works better to produce these materials.
Chapter 6 Software Techniques 202

To Make Images Impressive

The graphic tools in PowerPoint allow you to not only draw figures but to adjust the brightness,
contrast, and tone of an image. However, using Illustrator (graphic design software) and Photo-
shop (photo editor), provided by Adobe, will improve the visual elements of your materials (the
Chapter 6 Software Techniques 203

illustration below was drawn on Photoshop). While these two programs are payware, there are
free applications that will also work, such as Inkscape and GIMP.

(See section 56)

To Make 3D Images Intuitively Understandable

3D images are often necessary to make your Wiki, YouTube, and presentations easier to under-
stand. There are a wide variety of 3D CGI applications, but all of them involve modeling, material,
animation, and lettering, which are described below.

To Make Images More Persuasive

Image-J is a popular processing/analysis application for creating image data from wet experi-
ments. In spite of being free and open-source, it performs well and has advanced functions that
can even satisfy professional researchers. It supports a variety of file formats ranging from TIFF,
PNG, JPEG, BMP, AVI, and other basic ones, to uncommon formats. You should master it as a
powerful tool not only for use in BIOMOD but also for your graduation thesis, master’s thesis,
and other researches (see section 58).
Chapter 6 Software Techniques 204

To Make a Video Enjoyable to Watch

According to the judging criteria in previous BIOMOD competitions, the YouTube video is 25%
(see section 3) of your score. Thus, video making skills are key. The commercial software Pre-
miere Pro and the freeware programs such as Windows Movie Maker and AviUtl are introduced
below.
Chapter 6 Software Techniques 205

(See section 59)


—Takashi Nakakuki (Kyushu Institute of Technology)

54. Document/Table/Graph Preparation


This section introduces Gnuplot for graphing.

Document Preparation
When your project starts, you will discuss the schedule and roadmap (a chart that lists tasks
necessary to achieve your goals in a timeline) with other team members. In the course of the
project, you will go through many meetings to discuss the design specification, the status of
each member’s task, and so on (see section 7). You should make full use of document prepa-
ration software such as Word to make the group work more efficient and effective. In business
settings, meeting documents and proceedings are often prepared in Word and electronically
distributed. Similarly, you should make use of document preparation software as a tool for
efficient group work.
Chapter 6 Software Techniques 206

Spreadsheet
Many of you have probably used Microsoft Excel, a program that creates spreadsheets. For
BIOMOD, Excel can help manage data management/analysis, and can also output results as
tables and various graphs.
For data management, its “Data” menu provides Sort, Filter (data extraction), Remove Dupli-
cates, Subtotal, PivotTable, and other basic functions which allow the import of external data
sources. It can also help the management of your group work. For example, if the project is
divided among the team members, you can create a table of the tasks of individual members,
share the Excel file on the web, and have each member update the status of her/his task in the
table, to see the overall status of the project at a glance. Like Word, just one spreadsheet can be
a powerful tool to dramatically improve the efficiency of your group work when combined with
IT.
Chapter 6 Software Techniques 207

In relation to data analysis, the use of various Excel functions allows basic statistical analyses,
e.g. mean and variance, and more complex statistical analyses, e.g. test of a difference between
population means, nonparametric test, correlation, and multivariate analysis. There are many
textbooks on the use of Excel in statistical analysis and multivariate analysis. For examples, see
reference [1].
If you want to combine Excel functions to perform complex calculations/tasks efficiently (a
series of tasks automatically), it can be greatly facilitated by “macro” (“Tool”-“Macros” in the
menu). You can write a macro using the programming language VBA (Visual Basic for Applica-
tions) for Excel. Interested readers are referred to reference [2] and other guides.
Last but not least, tables and graphs can be easily created by selecting “Insert”-“Table”, “Graph”
in the menu. Particularly, you can choose an appropriate graph from column, line, pie, bar,
area, scatter, and other charts (stock, contour, doughnut, bubble, and radar) depending on your
purpose, which is an advantage of using the graph drawing tool of Excel.

Graph Drawing
You should also learn the software Gnuplot, which allows for efficient graph drawing. This free,
open-source software program with advanced functions can be regarded as a programming
language for 2D/3D graph drawing. Excel is more convenient in that it allows you to make a
graph by just following a wizard, but Gnuplot provides more functions and performs better at
processing a large amount of data quickly into a graph.

Column: TeX
Undergraduates science majors often write their graduation thesis on TeX. Scientific papers
in some field are also often written in TeX (needless to say, many are Microsoft Word users).
TeX is a free typesetting system, wherein the users prepare texts and images/tables (i.e.
contents), while the computer takes care of layout and other non-content elements of the
document based on class file (or style file) and other settings. Writing in TeX is rather like
making a webpage in the HTML language, and thus more cumbersome than writing in Word.
Chapter 6 Software Techniques 208

Nevertheless, many people are still using TeX all over the world, possibly because it can make
equations particularly beautiful.

—Takashi Nakakuki (Kyushu Institute of Technology)

55. Preparation of Presentation Slides


In BIOMOD, most of you will use PowerPoint to make a presentation. This section introduces
points, tips, and taboos in the use of PowerPoint to prepare slides.

Presentation
As already discussed in section 10, a presentation at BIOMOD is different from a typical confer-
ence presentation, in that you can add drama and entertainment to it. But the most important
thing is to make the content of your project understood, and to explain it clearly.
Presentation structure (example):

1. Title, team name, and affiliations


2. Background
3. Goal of the project
4. Details of the design and its operating principles
5. Experimental/simulation results
6. Conclusions
7. Future prospects and discussion

Presentation Structure

In a presentation, simply showing the results of your project is not enough. You need to empha-
size the points of your story and pace the talk effectively. For that purpose, the structure of the
presentation should also take the psychology of the audience into account. If your experiment
has been successful, then show it at the climax. If your experiment has not been successful but
has prompted a unique principle, place more emphasis on its description. In addition, arrange
the results in the best order to make them understood.
In any kind of presentation, the goals and conclusions should agree with each other. If they
don’t, the audience may not believe your argument. In addition, you can also strongly impress
the audience by talking about the future progress that can result from your results.
Chapter 6 Software Techniques 209

Dos and Don’ts in Preparing Slides


There are a number of textbooks on general presentation techniques and PowerPoint. You
should read at least some of them if you are involved in these tasks. Shown below are the most
basic points.
Good Slides:
- Title of each slide summarizes its point.
- Texts and figures are clear and easy to understand.
- Provides necessary and sufficient information.
- An appropriate proportion of time/space is allotted to each piece of information depending on
its importance.
- Sufficiently understood even in the absence of the words of the speaker.
- Text is consistent with the speaker’s words.
- All slides have a consistent format.
Bad Slides:
- The focus is unclear.
- Texts and images are too small to read.
- An important point is written in a small font.
- An elaborate figure is used to describe a less important thing.
- Terms used in the slides differ from those in the speaker’s words.
- Different slides have different formats.
Appoint a Director to Standardize Slides:
The slide work, probably done during one week after the Wiki deadlines of the domestic trial
and the final, should be divided among all of the team members. An appointed director should
standardize all the slides in terms of font, text size, coloring, icon, and many other things.
Time Allocation:
Chapter 6 Software Techniques 210

Read the speech script to time the presentation on a stopwatch. Adjust the number of slides and
the length of script for each part so that you can allocate enough time to the most important
points.
Synchronization between the speech script and the slides:
Terms used in the script have to completely match those in the slides. In addition, the flow of
the script for a slide must match the order of things displayed in that slide (from the top to the
bottom, from the left to the right).
Simple and clear slides:
Each slide should not contain too much information. Texts should be in a large font and brief. In
general, one slide should not contain more than 7 lines.
Use of consistent terms and icons:
Each thing should have one name, one icon, and one shape of the same coloring throughout
the slides. Since your group all are working, some of you might be using different names for the
same things. You need to make sure these are changed to reflect the same terminology. This is
where communication among the team members matters.
Graph Requires Text Summary:
A look at each slide should tell the viewer approximately what it means. Make sure each graph
of your results has text that summarizes what your graph indicates.
Last Slide:
This slide will be on the screen throughout the discussion time, and thus should include more
than something like “Thank you for your attention”, such as a summary of the results or a
positive message.
Movie and Visual Effects:
Unfortunately, a movie embedded in a slide sometimes fails to play at the BIOMOD final. Instead,
you should rely on animation wherever you can. Do not abuse visual effects during slide change.
Save them for where they are really needed.
Appoint a Slide Flipper:
Let the speaker concentrate on reciting the speech script, by appointing someone else to switch
the slides. The slide switcher should rehearse again and again in advance to learn where to
change slides in the script.
Chapter 6 Software Techniques 211

—Satoshi Murata (Tohoku University)

56. Figure Preparation


Graphic information is an important means of aiding understanding. Effective figures will greatly
improve the impression of your Wiki, YouTube, and presentation. If you want to prepare higher-
grade figures, you should master Illustrator (representative graphic software) and Photoshop
(photo editor).

Two Ways to Prepare a Figure


Many of you probably have already used some graphic software or paint tool to draw figures.
Windows “Paint” is one of the most popular paint tools. There are two kinds of image drawing
software, one that uses raster images and one that uses vector images. If you want to prepare
higher-grade figures, you should know what the difference is between them and how to choose
a software program(s) appropriate for your purpose. In the figure below, the letter on the left,
Chapter 6 Software Techniques 212

which is composed of “pixels”, is called a raster image. The letter on the right, which is composed
of “coordinates and lines”, is called a vector image.

Raster Images are Drawn with Pixels


A raster image is also called bitmap image. Each image is made of the information on the color
(tone) of pixels. In the case of a raster image of 640 x 480 pixels, the entire color (tone) informa-
tion of the 307,200 pixels can be directly written in an image file, which will have the extension
“BMP”. Alternatively, the information can be compressed into an image file of a smaller size,
which will have an extension such as JPG and PNG. Windows “Paint” is a representative raster
image drawing software program. Photographs taken with a digital still camera are all raster
images (guess why if you are familiar with cameras). A raster image is apt to have a large file size
and has the disadvantage that its edges become rough when magnified.
Chapter 6 Software Techniques 213

Vector Images are Drawn with “Paths” and “Fills”


The information of a vector image is basically provided by draw commands (see the right image
in the following figure). You can draw it solely based on the information of the coordinates of the
start points and the end points and the nature of the line segments between them (commands).
When draw commands for a vector image are outputted as an image file, it will have an extension
such as AI, PDF, and SVG. A vector image has a smaller file size than its raster image counterpart,
and maintains smooth edges when magnified. For example, in a vector image, a 10-fold increase
in its width will not roughen the edges because the only difference is something like the use of
the command “Draw a line from the point (0, 0) to the point (100, 0)” in place of the command
“Draw a line from the point (0, 0) to the point (10, 0)”.
Chapter 6 Software Techniques 214

Even if an image is drawn based on draw commands, it is not practical for you to input the draw
commands like a program. What you need is a software program that allows drawing actions of
the user to be performed intuitively, and automatically converts them into draw commands.

Illustrator: A Software Program for Vector Image Drawing

In a software program for vector image drawing, its “user-friendliness” is more important than
in a raster image drawing program,. The graphic software Illustrator (Adobe) is a representative
software program for the former [1].
To master Illustrator, you should learn about the terms anchor point, segment, and path (see
the figure below for an example). (1) Selecting the Pen tool and (2) clicking on the start point
will create an anchor point there. Then (3) clicking on the end point will create another anchor
point there and a segment (a straight line in this case) will appear between these two points.
To continue the drawing, (4) clicking on another point will create the next anchor point, which
will add another segment. An uninterrupted line composed of anchor points and a segment(s)
is called a path. If Illustrator is new to you, you should start from the Pen tool and proceed to
adding a new anchor between segments, connecting anchors with a smooth curve, and then
drawing a more complex curve.
Chapter 6 Software Techniques 215

{width=”4.2711122047244094in” height=”3.48125in”}

Inkscape: Free Vector Drawing Software

Illustrator is a high-performance software program that can provide a collection of advanced


functions such as those demanded by professional designers, but you have to pay for it.
Fortunately, there is the freeware Inkscape [2]. It has basic functions comparable to those of
Illustrator, and will be a powerful tool for you to prepare higher-quality figures.

Photoshop: A Software Program for Raster Image Drawing

Apart from various vector images, photographs taken with camera are a kind of raster image.
Therefore, you will also need a software program to add effects to a photograph or adjust its
contrast, brightness, and tone. Among such raster images, photographs and other complex
images are usually processed with a specific kind of software, which is called a photo editor.
Chapter 6 Software Techniques 216

Photoshop (Adobe) is a photo editor [3]. You can also scan a free-hand drawing and process it
with Photoshop to have a digital illustration as follows.

Alternatively, you can use GIMP, a freeware program with basic functions comparable to those
of Photoshop.
—Takashi Nakakuki (Kyushu Institute of Technology)

57. 3D Animation
Animation can be very effective in explaining molecular reactions and mechanisms. However,
creating an animation requires as many as thousands of images and thus enormous labor. How-
ever, you can create it relatively easily with the help of three-dimensional computer graphics (3D
CGI).

Modeling
First, this section describes the implementation of a moving model (modeling) for animation.
In 3D CGI, a human character or any other object is composed of vertices, edges, and faces,
which is called a polygon. There are three modeling methodologies: polygonal modeling, which
directly edits a polygon, curved surface modeling, which indirectly edits a polygon using Bezier
and NURBS curves, and sculpt modeling, which creates an object like clay working. Polygonal
modeling is the most basic, whereas curved surface modeling is suitable for symmetric struc-
tures such as cylinders and sculpt modeling is good for making complex structures with many
Chapter 6 Software Techniques 217

bumps. Increasing the number of polygons (high poly) means you can model a more complex
structure, but this also requires more of your PC.

Material
You can change the color, material, and reflectivity of your model by changing material settings.
Texture settings are used to add surface patterns, smaller features, and other details. A texture
that carries color and similar patterns is called a color texture, and a texture with the information
on features drawn on it is called a normal map texture. Although these textures can be created
using a graphics editor, they can be downloaded from the CG Textures site and other similar
websites, too. To use a texture, a process of projecting the texture onto your model (UV mapping)
is required.
Chapter 6 Software Techniques 218

UV mapping

Animation
In 3D CGI animation, a motion is made by registering a pose of a model for each time point and
interpolating the poses. An animation made by registering the poses, or “setting key frames”,
is called a key frame animation. You can use Graph Editor to perform the motion interpolation
in detail. To move a human, animal, or any other multi-jointed model, bones (a skeleton) are
used. Each bone is subjected to “skinning” to associate it with the polygon of the model, which
allows a move of the bones to change the morphology of the polygon. Bones (a skeleton)
have a hierarchical structure: a structure in which a child follows the motion of its parent is
called FK (forward kinematics), and a structure in which the motion of a child determines the
position of its parent is called IK (inverse kinematics). In the motion of bones of hands, legs, and
other object-contacting parts, IK is usually employed. Although FK provides a higher degree of
freedom of motion, the use of IK in walking, pushing an object, and similar motions can improve
your animation. An animation is usually created by hand drawing, but it is also possible by
motion capture to generate the motion of a CG model from that of a real person or object, or
by baking physical computing results to key frames. Using these techniques, you can create an
animation that is too complex to input by hand.
Chapter 6 Software Techniques 219

Model (left) and skeleton (right).

Key frame animation.


Chapter 6 Software Techniques 220

Rendering
In rendering, a light source(s) is set to illuminate a model and an image is created based on the
lighting and camera settings. The overall quality of your animation is greatly affected by the
settings of rendering. Lighting usually includes key light, fill light, and back light (“three-point
lighting”). In rendering, you can use ray tracing to add reflection and refraction by tracing the
path of a light to reach the camera, global illumination to compute lights reflected on an object
(indirect light), ambient occlusion to add 3d appearance by shading gaps and hollows in the
model, and other functions to obtain high-quality graphics comparable to live action films.
2D animation and similar graphics can also be rendered by emphasizing the edges of a model,
binarizing the shades, and other techniques (cel shading).

Closing Remarks
The above are just a taste of 3D CGI creation. You can also render a fluid, the destruction of an
object, and so on by employing physical computing, Boolean, and other functions. Commonly, it
Chapter 6 Software Techniques 221

is more practical to reduce the rendering time required for retake by outputting the background
and the model in your movie separately. Output them as numbered images that support
transparency (e.g. in a PNG format), which will be combined later using a video editor. This can
also be applied to combining them with a real background.
Useful Sites for 3D CGI
- Metasequoia (modeling software): http://metaseq.net/jp/
- MAKEHUMAN (human character making software): http://www. makehuman.org/
- Sculptris (sculpt modeling software): http://oakcorp.net/zbrush/sculptris/index.php
- Maya (3D CGI software): http://www.autodesk.com/education/free-software/maya
- Blender (3D CGI software): http://blender.jp/
- BLEND SWAP (free models for Blender): http://www.blendswap.com/
- MOLECULAR FLIPBOOK (molecular animation software): http://www.coolhunting.com/tech/molecular-
flipbook
- CG textures (texture download): http://www.cgtextures.com/
—Masaru Tsuzawa and Ibuki Kawamata (Tohoku University)

58. Image Processing


There are two kinds of graphics for BIOMOD, namely figures and raw images for data extraction.
This section outlines the process to extract data from an experimentally obtained image. You
should try to obtain high-quality data by processing a digital image in a highly reproducible
manner using reliable software.

What is Image Processing?


Image processing extracts any desired information from a given image. The world’s earliest
drawing may be the wall paintings at Lascaux Cave, which include a line drawing of a colored
bison. At present, most images are digital data, any of which you can copy easily and perfectly
and treat as a set of pixels (dots), each with a discrete brightness, on a computer. You can
not only enjoy their patterns and colors but also quantify data contained therein (e.g., length,
brightness distribution, particle size distribution, and complexity of closed curve). The process
is outlined below.

Different Types of Images


As a result of a given experiment, you will often obtain a digital image from electrophoresis,
microscopy (optical, fluorescence, atomic force, and electron), or the like.
There are essentially two kinds of graphics for BIOMOD: figures and raw images for data
extraction. An image for data extraction can be processed into a figure, but not vice versa. Note
that good data comes from a good image.
Chapter 6 Software Techniques 222

There are a variety of formats digitized image data can take, and popular ones have an extension
like .JPG, .PNG, .TIFF, or .BMP. In addition, each software program may have its own format.
Uncompressed TIFF format is the de facto standard for images for data processing.

Pre-processing is Critical
In a thriller TV or movie, you may have seen a blurred face of a suspect getting sharpened
through image processing. However, image processing does not always go like that. Of course,
you can use macros to automate data processing. Extraction from an image with too much
noise will be very difficult to do. There are a variety of noise reduction algorithms (for example,
a Gaussian filter, a median filter, or a two-dimensional Fourier transform, a window function,
and then an inverse Fourier transform [*]). It is the job of the experimenter who has actually
obtained an image from an experiment to process the image into an automatically analyzable
state for data extraction.

Examples of Image Processing


Image-J is one of the most popular image-processing freeware programs in the field of science
and technology [1]. Free, useful macros have been developed and published by its users all over
the world.

ImageJ, in spite of its somewhat antique UI, is compatible with the latest OSs. Below is an
exemplary process of particle analysis using it.
Original image -> store in a TIFF format -> import into ImageJ: file information is provided at the
top left, such as 800 x 600 pixels; RGB; 1.8 MB.
Chapter 6 Software Techniques 223

Scaling: for example, draw a line over the 20 μm scale bar by hand, and select Analyze -> SetScale
to display Distance in pixels: 45 -> then enter Known distance: 20 and Unit of Length: μm.
Binarization: Process -> Binary -> Make Binary. Then confirm that you see black particles on a
white background. Otherwise, invert it by Select All -> Edit -> Invert.
Chapter 6 Software Techniques 224

Particle analysis: Analyse -> Analyse Particles -> input counter conditions (see below for an
example) -> OK

to start counting.
Chapter 6 Software Techniques 225

This example includes 27 particles, with the area of each particle displayed. It can be outputted
into Excel.

Instead of ImageJ, you can also use the Adobe Photoshop and the freeware GIMP [4]. For
scientific purposes, ImageJ and its wide range of macros will suffice. See reference [5] for details
and sections 56, 57, and 59 for CG/image preparation.

Universal Rules
Quality requirements may differ depending on your purpose, but will usually include no debris
(only the original experimenter can decide if something in an image is truly debris), good
contrast, less noise, good size of the object in the image, high resolution, and good tone levels.
Always try to obtain a good image that facilitates image processing.
—Shinichiro M. Nomura (Tohoku University)
Chapter 6 Software Techniques 226

59. Video Editing


In BIOMOD, each team is required to make a Youtube video, which is worth up to 25 points.
There is no single best way to make it. This section introduces some software-based editing
techniques, referring to videos of previous participants.

Overview
Once you have set the policy on how to make your video and have the scenario and casts set (see
section 9), you actually start making the video. In general, video editing proceeds by collecting
movie, audio, and other data, editing and adding effects to them, and then setting the timeline.
As the video is nearing completion, show it to other team members and get feedback to revise
it again and again. Usually, these different steps need different software programs.
Necessary data include movies, sounds, BGM, and subtitles. The movies are prepared by CG (see
section 57) or live action. The TITECH team 2011 won second place in the Youtube category with
a movie that combined PowerPoint animation and live action. The Tohoku University team 2012
won first place on the Youtube Award with a movie that combined live action and CG. You may
also add complex effects to your movie using the software program After Effects [1] or the like.
You can record sounds and BGM, or you can use found materials. Sound editors are not
discussed here. The Arizona State University team 2014 won second place for Youtube Award
using melodies that they composed. In addition to these examples, you should refer to other
highly evaluated videos in previous BIOMOD competitions (see section 5).

Video Editing
When it comes time to do the video editing, you will further process the collected movie and
audio data, copy and paste them, and arrange them in place. There are various applications that
can assist in these processes, and you can choose the one you like. An exemplary commercial
application is Premiere Pro [2], while Windows Movie Maker [3] and AviUtl [4] are freeware. Their
execution screens are shown in FIG. 1.
Chapter 6 Software Techniques 227
Chapter 6 Software Techniques 228

In any of these software programs, timeline editing is a central part of the task. Timeline specifies
the timing of shots, sounds, effects, and subtitles. As an example, FIG. 2 shows a timeline of
about 10 seconds on AviUtl, made by the Tohoku University team in 2012.
The timeline proceeds to the right. Vertically, layers are stacked that specify materials used for
each time point. Blue indicates a picture, including subtitle. Red indicates a sound or BGM, and
green indicates an effect at a scene change.
As seen from the above, each scene of the video will be made by combining multiple types of
information, where your editing skills matter. Do it carefully, referring to the preview window.

Aviutl timeline
Chapter 6 Software Techniques 229

Output and Upload of Video


Once the timeline is complete, then you can encode it to output as a video. Encoding is very
complicated and not discussed in detail here. In principle, you will refer to Youtube settings [5]
to perform encoding using a software program. As specified by the link, you should care about
resolution, aspect ratio, frame rate, bit rate, container format, and codec. Play the resulting
video and go through every scene to see if there is any problem, e.g. distorted picture, sound
too large, or sound too small. Note that the encoding can take a long time.
After the video is complete, you can upload it to Youtube. You can use the annotation function
of Youtube to embed a link in the video. A link to the website of your team (see section 8) will be
effective.
—Ibuki Kawamata (Tohoku University)

60. Numerical Analysis Library


Scilab is a freely available, beginner-friendly, high-level programming language for numerical
analysis, visualization, signal processing, and data analysis. It is a powerful tool you can use for
BIOMOD for experimental data analysis and numerical simulation.
C Language is Versatile but not Always the Best
For numerical calculation or for data analysis, homework, or a graduation thesis, you can use C
language to write a program(s) sufficient for that purpose. The greatest advantage of C language
is its execution speed; it can quickly process even a large-scale numerical calculation with the
help of memory allocation and pointers. C language has can do a lot of things, but is not the best
choice for every calculation. Typical cases are shown below.

Case 1: Calculation of the eigenvalues of a matrix


You have to learn linear algebra and code an algorithm to calculate the eigenvalues of a matrix
of n rows x n columns. Therefore, debugging will also take a considerable amount of time.
Alternatively, you have to find a matrix calculation library for C language, install it, and use call
functions in the library from your source code. Of course, the library must be reliable (one you
have happened to find on the Internet will not be safe to use in various senses).
Typical Case in BIOMOD
In BIOMOD, you will often have to numerically simulate a DNA reaction to validate its operating
principles and examine experimental data.
Chapter 6 Software Techniques 230

Case 2: Simulation of the time course of DNA concentration


This involves two problems for a programmer to solve: i.e., “numerical solution to the initial
value problem of an ordinary differential equation” and “visualization of the results”. For the
numerical solution, you will have to find a numerical integration library, which is similar to the
Case 1 situation. For the visualization, a convenient method is to output a file of simulation
results, import the file into Excel, Gnuplot, or the like (see section 54), and draw a graph.
However, there an easier method.

Scilab
Scilab can do numerical analysis, visualization, signal processing, and data analysis, and has
the following features:
Freeware
This is particularly important in BIOMOD. You can download it from the Scilab website [1] and
install it on your personal computer. It works with Linux, Mac, and Windows. There are a number
of Japanese textbooks on it.

Provided with a large numerical analysis library


This frees programmers from the problem of how to get a numerical analysis library. You only
have to install Scilab to get numerical analysis functions (solvers) for linear algebra, numerical
integration, data analysis, signal analysis, and so on. That is, all you have to do is learn “how to
use each function”. For example, in Case 1, define a matrix A and then use spec(A), which will
return the eigenvalues of the matrix A. In Case 2, use the numerical integration function code.
Easily creates 2D/3D graphs
This means that, like Gnuplot (see section 54), Scilab has advanced graph drawing functions
that allow beautiful 2D/3D graphics.
Chapter 6 Software Techniques 231

Scilab is a programming language that allows you to carry out a series of tasks such as data
import, numerical analysis, and graph drawing in a seamless manner, has a grammar easier to
understand than that of C language, and does not require programmers to worry about data
type and memory allocation (students in my laboratory mastered its use for basic operations in
a few days).

Demo: Simulation of DNA Hybridization


An example of programming for Case 2 is described below. See the hybridization reaction
between a single strand DNA (X) and a complementary DNA (Y) in the following figure, and
simulate the increase in the concentration of a double strand DNA (Z) over time.

The time course of the concentration of DNA can be expressed by the following differential
equations based on reaction kinetics (see section 39).
Chapter 6 Software Techniques 232

The figure below is the start-up window of Scilab. Type the following program line by line in the
left console. Here, the initial concentration of DNA is X(0) = 10nM, Y(0) = 20nM, and Z(0) = 0nM.
“//” in the program marks a comment. Its grammar is not discussed here, but is easy to learn if
you know the definition and use of vector.

DNA0 = [10 ; 20 ; 0]; // initial concentration


t = 0 : 0.1 : 20; //start time: interval: end time [sec]
kf = 0.1; kr = 0.01; //association constant, dissociation constant
// Definition of differential equations
function xdot = diffEqs(t, x);
xdot(1) = -kf * x(1) * x(2) + kr * x(3);
xdot(2) = -kf * x(1) * x(2) + kr * x(3);
Chapter 6 Software Techniques 233

xdot(3) = kf * x(1) * x(2) - kr * x(3);


endfunction
// Solve differential equations
DNA = ode (“stiff”, DNA0, 0, t, diffEqs);
// graph drawing
plot ( t, DNA(1,:), t, DNA(2,:), t, DNA(3,:) );

The axes in the graph are appropriately scaled, and the DNA concentration curves are differently
colored.
Chapter 6 Software Techniques 234

Column: Statistical Analysis Software R


Experimental data are often subjected to some statistical analysis. Basic statistics values can
be calculated in Excel. If you need more complex data analysis, you should use the statistical
analysis software R, also called the R language. In spite of being downloadable for free,
it includes a large numerical library for statistical analysis, which is comparable to that of
commercial statistical analysis software. It is also compatible with parallel processing with
a PC cluster system to analyze big data, and thus widely used in universities and research
institutes all over the world.
Official R Project HP: http://www.r-project.org/index.html

—Takashi Nakakuki (Kyushu Institute of Technology)

61. Software for Molecular Design of DNA


In order to design a system composed of DNA molecules, you have to know about the physical
and chemical properties of DNA and be able to use them. When you want to design a complex
or large-scale system, use software-assisted design. This section outlines relevant software
programs and a strategy to use them for appropriate purposes.

Software-Assisted Design
Various systems have been developed based on the geometrical, thermodynamic, and reaction
kinetics properties of DNA molecules. As the scale of such systems become larger and larger,
risk of errors that cannot be foreseen by an empirical design methodology grow.
To circumvent such problems, use a software-assisted design program. In relation to the design
of DNA molecular systems, a basic strategy to apply different software programs to different
purposes is introduced in terms of geometry, dynamics, thermodynamics, and reaction kinetics.

Geometric Properties
Geometric properties of DNA double helix (see section 17) have been used to assemble two/three-
dimensional structures. You must consider the phase, pitch, and interatomic distance of a helix
when designing DNA systems. Applicable software programs include Namot, which can model
all atoms in DNA, Nanoengineer-1, specialized for three-dimensional modeling of double helix
structures, and caDNAno to design DNA origami structures based on crossovers (FIG. 1). You
must choose the right one depending on the size and features of your system (see section 66).
Chapter 6 Software Techniques 235

Structure design software (from the left, Namot, Nanoengineer-1, and caDNAno).

Dynamical Properties
By predicting the behavior of a DNA structure in a solution, you can assess its structural stability
and motility. For an all-atom model, an existing molecular dynamics software program, such
as NAMD, is effective. A structure file generated by caDNAno can be used in stability analysis
by CanDo and coarse-grained molecular dynamics simulation by oxDNA (see section 65). In any
instance, the Brownian motion of a DNA structure in a solution can be predicted (FIG. 2).

Dynamics simulation (from the left, results of NAMD, CanDo, and oxDNA).

Thermodynamic Properties
In order to design a DNA molecular system, it is important to predict the hybridization between
sequences, i.e. the secondary structure of DNA, rather than its three-dimensional structure. It is
also possible to predict the thermodynamic free energy and Tm from the sequence information
(see section 41), which can help to determine if an expected or unexpected secondary structures
will form. Such a measure can also be applied to an inverse problem, that is, to design a
sequence that can form the desired secondary structure [1].
Chapter 6 Software Techniques 236

Suitable software programs are NUPACK and DINAmelt (see section 62). Both allow secondary
structure and free energy prediction. NUPACK is good at designing a simple sequence and pre-
dicting the secondary structure of multiple DNA strands, while DINAmelt is good at predicting
Tm and heat capacity (FIG. 3).
When you want to design a set of DNA sequences for DNA computing or tile assembly, you should
use a software program specialized for that purpose, e.g. DNA Design, to exclude the possibility
of their hybridization in any undesired combination (see section 63).

Prediction of thermodynamic properties (from the top, NUPACK and DINAmelt).


Chapter 6 Software Techniques 237

Reaction Kinetics Properties


In DNA computing, you will need to predict the time course of the concentration of molecules
in a solution, rather than look at individual DNA molecules. Specifically, an ordinary differential
equation(s) is formulated based on a chemical reaction equation(s), and subjected to numer-
ical analysis. The differential equation(s) itself is written by a program or by using simplified
notation.
In the design of a system driven by strand displacement reaction (see section 42), the software
DSD (see section 67) is effective (FIG. 4). For a Toolbox reaction system based on enzymatic
reaction, DACCAD is effective (see section 33). In any case, neither is applicable to every DNA
structure. Thus a numerical analysis library of general purpose, such as that of Scilab, is required
in most simulations [2,3].
Chapter 6 Software Techniques 238

Reaction kinetics simulation on DSD

—Ibuki Kawamata (Tohoku University)

62. DNA Secondary Structure Prediction


DNA secondary structure prediction can predict a stable secondary structure of DNA based on
its base sequence. This section introduces NUPACK and DINAMelt, which are programs used for
secondary structure prediction. Both can work online, so you should try them at least once.
Chapter 6 Software Techniques 239

NUPACK
NUPACK is a program developed by Dr. Robert Dirks (Caltech). For an input DNA (RNA) base
sequence, it outputs all possible secondary structure(s) in a solution, i.e. secondary structure(s)
that are stable in terms of free energy. Below is the list of the types of information submitted to
NUPACK and information returned to the user.
[Input information]
RNA or DNA?
Reaction temperature
Number of DNA strands
Maximum number of DNA strands to form a secondary structure
Base sequence and concentration of each DNA
Concentration of Na+ and Mg²+
[Output information]
Stable secondary structure(s) and its concentration in the solution
Melting profile
Probability of base pairing for each base
Advantageously, it can operate online, and has an intuitive interface and a demo program. You
should try it at least once (http://www.nupack.org/).
The figure below is a screen shot of the results of inputting an exemplary DNA base sequence,
“GGTATGCGATGAGCACACCCGCGGGTTTCGCGCGT”. According to the figure, DNA with this base
sequence is most stable in a solution when it forms a structure composed of hairpin loops and
internal loops, with a minimum free energy (see section 42) of -12.27 kcal/mol.
Chapter 6 Software Techniques 240

In addition, NUPACK can design a base sequence that can form a desired structure, which is also
available online (see section 64).

DINAMelt
DINAMelt, developed by Dr. Nicholas R. Markham (Rensselaer Polytechnic Institute), is a web-
based program with functions similar to those of NUPACK. NUPACK and DINAMelt use essentially
the same structure prediction algorithm, which is an extended version of the algorithm first de-
veloped by Dr. Zuker in 1981. Moreover, both programs use free energy parameters calculated by
Chapter 6 Software Techniques 241

Dr. Santa Lucia, which are required for secondary structure prediction, so that their secondary
structure prediction results will be essentially the same.
However, only DINAMelt is capable of detailed thermodynamic analysis. In the figure below, two
different DNA strands are reacted, and the concentration change of reaction species is plotted
against temperature.
In addition to a stable secondary structure and the free energy for it, the enthalpy, entropy,
melting temperature, and heat capacity are also calculated.

In contrast, what NUPACK can do but DINAMelt cannot is to “predict the secondary structure
made of three or more DNA strands” and “design the base sequence of DNA”. Your purpose will
unequivocally determine which program to choose.

Column: BLAST
First of all, DNA is the design of life. In order to study the genetic nature of DNA, we sometimes
want to know the similarity between two base sequences. BLAST (Basic Local Alignment
Search Tool) is a popular program in bioinformatics. BLAST can receive a DNA base sequence
and search for and return base sequences similar to it from huge databases.
Chapter 6 Software Techniques 242

For example, you can readily investigate if a human has any gene similar to a gene in monkey,
at http://www.ncbi.nlm.nih.gov/BLAST/.
Typical gene databases are too huge to compare sequences one by one in a reasonable
amount of time. Therefore, BLAST initially checks similarity by an easy and quick method to
exclude unpromising sequences from candidates. Such improvements have achieved a great
reduction in calculation time. In software programs to predict the behavior and properties of
biomolecules, both prediction accuracy and prediction time are very important factors. BLAST,
NUPACK, and DINAMelt, introduced in this section, all meet both of the requirements, and thus
are actively used by many.
The time required for DNA secondary structure prediction is proportional to the third power of
the base sequence length. For example, a 10-fold increase in base sequence length will result
in a 1,000-fold increase in computing time. Keep this in mind when you want to use NUPACK
or DINAMelt.

—Fumiaki Tanaka (AIST)

63. Sequence Design


For any DNA sequence, its secondary structure can be algorithmically predicted. Sequence
design is the inverse problem. You put constraints on the secondary structure and sequence,
and search for a sequence(s) that satisfies the condition. A random sequence(s) may suffice
when you want to assemble a simple secondary structure, but an increase in the number
of conditions will be extremely problematic. This section introduces guides and software for
sequence design.
Secondary Structure Prediction and Sequence Design
Secondary structure prediction is useful for predicting the domains in a given DNA sequence
that will hybridize to each other to form the most stable structure (see section 43). Efficient
prediction algorithms are already known, and several software programs are available (see
section 62).
In order to design a DNA tile structure (see section 20) or a system driven by strand displacement
reaction (see section 42), you have to solve its inverse problem—that is, you have to find a base
sequence that will form a predetermined secondary structure.
A simple solution is to give a random sequence to each domain. Once the sequence is given, its
complementary strand is automatically defined. If these sequences are subjected to secondary
structure prediction, the desired structure will be obtained.
It may seem a simple problem, but such random sequences might cause pairing between wrong
domains, resulting in a meta-stable by-product(s). Such a by-product can impair structure
formation and state transition.
Chapter 6 Software Techniques 243

To avoid this problem, we can use known orthogonal sequences [1]. These include 37 different
DNA sequences, each 23 bases long, which have been reported not to interfere with each other
in an unexpected manner.
Sequence Design Software
However, the length of the number and the kind of bases will differ between designs, and each
design will need a sequence(s) that meets its conditions. Here, sequence design software is
effective. It has been developed by many groups, but there is no de facto standard software
program because different sequence design problems have different conditions in detail. Some
software programs are introduced below.
Design Function of NUPACK
The online secondary structure prediction program NUPACK [2,3] has a sequence design func-
tion. Access the NUPACK website and select the Design tab. The succeeding steps are described
below, using a demo.
First, set the parameters at the top. For example, Nucleic acid type: DNA, Temperature: 26°C,
and Number of designs: 2. Then write a desired secondary structure in the Target structure box
by the dot-bracket-plus notation. In this notation, as shown in FIG. 1, “.” represents an unpaired
base, corresponding left and right brackets represent a hydrogen-bonded base pair, and “+”
represents a DNA nick.
Chapter 6 Software Techniques 244

Dot-bracket-plus notation

Then, push Design at the bottom right and wait while the sequence design is executed and
base sequence sets are outputted as many as the number of designs set in FIG. 2. FIG. 2 shows
sequence design results for the secondary structure of the stick figure. The higher a set is placed,
the better sequence(s) it has.
Chapter 6 Software Techniques 245

Sequence design results

Push “To Analysis” on the right side of the designed sequences to subject the corresponding
sequence to secondary structure analysis. You can check if the expected secondary structure is
formed. If any unexpected structure is formed, you have to change the conditions and structure
and repeat the sequence design.
You can add a variety of constraints in the Target structure box (see NUPACK Help for details).
For example, you can give a predetermined sequence to a portion of the structure, or you can
specify a sequence to be excluded.
DNA Design for Tile Design
A DNA Design toolbox [4] is useful for designing DNA tiles. This tool is written in MATLAB. A
base sequence is written as a string of alphabetical letters, and a Watson-Crick pairing region is
represented by a string of numbers. Each base to be designed is represented by an alphabetical
letter other than ATGC, e.g. N. Although you need some knowledge of programming, you can
design any combination of sequences once you get familiar with it.
Software for Combinatorial Optimization
To perform design and avoid unexpected structures, you need a sequence design program that
takes sequence homology and the free energy of secondary structure (see section 41) into
account [5,6]. Under conditions with various trade-offs, it searches for a sequence of the best
combination.
A few software programs for this are available online. One example is CircDesigNA [7], which
provides sequence design based on energy calculation. This software has the interface shown
at the top in FIG. 3, through which the user inputs the letter string that describes a desired
Chapter 6 Software Techniques 246

secondary structure.
Tohoku University has also developed the sequence design software Sequence Design based
on references [5,6]. This sequence design software for DNA computing was employed by the
Tohoku University team in 2014. It has an interface as shown at the bottom in FIG. 3, and
describes the required DNA with a string of alphabetical letters. Contact [8] to use it.

CircDesigNA and Sequence Design

—Ibuki Kawamata (Tohoku University)


Chapter 6 Software Techniques 247

64. Design of DNA Origami


One characteristic of DNA nanostructures is that their three-dimensional structures can be
predicted on a computer. This is a great advantage in molecular design, as opposed to the
tertiary structure of a protein that is not easiy predictable from its primary structure. There are
several design software programs for this purpose. This section introduces caDNAno developed
by Shawn Douglas, a founder of BIOMOD, and colleagues.

Cadnano
This freeware can design two-dimensional/three-dimensional structures on a GUI basis. The
latest is ver2.2.0 (as of March 2015).

Installation of caDNAno

Access the HP to download necessary files (http://cadnano.org/). For Mac, Maya2012 has to be
installed in advance. For Windows, Maya2012 and Python2.7.2. have to be installed in advance.
These can be installed even in a laptop and tablet and work fine for design purposes. However,
you may sometimes need the large memory and graphics board of a desktop when you want to
display a three-dimensional structure on Maya.

How to Use caDNAno

The start-up window of caDNAno is composed of two areas. The left one displays the top view of
a DNA nanostructure, the right displays the DNA nanostructure projected to the plane and seen
from the side.
Select the “Honeycomb” or “Square” button at the top left. When a three-dimensional structure
is desired, “Honeycomb” is usually selected. In Honeycomb, each designed structure has a
base unit of 7 bases (a rotation of 240˚) (Douglas et al. Nature 2009). In Square, each designed
structure has a base unit of 8 bases (0.75 turn = a rotation of 270˚. 10.67 bp/turn) (Rothemund
Nature 2006). Note that an actual B-form DNA has a pitch of 10.4 bp/turn, which is different from
the designed value in Square, resulting in a slightly twisted structure. Thus you have to eliminate
one base every 48 bases (Woo & Rothemund NatChem 2011).
Numbered cylinders are created in the order you click. Neighboring cylinders do not need to
have serial numbers, but serial numbers will be easier for beginners to handle. The coordinates
of the mouse pointer (cylinder number and base number) are shown at the bottom left.
Chapter 6 Software Techniques 248

Each cylinder is made of two rows. Use the upper row to draw a line rightward, and the lower
row to draw a line leftward. The square in each arrow represents the 5’ end, and the arrowhead
represents the 3’ end. This line represents the scaffold. When you use “select” or the pen,
focus your interest (for example, only on the scaffold and endpoints (red circle) to facilitate the
operation.
Chapter 6 Software Techniques 249

After drawing lines for cylinders, connect the cylinders (lines). When you click approximately
where you want to introduce a connection, a number and square brackets will appear. The
number is the destination cylinder number. Click the square brackets to connect the lines and
form a crossover. Select and delete the remainder by the delete key. If you want to extend a
scaffold, select it and drag & drop the square or arrowhead in the arrow. If you want to move a
crossover, turn “(X)overs” selectable by “Selectable” at the center of the upper toolbar before
that.
Chapter 6 Software Techniques 250

After connecting all the scaffolds by one stroke drawing (if you want to connect places other
than corner brackets, do it by force using the Pen tool), add staple strands. Push “AutoStaple” to
automatically arrange them, and then correct them by hand, one by one. This correction is for
their length and crossover positions, and is empirical for most part. Each staple to be corrected
is represented by a bold line. Through cutting, length change, or rearrangement, adjust each
staple to a practical length.
Chapter 6 Software Techniques 251

Subsequently, assign specific sequences. When you push the “Seq” button at the bottom right,
scaffolds are suggested. Choose appropriate ones, and the sequences will be automatically
assigned. Unless you want something different, make sure that neither too many nor too few
bases are assigned to each scaffold in this step.
You can output the sequence in csv format by pushing the “Export” button at the top left. If
the sequence is a string containing “?”, that staple is not paired with the scaffold. Unless it is
deliberate, you have to adjust the length of the staple and/or the scaffold, and then repeat “Seq”.
You can also use “Paint” to display different staples in different colors, one base insertion
(“Insert”), one base deletion (“Skip”), and other buttons, as appropriate.

Check of Structure Stability

You can use CanDo developed by an MIT group (http://cando-dna-origami.org/). Soon after
the upload of a sequence, it will return the calculation results. Because this program has
considerable constraints, you should not expect too much from it.

Ordering Custom Sequences

You can order custom sequences from any oligonucleotide synthesis service company. You will
receive them in about 4 business days, in separate tubes or in a multiwell plate format.
—Hisashi Tadakuma (Kyoto University)
Chapter 6 Software Techniques 252

65. Simulation of DNA Origami


It’s too expensive to examine the actual structure of a designed DNA origami by a chemical
experiment. Here, finite element simulation and coarse-grained molecular dynamics simulation
are effective. They can predict any twist from a desired structure, or deliberately design a twisted
structure.
Input File in caDNAno Format
This section introduces two simulators for DNA origami, namely CanDo [1] and oxDNA [2]. Either
can accept a json file that has been outputted by caDNAno, a DNA origami design tool (see
section 64), as input.
By way of example, imagine a simulation of a single-layer two-dimensional DNA origami struc-
ture on caDNAno, as shown in FIG. 1. Which of the front face and the back face will look upward
when the structure is adsorbed by a mica substrate (see section 92)? Does it have any instability
coming from the design?

DNA origami structure on caDNAno

CanDo is Based on Finite Element Method


The online software CanDo (FIG. 2) performs finite element simulation, which regards each
base pair as one element. The input to be uploaded is a json file, and the output is an image.
Chapter 6 Software Techniques 253

Alternatively, it can output a video or pdb file (see section 66). The simulation process is roughly
as follows.

In principle, parameters do not need to be changed from their default values. Select Square
or Honeycomb for “Lattice type” as appropriate. If you want a video and/or pdb file, set corre-
sponding options. Note that it may take several hours to days to get the result for a submitted
file because requests submitted will be processed in the order they are received.
The simulation results are shown in FIG. 2. The output is a structure that is stable in a solution,
with the color indicating the degree of thermal vibration. In the video, you can see it is actually
undergoing thermal vibration. In the exemplary structure, a small difference in phase between
the double helices accumulates, resulting in a slightly twisted structure. From the structural
characteristics, it is likely to be adsorbed with the back face up. When observed under AFM (see
section 18), about 60% of the structures actually had their back face up.
Chapter 6 Software Techniques 254

The top page and output images of CanDo

oxDNA for Coarse-Grained Molecular Dynamics Simulation


The software oxDNA allows for more detailed analysis and performs coarse-grained molecular
dynamics simulation. In its model, each nucleotide (see section 40) is represented by three
particles, providing an understanding of dynamic motions at the nucleotide level. oxDNAcan
not only simulate origami structures but also strand displacement reactions (see section 42)
and so on [5].
oxDNA is not distributed in an executable format and requires compiling. It has no graphical
interface, and you have to execute programs through a console, imposing a higher barrier to
introduction than CanDo. Its input file is in a unique format, but it includes a script for conversion
from a json file. Its output is the time course of the coordinates of each particle and the like,
which can be rendered as a video using VMD [3] or Chimera [4]. Below is an exemplary command
set to execute a simulation.
Chapter 6 Software Techniques 255

The first line converts a json file into an appropriate file. The second line is for relaxation, and
the third line executes the simulation. Temperature, time interval, and other conditions are
described by “input_relax” and “input”, for which sample files attached to the software can
be modified before use. The fourth line converts the output file into a different file format
to visualize it. Note that a standard personal computer will need several hours to simulate a
phenomenon that takes nanoseconds.
The visualized simulation results are shown in FIG. 3. The formation of a stable structure through
thermal vibration, with cracks and distortions in the structure, can be observed. By writing a
script to analyze it, the time course of the distance between any two points can be tracked.

Execution results of oxDNA


Chapter 6 Software Techniques 256

Uses of Simulation
Because simulation can provide a kind of information that you would never obtain through
designing DNA origami using a classical methodology, you should do this early in your project so
that you can create a more detailed design. In addition, the output of three-dimensional models
can be used to make a video or the like (see sections 57 and 59). A recent study reported an all-
atom molecular dynamics simulation [6], but it may not be usable in BIOMOD due to its high
computing cost.
—Ibuki Kawamata (Tohoku University)

66. Molecular Modeling of DNA


DNA molecules and structures can be observed under electron and atomic force microscopes.
However, it is impossible to directly observe the physical properties of a molecule beyond the
resolution of these instruments. To understand the behavior of a molecule in more detail, a
three-dimensional molecular model is effective. This section introduces how to make all-atom
and double helix-based models.

Atoms and DNA Structure


To design a DNA structure, you sometimes need to know the three-dimensional position of
each atom and the geometry of a helix. However, electron microscopy (see section 85) and
atomic force microscopy (see section 85) cannot directly observe such information beyond their
resolution, and lost information cannot be restored.
Here, molecular modeling is effective. By creating a model on a computer and understanding
the structure, you can understand the physical properties and constraints of DNA in advance.
Software programs to create an all-atom model of DNA and a three-dimensional model of
double helix structure are introduced below. The created structures can also be used as 3D
models in video preparations (see sections 57 and 59).

All-Atom Model by Namot


A DNA molecule is a string of so-called nucleotide units, composed of hydrogen, carbon,
nitrogen, oxygen, and phosphorus atoms (see section 40). Mutually complementary strands can
form a double helix through hydrogen bonding, which adds geometrical information. It is not
practical to set the coordinates of all atoms one by one by hand. Instead, modeling is performed
using software.
A so-called Namot2 software program [1,2] can put all atoms of DNA in place, and create a double
helix structure or much larger structures.
Chapter 6 Software Techniques 257

Its executable file is not directly available, and thus the source code must be compiled in an
environment with X Windows System. An execution screen is shown on the left in FIG. 1. For a
simple single or double strand DNA, you just have to select File -> Generate and input a sequence
to obtain its all-atom model. “Deoxy” in the “B” form should be selected.
You can store the obtained model in a format compatible with other applications, such as pdb,
by selecting File -> Write. The structure can be imported into the software programs VMD [3,4]
and Chimera [5,6] and displayed.
In Namot, you can use the “add unit” command to program the position of each nucleotide. For
details, see the online manual [1]. The right panel in FIG. 1 shows a structure generated from a
demo file included in Namot, visualized by VMD. You can create a superhelix structure formed by
a double helix, a branched structure formed by a double helix protruding upward, downward,
leftward, rightward, forward, and backward, etc.

Execution screen (left) and demo structures (right) of Namot.

Recent studies have investigated an all-atom model of DNA origami. As shown in FIG. 2, an all-
atom model can be created from a caDNAno file (provided from Tohoku University).
Chapter 6 Software Techniques 258

All-atom model of a DNA origami

Molecular Dynamics
Based on the all-atom model obtained, you can perform molecular dynamics simulations
based on classical physics. Popular simulation software programs are NAMD [7], Amber [8], and
Charmm [9].
You can consult the NAMD tutorial for a molecular dynamics simulation of DNA, with a pdb file as
input. FIG. 3 shows the result of a simulation of double helix dissociation process at an elevated
temperature. Note that all-atom molecular dynamics usually requires expensive computing.

Molecular dynamics simulation on NAMD

Nanoengineer-1 for Helix Structure Modeling


When you only need a less detailed structure instead of an all-atom model, Nanoengineer-1
is effective [10]. FIG. 4 shows its execution screen displaying the modeling of a spoked-wheel
Chapter 6 Software Techniques 259

structure with T-motifs [11]. By selecting Build -> DNA -> Insert DNA, you can generate a three-
dimensional model of a double helix that has the same geometry as DNA. It allows you to draw
differently oriented helices, cut and connect terminals, and so on, in an intuitive manner. It is an
effective software program for a design where only the phase and pitch of helix are important.

Execution screen of Nanoengineer-1

—Ibuki Kawamata (Tohoku University)

67. Simulation of DNA Logic Gate


Implementation of information processing systems based on DNA have been extensively stud-
ied. However, the design and analysis of such a system will become more and more difficult
as the system size increases. You can overcome this problem by predicting the behavior of the
system in advance. This section introduces a software program that can simulate DNA strand
displacement reactions by means of numerical analysis.

DNA Logic Gates and Chemical Reaction Equations


DNA has been used as input and output to logic gates (see section 29) and oscillators (see section
33). In these systems, more than one chemical reaction proceeds simultaneously, making
it difficult to quantitatively predict behavior. When an existing gate is extended to design a
Chapter 6 Software Techniques 260

larger system, newly introduced sequences might interfere with the original system, causing
unexpected reactions.
To avoid this, the behavior of the system should be predicted in advance by numerical sim-
ulation based on chemical reaction equation, i.e. continuous-deterministic simulation [1,2]
by formulating an ordinary differential equation based on reaction kinetics (see section 39)
and using a numerical analysis library (see section 60). We can apply discrete-event/stochastic
simulation to a relatively small number of molecules in a compartmentalized environment, such
as liposomes (see section 89).
As the number of molecular species involved increases, the differential equation will include
more and more variables, and thus will become difficult to manually formulate and analyze
repeatedly. It will also require a technique to determine the reaction rate constant and other
parameters.
In the case of DNA reactions, the elementary reactions of which are very limited, e.g. to strand
displacement reaction (see section 42) and enzymatic reaction (see section 45), there are design
assistant software programs that can automate formulation and analysis. In the design and
simulation of a system such as an enzymatic reaction-based oscillator, the DACCAD tool is
effective [3] (see section 33). It features intuitive, graph-based operations.

Design/Analysis Software DSD


This section introduces the online software “DSD” [4,5], which can design and simulate a logic
gate that is driven by strand displacement reaction and does not involve any enzyme.
DSD, developed by Microsoft Research, is available by clicking “MAIN” in its webpage [4], which
also includes a link to the tutorial and manual. A basic guide is given below.

Notation for DNA

DSD uses its original notation to describe DNA [6]. FIG. 1 summarizes an exercise found in the
manual. Each domain is indicated with a letter with/without a number, and its complementary
strand is marked with “*”. Each short toehold sequence is marked with “^”. The symbol of each
rightward single strand is placed in “< >”, each leftward single strand in “{ }”, and each double
strand in “[ ]”. “:” denotes concatenation along a leftward single strand, while “::” denotes
concatenation along a rightward single strand.
Chapter 6 Software Techniques 261

DNA structures & corresponding notation

Programming of Input File

The concentration of each molecule at the initial state is written as a program in the Code
window (left), which is then inputted into the software. You can also add reaction rate constants,
simulation time, and molecules to be displayed in the graph to the program. The software
includes various sample programs, and you can create a program for amplifying gate (see
section 31) by selecting “Catalytic” from “Examples”.

Below is an exemplary simulation of an AND gate, with the initial state of ”.


molecules and their concentration as shown at the top in FIG. 2. The
corresponding program is shown at the bottom of the figure. The first 6 lines
define variables and the last line describes an initial state. The initial state
includes the concentration and species of molecules, which are described in
“( )”, separated from each other by “
Chapter 6 Software Techniques 262

Schematic and programming of the initial state

Compilation and Numerical Simulation

Once the programming is done, compile it by clicking the “Compile” button at the top left. If
the program has not been written appropriately, this step will return an error (then you have
to correct it). After the compilation is successful, possible structures that can be reached from
the initial state are investigated exhaustively to formulate ordinary differential equations. Select
the Graph tab under Compilation over the right window to display a graph of reaction paths, as
shown in the upper panel of FIG. 3.
Then, click the “Simulate” button on its side to display the numerical simulation results. Unless
Chapter 6 Software Techniques 263

you change them, the parameters have default values and settings. You can choose between
“Stochastic” (stochastic simulation) at default and “Deterministic” (deterministic simulation).
A result of selecting the Plot1 tab under Simulation in the bottom right window is shown
at the bottom of FIG. 3, wherein the horizontal axis represents time and the vertical axis
represents concentration. It provides various options; for example, you can change the settings
of Compilation and Options at the top right to perform an in-depth simulation.
This software program can predict the time course of concentration, and is useful for system
design/analysis.
Chapter 6 Software Techniques 264

Compilation & simulation results

—Ibuki Kawamata (Tohoku University)

You might also like