Professional Documents
Culture Documents
Manga (Japanese comics) are popular and enjoyed all over the 2.1 Segmentation
world. Manga have been created by pen and ink on white pa-
per, and most existing manga are monochrome. Recently, colored In preprocessing, we segment the reference and target images by
manga have become more popular, and colorizing monochrome computing superpixel-based filling and segmentation [Shi and Ma-
manga has attracted the attention of manga publishers and authors. lik 1997]. Because a manga image usually consists of flat regions,
If previously published monochrome manga are colorized and re- this tends to work well. For example, a face of a character in Fig-
distributed, they gain readers’ attention and readers can enjoy the ure 1 is a single, flesh-colored, flat region. Users can modify results
richer representations of manga. if they are not satisfied.
However, colorization of manga is a tedious and time-consuming
task that must be performed manually using image editing software 2.2 Finding Corresponding Regions by Graph Match-
such as Photoshop or by scribble-based manual colorization meth- ing
ods [Levin et al. 2004; Sýkora et al. 2009]. Although automatic
colorization methods for natural images exist [Gupta et al. 2012; In manga, the shape and the position of a region are often deformed.
Irony et al. 2005; Welsh et al. 2002], it is hard to apply them to Therefore, when we try to match a region in the target image to a
manga because their work is based on the assumption that neigh- region in the reference image, it is difficult to find the best matching
boring pixels that have similar intensity have similar colors, which simply by comparing the regions one by one. Thus, we aim to find
is not true for manga. The nature of manga images is very differ- the overall correspondence using relative positions between neigh-
ent from that of natural images because manga consist only of line boring regions and keeping the two images as similar as possible.
drawings and textural patterns. Qu and colleagues [Qu et al. 2006]
proposed a colorization method for manga that manually colorizes In the proposed method, regions in the colored reference image and
plain and textural pattern regions in a unified manner. However, a monochrome target image are represented as nodes in a graph
in their method each area of a manga image still requires an initial structure. We call these the reference graph and target graph, re-
operation of manual color assignment by user scribbles, so their spectively. Then, colorization is performed by formulating the
method is still time-consuming when many manga images need to matching of the two graphs as a constrained 0-1 quadratic program-
be colorized. ming problem.
∗ e-mail:{sato, matsui, yamasaki, aizawa}@hal.t.u-tokyo.ac.jp
We denote the reference graph as Gr = (Vr , Er ) and the target
graph as Gt = (Vt , Et ), where V represents a set of nodes and E
represents a set of edges. Examples are shown in Figure 1. A so-
lution can be defined as a permutation matrix X = (xij )Nr ×Nt ∈
{0, 1}Nr ×Nt , where Nr and Nt denote the numbers of nodes in
Gr and Gt , respectively.
1 Whilecharacters are considered here, our method can be extended to
other objects.
Graph matching
Segmentation
Colorized Colorized
target target image
graph 𝐺𝑡
Target image Target
graph 𝐺𝑡
Figure 1: System overview. (a) The colored reference image and a target image are segmented into small regions. This can be done by simple
superpixel-based filling. (b) A graph that represents the image is constructed in which each region is represented by a node, for both reference
and target images. (c) The reference and target graphs are matched. (d) Each node of the target graph is colored. (e) By using the colored
target graph, the target image is colored.
When the ith region in the reference image (ith node from Vr ) cor-
responds to the jth region in the target image (jth node from Vt ),
xij is set to one, and zero otherwise. For one-to-one correspon-
dence, each row and column of X must satisfy the conditions:
X X
xij = 1, xij = 1. (1)
j i
Here, x is a concatenation of column-wise vectors of matrix X, Q Here, λmin is the smallest eigenvalue of matrix Q and I is an iden-
is a matrix containing Q(i1 , j1 , i2 , j2 ) as its elements, and Cx = tity matrix. Thus, the expression Q − λmin I is positive semidefi-
1, where 1 is a column vector of all ones and C is a matrix that nite, and the objective function becomes convex. Any number with
imposes one-to-one correspondences between the two sets of nodes value −λmin or greater can be used for the diagonal matrix to con-
(equivalent to Eq. 1). vexify the objective function; however, it can be shown that the
error caused by the relaxation is minimized when −λmin is used
[Hammer and Rubin 1970].
By relaxing the variable and replacing the second term using the
vector notation λmin · 1 = λmin , Eq. 8 above becomes:
min xt (Q − λmin I)x + λtmin x
2 Reference image Target image Colorization result
s. t. Cx = 1, x ∈ [0, 1]N . (9)
(a) Even though the shapes of the hands change greatly, they are correctly
Equation 9 can easily be solved using the interior-point-convex al- matched.
gorithm.
Once the solution is obtained in the continuous domain, we must
convert it back to the original binary domain. Before that, we again
2
represent the continuous solution xc ∈ [0, 1]N in a matrix form
N ×N
Xc ∈ [0, 1] . To obtain the binary solution Xb ∈ {0, 1}N ×N
from Xc , we choose one element from each row and column of Xc
without duplication to satisfy Eq. 1.
Reference image Target image Colorization result
3 Colorization Results
(b) The character’s elbows do not appear in the target image, but the arm
We tried to color more than 30 images. In this paper, only some parts are correctly colored.
sample images are shown because of space limitations. We show
colorization results in Figures 3(a) to 5(c). In all cases, the in- Figure 3: Successful results (1)
put reference and target images of manga characters were cropped
from manga titles (or reproductions) and all reference images were
colored. In preprocessing, we segmented the target characters into
small parts automatically as discussed in Section 2.1. If the result We also show an application of our proposed method to key frames
was not good, we modified the result manually. Then, we applied of an animation. In key frames, characters have small posture
the proposed colorization method to the target images. This step changes between successive images in one scene, and the proposed
was automatic and was completed very quickly. method can work well. An example is shown in Figure 6.
Reference image Colorized target image 1 Colorized target image 2 Colorized target image 3
(t=0) (t=1) (t=2) (t=3)
Figure 6: Colorization of animation images. Target image 1 (t=1) is colorized using the reference image (t=0), target image 2 (t=2) is
colorized using colorized target image 1 (t=1), and target image 3 (t=3) is colorized using colorized target image 2 (t=2).
Disappears wrong
in the target
image
Splits in the
target image
(a) Part of the straw hat fails because there is a balloon in the target image
(a) The positions of the arms are changed but they are properly colored. and the straw hat is partly covered and divided. With such splits or merges,
the proposed method tends to fail.
+ =
Reference image Target image Colorization result
(b) The hair band is not correctly colored because the number of patterns in
Reference image Target image Colorization result it is different. Note that other parts, like the mouth, are properly colored.
(b) Although part of the hair is not colored, almost all of the body is correctly
colored. Merge in the
target image Failure
Figure 4: Successful results (2)