Professional Documents
Culture Documents
d Δy(t)
Δx(t)
q ae dx ah w ix n dcl d ow sil
at a window
Figure 4. Features of the vicinity
Figure 5. Individual timestep classification
with a standard RNN
was removed at the considered horizontal position.
forward
Hid Hid
states
Hid Hid
backward
Hid Hid
states
Hidden Layer
I I I I
t t+1 t t+1
Network Inputs
end of stroke
x − position
y − position LSTM (BLSTM), which has been shown to outperform
time
other neural network architectures in framewise phoneme
recognition [6]. In the experiments described below, we use
a BLSTM network with a CTC output layer to transcribe
samples of handwritten text.
Reconstructed Image