Sei sulla pagina 1di 10

A Digital Palaeographic Approach towards Writer Identification in the

Dead Sea Scrolls

Maruf A. Dhali1 , Sheng He1 , Mladen Popović2 , Eibert Tigchelaar3 and Lambert Schomaker1
1 Institute of Artificial Intelligence and Cognitive Engineering (ALICE), Faculty of Mathematics and Natural Sciences,
University of Groningen, PO Box 407, 9700 AK, Groningen, The Netherlands
2 Qumran Institute, Faculty of Theology and Religious Studies,

University of Groningen, PO Box 407, 9700 AK, Groningen, The Netherlands


3 KU Leuven, Faculty of Theology and Religious Studies, Leuven, Belgium

{m.a.dhali, s.he, m.popovic}@rug.nl, eibert.tigchelaar@kuleuven.be, l.r.b.schomaker@rug.nl

Keywords: Dead Sea Scrolls, Handwritten Document Analysis, Digital Palaeography, Writer Identification, Handwriting
Recognition, Pattern Recognition, Feature Representation, Machine Learning.

Abstract: To understand the historical context of an ancient manuscript, scholars rely on the prior knowledge of writer
and date of that document. In this paper, we study the Dead Sea Scrolls, a collection of ancient manuscripts
with immense historical, religious, and linguistic significance, which was discovered in the mid-20th century
near the Dead Sea. Most of the manuscripts of this collection have become digitally available only recently
and techniques from the pattern recognition field can be applied to revise existing hypotheses on the writers
and dates of these scrolls. This paper presents our ongoing work which aims to introduce digital palaeography
to the field and generate fresh empirical data by means of pattern recognition and artificial intelligence. Chal-
lenges in analyzing the Dead Sea Scrolls are highlighted by a pilot experiment identifying the writers using
several dedicated features. Finally, we discuss whether to use specifically-designed shape features for writer
identification or to use the Deep Learning methods on a relatively limited ancient manuscript collection which
is degraded over the course of time and is not labeled, as in the case of the Dead Sea Scrolls.

1 INTRODUCTION With regard to choosing the right methodology,


optical character recognition (OCR) methods are not
This paper is part of a pioneering project on the sufficient for historical manuscripts. There are mod-
Dead Sea Scrolls that is sponsored by the European ern forms of neural networks (Deep Learning) hav-
Research Council (EU Horizon 2020). This multi- ing exceptionally good results (LeCun et al., 2015) in
disciplinary project brings together the natural sci- many aspects of pattern recognition including hand-
ences, artificial intelligence, and the humanities in or- written document analysis. But these performances
der to shed new light on ancient Jewish scribal culture can only be achieved in case of millions of training
by investigating two aspects of the scrolls’ palaeog- examples, which are contrary to the number of doc-
raphy: handwriting recognition (the typological de- uments in many historical manuscripts, especially in
velopment of writing styles) and writer identification. the DSS.
Recognizing the handwriting would solve the when, Here, we will present preliminary results of writer
which and where questions, and identifying the writer identification in the DSS using several hand-crafted
would end up answering the who question. These features. Although this gives us fast results with-
are the four most important perspectives (figure 1) in out lengthy training on the limited labelled data of
the study of palaeography and book history (Stokes, the DSS, they are certainly not the best results to
2015). The digitization of the Dead Sea Scrolls (DSS) be expected. We consider the results as a baseline
has opened the door for pattern recognition to be ap- measurement for later experiments. We suggest how
plied in answering those four questions (4-W). We to improve the results by exploiting the power of
aim to bridge the gap between computational science parameter-heavy machine learning methods using this
and traditional palaeography by solving the 4-W with small dataset. In solving this, we make a three-fold
a potential impact on digital palaeography beyond proposition of advanced statistical modelling, data-
DSS studies. augmentation, and the use of pre-trained networks.

693
Dhali, M., He, S., Popović, M., Tigchelaar, E. and Schomaker, L.
A Digital Palaeographic Approach towards Writer Identification in the Dead Sea Scrolls.
DOI: 10.5220/0006249706930702
In Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2017), pages 693-702
ISBN: 978-989-758-222-6
Copyright c 2017 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved
ICPRAM 2017 - 6th International Conference on Pattern Recognition Applications and Methods

1.1 Dead Sea Scrolls image processing techniques need to be applied for
optimum results of segmentation. Starting with edge
The DSS are a collection of ancient damaged detection, morphological operations, filling gaps and
manuscripts that were discovered in the mid-20th cen- then finding connected components help to automati-
tury in the Judaean Desert, in between Jerusalem and cally segment the hand-written fragments. Then fur-
the Dead Sea. Most were written over a period of ther processing can localize and extract the charac-
almost four centuries (ca. 250 BC to ca. 135 AD) ters. Due to the difference in the textures of papyrus
(Tigchelaar, 2010; Popović, 2012; Popović, 2015) in and animal-skin, individual measures must be taken
characters commonly known as the Hebrew alpha- on their distinctive periodic structures.
bet, which actually derives from the Aramaic script We have explored different feature-extraction
(Yardeni, 2002). The manuscripts, written by many techniques on the images of the DSS. Feature-
different writers, some of whom may have written representation maps the raw pixel intensity into a dis-
multiple manuscripts, display a broad variety and de- criminant high-dimensional space (Mikolajczyk and
velopment of different styles of this Hebrew-Aramaic Schmid, 2005; Li et al., 2015) in order to capture
script. The study of ancient handwriting provides specific information of the characters which can be
the chronological framework, but the typological se- processed by algorithms in computers. This step is
quence of writing styles has to date not been system- an important element in the field of computer vision
atically assessed for the DSS. and pattern recognition. There have been a lot of ef-
This project carries out the first systematic assess- forts to design discriminative and powerful features
ment of the palaeographic framework of the scrolls (Li et al., 2015). Though the (Deep) Learning-based
by combining two approaches. First, we will con- feature representation may achieve better results in
duct new radiocarbon (14 C) dating on a number of many cases, hand-crafted features have several advan-
physical samples of the scrolls, kindly provided to tages in the analysis of handwritten documents, espe-
us by the Israel Antiquities Authority (IAA). Sec- cially for historical manuscripts. This is due to the
ond, we will generate for the first time quantitative amount of data in historical manuscript collections,
data for palaeographic handwriting recognition by which is usually not big enough to train deep neural
means of Artificial Intelligence, using the Monk sys- networks. In contrast, the ImageNet data set (Deng
tem, designed by Schomaker’s research group at AL- et al., 2009) contains millions of samples for train-
ICE (Schomaker, 2016; Van der Zant et al., 2008; ing the network. The challenge becomes even higher
Bulacu and Schomaker, 2007). The challenging is- when the total number of usable pages comes to a
sue of writer identification in the DSS has not been count of hundreds in the DSS. To take the opportu-
systematically dealt with before. The tools of digital nities offered by the Deep Learning methods, the as-
palaeography enable new, significant steps forward. sociated challenges need to be overcome in order to
In this paper, we focus on this second approach of analyse the DSS.
digital palaeography.

Who? - Writer identification


2 DATA
When? - Temporal alignment

Which? - Manuscript identification 2.1 Manuscript Images


Where? - Localization
We will use digital images of the DSS as our primary
Figure 1: The four interesting questions for handwritten data. There are various sources for digital images of
manuscript understanding (image from the DSS manuscript the DSS manuscripts. The source used in this study
PAM 43.754, source: Brill scans). is kindly provided to us by Brill Publishers (Lim and
Alexander, 1995). There are 2463 images in the Brill
1.2 Challenges in Digital Palaeography collection with varied resolutions from 600 by 600
pixels to 2800 by 3400 pixels, approximately. An-
In order to achieve both goals, i.e., handwriting recog- other source is the high-resolution multi-spectral im-
nition and writer identification, specific challenges at ages of the DSS kindly provided to us by the Israel
several levels of Computer Vision and Artificial Intel- Antiquities Authority (IAA), which derive from their
ligence must be overcome. Initial analyses are needed Leon Levy Dead Sea Scrolls Digital Library project.
for the proper extraction of characters (foreground, In this project the IAA produces multi-spectral im-
ink) from the background, which is mostly either an- ages of scrolls fragments on both the recto and verso
imal skin or papyrus in the case of the DSS. Several in 28 exposures, creating a file of 56 monochrome ex-

694
A Digital Palaeographic Approach towards Writer Identification in the Dead Sea Scrolls

posures per fragment. The system then generates a Additionally, the possibility of using the IAA
57th file of a colour image that combines all visible material will improve our results.
wavelengths. The resolution of the files is 1,215 pix-
els per inch at a 1:1 ratio, capturing approximately 2.2 Ground Truth
4 gigabytes of information per fragment (Shor et al.,
2014). Unlike many other historical manuscripts, the DSS
The Brill images are single-layered grayscale im- do not have a structured and complete dataset nor the
ages with 300 ppi (pixels per inch) on both axis. They ground truths for testing. Before diving into any sort
have shadows and reflection from external lighting. of computer aided writer identification, the ground
Additionally the lighting throughout all the images is truths must be there to analyse the results. To estab-
not uniform. Among the images, the ones containing lish the ground truths, we need experts in the field
several fragments are mostly not aligned in a horizon- and also their proper access to the data. We have this
tal way for text reading. This poses the issue of rota- two-folded advantage in our group: first through the
tion variance in characters. Many of the images also presence of palaeographic experts and second through
contain paper calibration strips for scale representa- the Monk system which is accessible through web
tion and contemporary hand-written numbers. The browsers. By integrating these, we started to label the
digitization noise can also be noticed in many of the DSS image data for ground truths.
images (see figure 2). We have proposed two different methods for la-
belling. The first one is to detect the region of interest
in the DSS images. The second one is to create the
ground truth for character-labels. Both these tasks re-
quire manual labour from experts with palaeographic
background knowledge on image level to pixel level.
This is the bench-mark in identifying the writers and
aligning the temporal developments in script style
Figure 2: Two of the Brill images; PAM 40.456 (left) and for the DSS. In this paper, we will only use the la-
PAM 40.531 (right); the images show digitisation noise, belled regions of interest (we call them FragmROIs,
alignment issues with the small fragments, shadows near by shortening the term fragment region of interest)
the border and lighting problems. in order to build algorithms to extract features (using
available methods) and identify (recognition) writers.
The IAA images are clear, properly aligned and From the DSS images, FragmROIs were selected and
free from the problem of lighting and shadow un- labelled by the palaeographic experts using the Monk
like the Brill ones. Additionally the different expo- system (see figure 3). Those rectangular FragmROIs
sure bands of the IAA hold important underlying in- could consist of the entire text on an image, or of only
formation regarding the fragments providing essential a section of text selected from an image. Different
attributes for the scrolls. For example, one particu- FragmROIs from one and the same manuscript were
lar band provides clear information on the ink (fore- labelled as stemming from one writer or scribe, unless
ground) whereas another one gives more details on palaeographers distinguished two scribes as writers of
the underlying leather/papyrus (background) on the the manuscript.
retro side. Some bands are useful for the textual con-
tents and some other bands give a better understand-
ing on the textural properties of the scroll material.
Extraction of this useful information is possible on
both single images and multi-spectral-fused images.
As a whole, digital image data has provided a new
and broader perspective in the quantitative analysis
and processing of the scrolls.
The scope of the current paper is limited to the Figure 3: Using the Monk system, the palaeographic expert
images in the Brill collection, but in the near future can select the region of interest (FragmROI), then put the
we expect to publish our results on the digital data associated attributes (scribe, style, comment etc.) to pro-
from the IAA’s Leon Levy Dead Sea Scrolls Digital duce an XML file, which will later be used as labelled data.
Library. The quality and the challenges of the Brill Currently the experts can only select a rectangular region,
but the provision of choosing a polygonal region of interest
images can not be seen as a set-back, rather it would will be added to the system in near future.
be a starting benchmark to the robustness of our work.

695
ICPRAM 2017 - 6th International Conference on Pattern Recognition Applications and Methods

While labelling the writers, we have set up a provi- this section, we will present the methodology used in
sional naming rule starting with the name scribeAxxx, our pilot project in writer identification to benchmark
where xxx are numerical values starting from 001. our works in analysing the DSS.
Each of the human labelled new-writers will be al-
located with an individual value. The term A is put 3.1 Writer Identification
before the numerical values in order to preserve the
tag of original labelling from the palaeographic ex- Identifying writers using computers has been done for
perts. If at a later stage of our study, two of the decades (Plamondon and Lorette, 1989), which is a
writers are found to be the same one according to problem of recognizing the writer of a given docu-
the system, then they can be referred to with the new ment based on handwriting styles. A number of dif-
name of scribeBxxx having two child-node of format ferent features have been proposed and studied for
scribeAxxx, preserving the original label. writer identification on scripts from several languages
The present pilot study is based on two distinct including Dutch (Bulacu and Schomaker, 2007), En-
set of writers. The first set is a limited sample of glish (Schomaker and Bulacu, 2004), Indic (Adak
323 FragmROIs labelled as having been written by and Chaudhuri, 2015; Karunakara and Mallikarju-
13 scribes, namely the scribes of 1QIsaa columns naswamy, 2011), and Arabic (Bulacu et al., 2007).
1-27, 1QIsaa columns 28-54, 1QS, 1QSa, 1QSb, In the case of the DSS, we will be identifying the
1QM, 1QpHab columns 1-12, 1QpHab columns 12 scribes behind the scrolls with Hebrew characters, and
end-13, 4Q53, 4Q175, 11Q5, 11Q19 columns 2- a hand-crafted feature specially for these characters is
5, 11Q20. We labelled them from scribeA001 to yet to be proposed and studied. Instead of design-
scribeA013. Distinct manuscripts were labelled as ing a new feature, we initially started working with
deriving from different writers, even though in sev- some of the existing textural-based and grapheme-
eral of the manuscripts of the first set palaeographers based features. Textural-based features are based on
think that one and the same writer produced multi- the statistical information about slant and curvature
ple manuscripts (Tigchelaar, 2002). To incorporate of the handwritten characters, and grapheme-based
the palaeographic opinion, we then merged those 13 features, inspired by the bag-of-words model, extract
scribes into 7 scribes by introducing scribeBxxx se- local structures and then map them into a common
ries. Then we took the second set of 13 scribes space (He and Schomaker, 2016). We briefly discuss
with a limited sample of 124 FragmROIs labelled the preprocessing techniques and the features used in
as scribeA014 to scribeA026 (the scribes of 4Q266, this work in the following sections.
4Q504, 11Q10, 1Q22, 4Q209, 4Q167, 4Q6, 4Q286,
4Q381, 4Q405, 4Q491, 4Q431, 4Q525). The main 3.1.1 Preprocessing
difference between these two sets are the amount of
characters per scribe. The first set has a higher num- As the feature extraction technique is applied on the
ber of characters than the second set. Thus, for this binarized images, first we pre-processed the DSS im-
pilot project, we have 447 FragmROIs labelled as ages. Binarizing the Dead Sea Scrolls images is quite
20 distinguishable scribes according to palaeographic challenging, given their diverse intensity, similarity
opinion. between ink and background traces, and image qual-
ity. We first started with Sobel edge detection (So-
bel, 1990) and then removed the connected objects on
the border to get rid of the markings. Morphological
3 A PILOT PROJECT operation was then used followed by image thresh-
olding. We used the global Otsu threshold selection
The DSS image data has its own distinctive char- method (Otsu, 1975) as it is efficient and parameter-
acteristics compared to other historical manuscript less (see figure 4).
datasets. This data set has with quite a different ap-
pearance from, e.g. historical manuscripts such as the 3.1.2 Feature Representation
Medieval Palaeographic Scale data set (Monk, 2016)
from a previous project (He et al., 2016) in three as- Previous studies showed that the textural-based
pects: (1) the number of characters in fragments from feature extraction methods perform better than
some documents can be as low as one; (2) the ink of grapheme-based methods (He and Schomaker, 2016;
each character has been faded out over the course of He and Schomaker, 2017). Additionally, a more pow-
time, making it more difficult to observe and process; erful approach was introduced by using the spatial co-
(3) the large diversity and lack of uniformity among occurrence among features (Bulacu and Schomaker,
text blocks, presenting a challenge for analysis. In 2007; Ito and Kubota, 2010; Qi et al., 2014). The

696
A Digital Palaeographic Approach towards Writer Identification in the Dead Sea Scrolls

∆n Hinge. The ∆n Hinge is a rotation-invariant tex-


ture feature (He and Schomaker, 2014), computed by
building a feature network with the differential oper-
ator between Hinge kernels as the kernel function K i :
( n−1 n−1
∆n α(xi ) = ∆ α(xi )−∆δl α(xi +δl)
n−1 n−1 (2)
∆n β(xi ) = ∆ β(xi )−∆δl β(xi +δl)
Figure 4: The left one is a FragmROI from Brill collection
(PAM 43.787A) and the right one is the binarized image where (α, β) is the Hinge kernel and n is the order
using the Otsu threshold selection method. of the differential operator. Although many different
features can be generated based on the feature net-
later idea has been extended in a previous work (He work with different n, we work with the ∆1 Hinge fea-
and Schomaker, 2017) with the introduction of the ture with a feature-dimension of 780.
joint feature distribution principle (JFD principle). By Quadruple Hinge (QuadHinge). QuadHinge is a
accommodating these facts, we used eight textural- powerful feature representation following the JFD
based methods (three of them following the JFD prin- principle, which incorporates the curvature informa-
ciple) and one grapheme-based method. tion of the contour fragments in the Hinge kernel by
Hinge. The Hinge feature is the joint probability dis- computing a fragment curvature measurement (FCM)
tribution of orientations of the legs of two contour C (Fc ) for contour fragments (Benhamou, 2004).
fragments attached at a common-end pixel on the ink Quill and QuillHinge. The Quill feature (Brink et al.,
contours (Bulacu and Schomaker, 2007). Figure 5 2012) is the joint probability distribution p(α, w) of
shows two examples of the Hinge kernel on contour the relation between ink direction α and the ink width
fragments with leg length l and the joint probability w characterizing the writing material properties. The
of the two orientations, α and β (α < β), are quan- QuillHinge is an extension of the Quill and Hinge,
tized into a 2D histogram. Empirically we have set which is the probability of p(α, β, w), resulting in a
l = 7 and the number of bins of α and β is set to 23. 3D histogram. We use the same parameters of the
Finally, the dimension of the feature vector is 253. Quill and QuillHinge as the original paper (Brink
et al., 2012), and the dimensions of Quill and Quill-
Hinge are respectively 1600 and 31200.
C(F1 )=1.06
Triple Chain Code. The triple chain code feature
C(F1 )=0.53
α1
α2 (Siddiqi and Vincent, 2010) is based on the chain code
β1 β2 on a pixel of the writing contours, which is the one of
eight directions where the next pixel is on, denoted
C(F2 )=1.06 C(F2 )=1.06
from 1 to 8.
TCC(xi , xi+l , xi+2l ) = [CC(xi ), CC(xi+l ), CC(xi+2l )]
Figure 5: The two figures show two contour fragments with (3)
the same Hinge kernel (α1 =α2 and β1 =β2 ) but different
fragment curvature values C (Fc ).
where CC(xi ) ∈ {1, 2, · · · , 8} is the chain code value
on position xi , and l is the Manhattan distance along
the writing contours. We take the same value of l = 7,
Co-occurrence Hinge (CoHinge). The CoHinge fea- similar as the CoHinge feature. The feature dimen-
ture is the joint distribution of Hinge kernel following sion is 512.
the JFD principle on two different points xi and x j Cloud Of Line Distribution (COLD). COLD is a
with Manhattan distance l (figure 7) on the contours curvature-free feature designed with the fact that writ-
as equation 1. ing contours can be approximated by a set of line seg-
  ments obtained by the sequential polygonization al-
CoHinge(xi , x j ) = Hinge(xi ), Hinge(x j ) (1)
gorithm (Siddiqi and Vincent, 2010) and the lengths
Each Hinge kernel has two values α and β, and and orientations of these straight lines can capture the
therefore, the CoHinge kernel has four values [α(xi ), handwriting styles. The high ordered curvature points
β(xi ), α(x j ), β(x j )], which can be quantized into a on the writing contours are obtained using the method
4D histogram. The Manhattan distance l is set to (Prasad et al., 2011), denoted by P = {pi (xi , yi ), i =
7 based on our previous study (He and Schomaker, 0, 1, 2, · · · , n}, where (xi , yi ) is the coordinate of the
2017). We set the number of bins of the angle to 10, point pi (see figure 6). The line segments can be
and finally the dimension of the CoHinge feature is obtained between any pair of the dominant points
10 ∗ 10 ∗ 10 ∗ 10 = 10, 000. (pi , pi+k ), where k is the parameter which denotes the

697
ICPRAM 2017 - 6th International Conference on Pattern Recognition Applications and Methods

distance on the dominant sequence P . Each line can


be measured by a pair (θ, ρ) in the polar coordinate
space, where θ is the line orientation and ρ is the line xi
length. All the lines in a given handwritten document l
m
can form a distribution in the polar coordinate space xj
and can be quantized into a log-polar histogram in-
spired by the Shape Context (Belongie et al., 2002).
The features obtained with k = 1, 2, 3 in the log-polar
space with the radius 7 and the angular intervals 12 Figure 7: Left: Co-occurrence patterns on ink contours.
are concatenated into one feature vector with the di- Right: An illustration of the stroke-length distribution on
mension: 7 ∗ 12 ∗ 3 = 252. a reference point (the blue point in the center). The green
rays are the partial length in each direction, and the yel-
low curve is the distribution of the partial length in the po-
lar space. The red line is the skeleton line of the stroke
ink. m is the maximum measurable stroke length (He and
Schomaker, 2017).
(a) (b) (c) 3.1.3 Identification Methodology

Writer identification is simply answering the who


k=1 question. For a query document QscribeA
scripti
, where
x
scripti is the script of the hand-written manuscript and
scribeAx is the writer which we want to identify, all
(d) (e) scripti
the documents in the database dssscribeA ∈ DSS scripti
i
are sorted according to the feature distance between
scripti scripti
QscribeA and dssscribeA to produce a hit-list where the
k=2 x i
writer of the top document is assigned to scribeAx .
Here scribeAi is the label of all the writers and for our
case scripti is a single script of Hebrew. The nearest
(f) (g) neighbour classification method is performed using
the leave-one-out (Brink et al., 2012; Siddiqi and Vin-
Figure 6: Illustration of the process of the COLD construc- cent, 2010) strategy. We take the query document out
tion on the Shin character: (a) The given binarized con-
nected component; (b) The contour extracted from the bina- and sort the remaining documents according to their
rized image (a); (c) Detected dominant points (red points); distance function to an output hit-list. For the distance
(d) Line segments (red lines) obtained between pair domi- function of the feature vectors, we have taken the χ2
nant points when k = 1; (e) The distribution of lines from (chi-squared) distance for its better performance (Bu-
(d) in the polar coordinate space; (f) Line segments when lacu and Schomaker, 2007).
k = 2 (Note that some long lines are not shown in order to
make the figure more clear); (g) The distribution of lines
from (f) in the polar coordinate space.
4 RESULTS
Junction Features. Junclets (He et al., 2015), a
grapheme-based feature, is the stroke-length distribu- In this section we present the performance of writer
tion in every directions from 0 to 2π around a refer- identification based on the features and methodology
ence point (see Figure 7) inside the ink trace. When explained in section 3.1. 447 FragmROIs were used
the center point lies on the junction points, such as for the pilot test. In the first set we took 323 Fragm-
the fork points and high curvature points on the skele- ROIs with 13 writers labelled from scribeA001 to
ton line of the ink strokes, the corresponding feature scribeA013 having 74, 33, 14, 13, 26, 37, 58, 3, 25,
is the junction feature, which contain the junction in- 24, 4, 10 and 2 FragmROIs respectively. The first set
formation around the joint point. We have taken the consists of writers with a large number of characters
stroke length distribution in 120 directions equidis- in their corresponding FragmROIs.
tantly sampled from 0 to 2π and the feature dimension We first calculated the feature vectors for all the
of each junction is 120. FragmROIs. Then we performed the writer identi-
fication using the methodology explained in 3.1.3.
We produce the output hit-list of all the FragmROIs
sorted out in accordance with their distance to the in-

698
A Digital Palaeographic Approach towards Writer Identification in the Dead Sea Scrolls

put FragmROI. The top-n performance is calculated Table 3: The top-1 and top-10 performance (in percent-
when the query FragmROI is recognized as the writer age) of writer identification for another 13 scribes from
of the FragmROI on the top n of the hit-list. For ex- scribeA014 to scribeA026, with limited text fragments.
ample, the top-10 hit-list signifies the overall percent- Feature Top-1 Top-10
age of finding the same writer as input within the first Hinge 61.90 92.06
ten candidates (shortest distanced) of the output hit- CoHinge 62.69 89.68
list. Similarly the top-1 means the top-most candidate ∆1 Hinge 43.65 85.71
in the output hit-list corresponds to the same writer QuadHinge 63.49 90.47
as the input. The performance of top-1 and top-10 Quill 48.38 89.51
hit-list for first set is presented in Table 1. Accord- QuillHinge 45.16 74.19
TripleChainCode 61.90 88.09
Table 1: The top-1 and top-10 performance (in percentage) COLD 58.87 88.71
of writer identification for 13 scribes from scribeA001 to
scribeA013.
Junclet 31.45 76.74

Feature Top-1 Top-10 Table 4: The top-1 and top-10 performance (in percentage)
Hinge 87.61 97.83 of writer identification for all 20 scribes.
CoHinge 81.11 95.97
Feature Top-1 Top-10
∆1 Hinge 79.87 94.73 Hinge 78.30 94.40
QuadHinge 89.47 96.59 CoHinge 79.19 89.93
Quill 80.80 93.80
∆1 Hinge 68.23 85.48
QuillHinge 76.78 89.78
QuadHinge 79.64 89.04
TripleChainCode 84.82 96.59
Quill 71.58 86.57
COLD 82.35 94.42
QuillHinge 69.57 82.10
Junclet 81.42 95.04
TripleChainCode 79.19 91.28
ing to the majority palaeographic opinion, scribeA001 COLD 76.95 88.37
and scribeA002 are the same scribe, and so also Junclet 40.71 72.93
scribeA003, 004, 005, 008, 009 and scribeA010, 012,
which are then labelled as scribeB001, B002 and
B003 respectively. The result is presented in Table 2
for these seven scribes. Then we took the second set
5 DISCUSSIONS
Table 2: The top-1 and top-10 performance (in percentage) 5.1 Performance Evaluation
of writer identification for 7 scribes: scribeB001, B002,
A006, A007, B003, A011 and A012 .
We presented the results of writer identification on a
Feature Top-1 Top-10 limited number of scribes. Of the shape based meth-
Hinge 92.26 98.76 ods, the QuadHinge performs the best in the top-1
CoHinge 93.80 97.52 hit-lists for three out of four cases (only for the case
∆1 Hinge 90.71 96.28 of seven scribes in Table 2, CoHinge performs better
QuadHinge 93.50 96.90 with a small difference of 0.30% than QuadHinge),
Quill 88.54 96.28 whereas the Hinge feature gives better result in all
QuillHinge 88.85 96.90 the top-10 hit-lists. The reason for this performance
TripleChainCode 92.26 98.14 can be deduced from the design criteria of the fea-
COLD 91.33 96.28 tures themselves. Hinge feature takes into account
Junclet 56.03 88.85 the joint probability distribution of the orientations of
legs of two contour fragments from a common end
of 13 scribes with 124 FragmROIs. The amount of pixel on ink contours, which proves to be a strong
text is lower in this set than in the first one. The identical property for individual scribes of these an-
result is shown in Table 3. Finally, we took all the cient manuscripts. Additionally, the incorporation of
scribes together for testing. Table 4 presents the result FCM to the Hinge feature following the JFD principle
of these 20 scribes together ( i.e., scribeB001, B002, gives the QuadHinge feature a boosted performance.
B003, A006, A007, A011, A013, A014 to A026). We The directional measurement of the ink-trace
briefly discuss the results and our propositions in the width makes the Quill feature, which is quite infor-
next section (Section 5). mative on quill-based medieval scripts, a weak can-

699
ICPRAM 2017 - 6th International Conference on Pattern Recognition Applications and Methods

didate for the DSS. This is due to the uniformity of networks on massive not-labelled handwriting collec-
the ink-trace in these documents coming from a prob- tions. Such networks are trained to reconstruct im-
ably fairly blunt tip of the ancient writing equipment. ages over (via) a very limited number of values (hid-
Consequently, the QuillHinge fails to provide a higher den units). After training, such a network implicitly
performance in this test set. The ∆1 Hinge has a lim- knows a lot about historical handwritings in general.
ited performance, indicating that on Hebrew charac- In a second stage, such a network is then applied to
ters, loss of the angle with respect to the horizontal the DSS, using those hidden unit vectors as feature
removes too much of the writer-specific information. descriptors.
The grapheme-based feature, Junclets, gives lower 3) Data augmentation can be utilized in the pro-
performance than the cross-script writer identification cessing of the DSS. If there is a believable random
(He et al., 2015) due to the lower variability in the transformation of the DSS’ text patterns, i.e., one that
stroke-length distribution in every direction around a remains legible by humans, then for each natural sam-
reference point inside the ink of the DSS’ Hebrew ple of a character, a number of N derived random ver-
characters. sions of it may be added to the training set, effectively
enlarging the amount of labelled data. Known already
5.2 Propositions in the nineties (Baird, 1992) this was later made pop-
ular in handwriting recognition later by the use of hid-
The challenges in analysing the DSS are unique den Markov models (Varga and Bunke, 2003; Ha and
and unprecedented. Using the dedicated features Bunke, 1997).
(in 3.1.2), we found fast results without lengthy train-
ing on the limited labelled data of the DSS. But 5.3 Conclusions
they are certainly not the best results to be expected.
Especially when the amount of data is small with In this paper, we have introduced digital palaeogra-
large variability (as in Table 3), the performance be- phy of the DSS by presenting a pilot project, which
comes lower. To overcome this situation, we need to is part of a pioneering multi-disciplinary project that
consider a pragmatic approach incorporating several brings together the natural sciences, artificial intel-
propositions. ligence, and the humanities. By introducing the
1) Statistical modelling can be used in the case rule to establish ground-truths, we performed writer-
of the DSS, where the sample size is low and there identification tests using dedicated features on provi-
are differences in the scholarly opinion of writers as sionally labelled data. The varying performance of
well. We can use the differences in writing attributes results for different sets of writers led us to the propo-
of a set of different manuscripts to build a population sitions of statistical modelling, transfer learning, and
model. A writer model can be built using the query data augmentation for this largely diverse collection
manuscript. The classification is then carried out by of manuscripts.
evaluating the similarity of a further manuscript sam- We consider the results of this paper as a baseline
ple with respect to the models. We can build our pro- measurement for our later experiments. We will com-
visional model, similar to the work of speaker identi- bine both the aspect of specifically-designed shape
fication (Leuzzi et al., 2016), as follows: features and the Deep Learning methods to produce
pb (d(Wi ,W j )) fresh empirical data for the study of the DSS. Addi-
Λ(d(Wi ,W j )) = (4) tionally, we will conduct new radiocarbon (14 C) dat-
pw (d(Wi ,W j ))
ing on a number of physical samples of the scrolls.
Here, d(Wi ,W j ) is the distance computed from Wi , The outcome of 14 C dating will then be subjected to
the query writer to W j , the suspected writer. Λ de- Bayesian statistics methods in combination with the
notes the likelihood ratio over d(Wi ,W j ). The distri- results from temporal alignment using pattern recog-
bution of distances between the suspected writer and nition to reach more accurate and precise dating of the
the population is denoted by pb (d(Wi ,W j )), which DSS.
can be referred as the between-group distance among
the writers. pw (d(Wi ,W j )) is the distribution of dis-
tances taken within different instances of the sus-
pected writer (within-group distance). The collection ACKNOWLEDGEMENTS
of statistical models (Fisher, 1925), analysis of vari-
ance (ANOVA), can be used to analyse the within- The authors would like to thank Ruwan van der Iest
group and between-group variances of the writers. (research assistant for the ERC project at the Qum-
2) Another possibility is transfer learning (Long ran Institute) for his valuable inputs in labelling the
and Wang, 2015). It starts with the use of pre-trained regions of interest through the Monk-system.

700
A Digital Palaeographic Approach towards Writer Identification in the Dead Sea Scrolls

This work has been supported by an ERC Starting Ito, S. and Kubota, S. (2010). Object classification us-
Grant of the European Research Council (EU Hori- ing heterogeneous co-occurrence features. In Euro-
zon 2020): The Hands that Wrote the Bible: Digital pean Conference on Computer Vision, pages 701–714.
Palaeography and Scribal Culture of the DSS (Hand- Springer.
sandBible # 640497). Additional support comes from Karunakara, K. and Mallikarjunaswamy, B. (2011). Writer
identification based on offline handwritten document
NWO (Netherlands Organisation for Scientific Re- images in kannada language using empirical mode de-
search) and FWO (the Research Foundation Flan- composition method. Writer, 30(6).
ders): Models of Textual Communities and Digital LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep learn-
Palaeography of the DSS (# 326-25-001). ing. Nature, 521(7553):436–444.
Leuzzi, F., Tessitore, G., Delfino, S., Fusco, C., Gneo, M.,
and Zambonini, G. (2016). A statistical approach to
REFERENCES speaker identification in forensic phonetics field.
Li, Y., Wang, S., Tian, Q., and Ding, X. (2015). Feature
Adak, C. and Chaudhuri, B. B. (2015). Writer identification representation for statistical-learning-based object de-
from offline isolated bangla characters and numerals. tection: A review. PR, 48(11):3542–3559.
In ICDAR, pages 486–490. IEEE. Lim, T. and Alexander, P. (1995). Volume 1. In The Dead
Baird, H. S. (1992). Document image defect models. In Sea Scrolls Electronic Library. Brill.
Structured Document Image Analysis, pages 546–556. Long, M. and Wang, J. (2015). Learning transferable
Springer. features with deep adaptation networks. CoRR,
Belongie, S., Malik, J., and Puzicha, J. (2002). Shape abs/1502.02791, 1:2.
matching and object recognition using shape contexts. Mikolajczyk, K. and Schmid, C. (2005). A perfor-
IEEE PAMI, 24(4):509–522. mance evaluation of local descriptors. IEEE PAMI,
Benhamou, S. (2004). How to reliably estimate the tortu- 27(10):1615–1630.
osity of an animal’s path: straightness, sinuosity, or Monk (2016). Medieval palaeographic scale data set (online
fractal dimension? Theoretical Biology, 229(2). collection).
Brink, A., Smit, J., Bulacu, M., and Schomaker, L. (2012). Otsu, N. (1975). A threshold selection method from gray-
Writer identification using directional ink-trace width level histograms. Automatica, 11(285-296):23–27.
measurements. PR, 45(1):162–171. Plamondon, R. and Lorette, G. (1989). Automatic signature
Bulacu, M. and Schomaker, L. (2007). Text-independent verification and writer identification - the state of the
writer identification and verification using textural and art. Pattern recognition, 22(2):107–131.
allographic features. IEEE PAMI, 29(4):701–717. Popović, M. (2012). Qumran as scroll storehouse in times
Bulacu, M., Schomaker, L., and Brink, A. (2007). Text- of crisis? a comparative perspective on judaean desert
independent writer identification and verification on manuscript collections 1. Journal for the Study of Ju-
offline arabic handwriting. In ICDAR, volume 2, daism, 43(4-5):551–594.
pages 769–773. IEEE. Popović, M. (2015). The ancient ‘library’ of qumran be-
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei- tween urban and rural culture. In The Dead Sea Scrolls
Fei, L. (2009). ImageNet: A Large-Scale Hierarchical at Qumran and the Concept of a Library, pages 155–
Image Database. In CVPR. 167. Brill.
Fisher, R. A. (1925). Statistical methods for research work- Prasad, D. K., Quek, C., Leung, M. K., and Cho, S.-Y.
ers. Genesis Publishing Pvt Ltd. (2011). A parameter independent line fitting method.
Ha, T. M. and Bunke, H. (1997). Off-line, handwritten In ACPR, pages 441–445.
numeral recognition by perturbation method. IEEE Qi, X., Xiao, R., Li, C.-G., Qiao, Y., Guo, J., and Tang,
PAMI, 19(5):535–539. X. (2014). Pairwise rotation invariant co-occurrence
He, S., Samara, P., Burgers, J., and Schomaker, L. (2016). local binary pattern. IEEE PAMI, 36(11):2199–2213.
A multiple-label guided clustering algorithm for his- Schomaker, L. (2016). Design considerations for a large-
torical document dating and localization. IEEE Trans- scale image-based text search engine in historical
actions on Image Processing, 25(11):5252–5265. manuscript collections. it-Information Technology,
He, S. and Schomaker, L. (2014). Delta-n hinge: Rotation- 58(2):80–88.
invariant features for writer identification. In ICPR, Schomaker, L. and Bulacu, M. (2004). Automatic writer
pages 2023–2028. identification using connected-component contours
He, S. and Schomaker, L. (2016). Writer identification us- and edge-based features of uppercase western script.
ing curvature-free features. PR. IEEE PAMI, 26(6):787–798.
He, S. and Schomaker, L. (2017). Beyond ocr: Multi- Shor, P., Manfredi, M., Bearman, G. H., Marengo, E.,
faceted understanding of handwritten document char- Boydston, K., and Christens-Barry, W. A. (2014). The
acteristics. PR, 63:321–333. leon levy dead sea scrolls digital library: The digitiza-
He, S., Wiering, M., and Schomaker, L. (2015). Junction tion project of the dead sea scrolls. Journal of East-
detection in handwritten documents and its applica- ern Mediterranean Archaeology and Heritage Studies,
tion to writer identification. PR, 48(12):4036–4048. 2(2):71–89.

701
ICPRAM 2017 - 6th International Conference on Pattern Recognition Applications and Methods

Siddiqi, I. and Vincent, N. (2010). Text independent


writer recognition using redundant writing patterns
with contour-based orientation and curvature features.
PR, 43(11):3853–3865.
Sobel, I. (1990). An isotropic 3× 3 image gradient operator.
Machine Vision for three-demensional Sciences.
Stokes, P. A. (2015). Digital approaches to paleography
and book history: some challenges, present and future.
Frontiers in Digital Humanities, 2:5.
Tigchelaar, E. (2002). In search of the scribe of 1qs. In
Emanuel, pages 339–352. Brill.
Tigchelaar, E. (2010). Dead sea scrolls. In The Eerdmans
Dictionary of Early Judaism, pages 163–180. Eerd-
mans.
Van der Zant, T., Schomaker, L., and Haak, K. (2008).
Handwritten-word spotting using biologically in-
spired features. IEEE PAMI, 30(11):1945–1957.
Varga, T. and Bunke, H. (2003). Effects of training set
expansion in handwriting recognition using synthetic
data. In Proc. 11th Conf. of the Int. Graphonomics
Society, pages 200–203. Citeseer.
Yardeni, A. (2002). The book of Hebrew script: his-
tory, palaeography, script styles, calligraphy & de-
sign. Oak Knoll Pr.

702

Potrebbero piacerti anche