Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Improving Foreign Language Writing with Learner Analysis of Authentic Spanish Texts
Sarah Wroblewski
Abstract
Often the teaching of how to write genre, or text types, is not introduced to language learners
until the upper levels of proficiency, however, it can be a valuable skill at all levels. One
potential way to teach genre is through authentic model texts. This practitioner study uses corpus
tools, rubrics, and student reflections to analyze changes in student writing samples before and
Keywords: authentic texts, L2 writing, corpus analysis, learner corpora, second language
acquisition, International Baccalaureate, Diploma Program, genre, text type, practitioner
research
Improving Foreign Language Writing with Learner Analysis of Authentic Spanish Texts
Introduction
writing, is a key skill to be developed when learning a new language (Gambrell, Malloy,
Marinak, & Mazzoni, 2014). While the ability to speak in the new language is often a main goal
of many students, the ability to read and write allows for full expression of both receptive and
expressive skills and can contribute to the overall ability to communicate (Gambrell et al., 2014).
One aspect of literacy is that of genre, sometimes known as text types (Paltridge, 1996; Wingate,
2012). The exploration of genre is a valuable part of literacy, however, often it is not introduced
to its full extent until later in foreign language learning, sometimes even at the collegiate level, or
relegated to strictly academic texts (Mishan, 2005; Tardy, 2009). Authentic texts, or “model
texts” are one potential way to introduce students at the lower proficiency levels to the idea of
genre and assist them in producing texts of a similar type (Mishan, 2005). The question the
current practitioner study investigates is whether the reading and analysis of authentic texts of a
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 3
particular genre, in this case blogs, influences student writing of the same text type. This is
analyzed through student reflection, rubric-based structural evaluation, and a corpus analysis of
Literature Review
In this section, relevant literature from the linguistic, genre analysis, second-language
learning, and literacy fields are explored in greater detail as they pertain to the current study.
First, second language literacy instruction will be explored, particularly the traditional approach
to literacy as influenced by the ACTFL guidelines versus the alternative offered by the
International Baccalaureate (IB) program. Second is a survey of literature involving the reading
and writing of genre (and text type) and the value that this type of learning can have for students
within literacy instruction. Next is a discussion of the use of authentic texts within the second-
language classroom, including their utility in relation to teaching genre and literacy in general.
Fourth is a research-informed basis for the types of tasks and activities used within genre-based
literacy instruction based on authentic texts. Fifth, relevant literature of corpus analysis is
reviewed within the context of the analysis of learner writing samples for this study. Finally,
there is a short section explaining where this practitioner study fits in within the literature.
guidelines (2012) emphasize a focus on both reading and writing. The guidelines for writing
place particular emphasis on accuracy (control of grammar and vocabulary) and communication
(ability to convey a message) (ACTFL, 2012). When it comes to reading, the focus is on
comprehension of text on a variety of topics (ACTFL, 2012). While the ACTFL guidelines do
outline different types of text or genres to some extent with the use of terminology such as
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 4
“Presentational” versus “Interpersonal” texts, the idea of text type or genre as a whole is usually
not introduced until the upper levels of language learning (ACTFL, 2012). For example, the
upper proficiency levels on the ACTFL scale (i.e. Advanced and Superior) mention student
ability to “understand texts from many genres dealing with a wide range of subjects” for reading,
and the use of “appropriate conventions” for writing, while the lower proficiency levels (i.e.
Novice, Intermediate) are expected to focus on “short, non-complex texts” with “basic
information” (ACTFL, 2012). Besides mentioning some writing genres that characterize students
at the lower proficiency levels (i.e. simple messages, notes, bulletin boards, weather reports),
textual conventions such as audience or purpose are rarely mentioned (ACTFL, 2012). In
addition, the typical culmination of a student’s language learning career, the AP language exam,
may require comprehension of various text types or genres through reading, but does not require
students to write anything beyond an e-mail response (interpersonal writing), persuasive essay,
or narrative (presentational writing) (The College Board, 2019). While some aspects of text
conventions are mentioned, including “use of register”, they are not a main focus (The College
Board, 2015). As a result, many foreign language classrooms in the United States relegate these
more “complex texts”, or reading and writing of various genres, and the analysis or use of
components of discourse such as audience, register, and author purpose, to the upper levels of
language learning. This is in stark contrast to language learning within the International
Baccalaureate (IB) program, an educational program adopted by many schools around the world
(International Baccalaureate Organization (IB), 2019). Their curriculum places particular focus
on the comprehension and production of various text types as well as recognition and use of text
conventions such as audience, message, and purpose (IB, 2019). One program of the IB, the
Middle Years Program (MYP), includes aspects of text conventions and text type as early as
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 5
Phase 1 and Phase 2, the lowest on their language proficiency scale (IB, 2014). On their
language acquisition rubrics for reading, this inclusion is particularly clear: “Strand ii: Student
clearly recognizes basic conventions including aspects of format and style, and author’s purpose
for writing” (IB, 2014). This focus on text type and text conventions culminates in the Diploma
Program (DP) Language B course and exams, the IB equivalent to AP language exams, where
text types are central to the aims of literacy. The quote below, from the updated Language B
guide section describing the written portion of the IB exam (Paper 1), demonstrates this:
The aim of this component is to assess the ability to communicate in writing for a variety
need to show, through their use of text type, register and style, that they understand the
concepts of audience, context, purpose, meaning and variation, and can apply them to
The IB also mentions that “teachers should provide frequent opportunities for students to
understand and use a variety of text types” (IB, 2018, p. 21). The DP Language B Guide includes
a list of the text types they expect students to master, divided into three categories: personal,
professional, and mass media, as shown in Figure 1 (IB, 2018). It is clear that the idea of genre,
what the IB calls text type, is central to language learning within the IB program. The IB even
has various publications with the explicit intention of helping students learn text types
(Arriagada, 2012). The data analyzed in this study comes from students at an IB school, where
students are expected to maintain a focus on text types throughout their language learning. The
question becomes: what is the value of the learning of genre or text type when it comes to
literacy in the second language classroom, and how is it best taught to students?
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 6
Genre has many definitions. Paltridge (1996) mentions a few, including that of Swales
(1990), one of the main researchers of genre. He states that genre is “a class of communicative
events, the members of which share some set of communicative purposes which are recognized
by the expert members of the parent discourse community” (Swales, 1990 as cited by Paltridge
1996, p. 238). Examples of genres are advertisements, news reports, letters, brochures, resumés,
or recipes (Paltridge, 1996; Tardy, 2009). The IB uses the term “text type” in the DP Language B
guide, but it is somewhat unclear on whether this term is interchangeable with genre (IB, 2018).
Based off of the types of text that are listed (see Figure 1) and the language used in the guide, the
IB Language B program defines text types as similar to if not the same as genre. Paltridge (1996)
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 7
points out that there are many researchers that use the terms interchangeably, however he also
discusses the difference between the two. He defines text type more as what the text is doing, or
the rhetorical organization; for example, anecdote, description, exposition, procedure, or recount
(p. 239). When the definition of text type is defined in this way, “more than one genre may share
the same [text] type…equally, a single genre, such as formal letters, may be associated with
more than one text type” (Paltridge, 1996, p. 239). In other words, one e-mail may involve a
problem-solution text type while another may include a description, or the same e-mail could
include both a procedure and description (Paltridge, 1996). Even though text type as a separate
entity can be useful for teachers (see Paltridge, 1996 or Lee, 2001 for additional sources to
explore this), due to the fact that IB defines text type in the same way as genre and most teachers
are more familiar with the idea of genre, for simplicity’s sake the terms will be used
interchangeably throughout this research paper. The preferred definition used here comes from
Martínez-Lirola’s (2015) discussion of genre theory: “Texts that share the same purpose in the
culture and have many of the same obligatory and optional features are called genres or text
Oftentimes genre is viewed from the perspective of learning how to structure a text; for
example, including a title, an introduction, the body text, and a closing (Tardy, 2009). This is
sometimes known as “formal knowledge” of a genre (Tardy, 2009). However, when viewed from
a discursive and rhetorical perspective, which is touched on in Swales’ (1990) definition when he
mentions “communicative purposes” and the “parent discourse community”, genre knowledge
becomes more than just the form of a text (Tardy, 2009, p. 20). Tardy states that “learning to use
genres requires much more than learning text types and forms, it requires learning the social
contexts, actions, and goals that give genres their meaning” (Tardy, 2009, p. 12). The structural
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 8
elements, or formal knowledge, of a genre are still important, but genre knowledge includes
other aspects of knowledge as well: the intended audience of the text, context of the text
distribution, style and register, and even the content (Tardy, 2009). Tardy (2009, p. 21-22)
community
In learning genre, students are not just learning how to write a text based on the structural
elements or learning the content, but immersing themselves into the culture and language of that
genre. This is a major goal in literacy in general; not just learning to read and write, but also
“understanding discourse and the processes by which it is created and interpreted… gateways to
learning new, alternative ways of organizing thought and expression” (Kern, 2008, p. 374).
Genre approaches also allow students to learn the social and historical contexts of texts
(Martinez-Lirola, 2015). In this way, acquiring genre knowledge can be incredibly valuable for
students in order to reach intercultural as well as linguistic goals. Kern (2008) summarizes this
point:
Texts… offer more than something to talk about (that is, content for the sake of
practicing language). They offer students the chance to position themselves in relation to
distinct viewpoints and distinct cultures. They give students the chance to make
connections between grammar, discourse, and meaning, between language and content,
between language and culture, and between another culture and their own – in short,
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 9
making them aware of the webs, rather than strands, of meaning in human
The IB Language B guide discusses the value in learning text type/genre as well:
The guiding principle for using texts in the DP language acquisition courses is to develop
students’ receptive, productive and interactive skills in the target language by focusing
their attention on the ways in which good communicators consider the audience, context
and purpose of what they want to say or write in the process of choosing and developing
Learning genres/text types helps students to develop their language and communication skills,
make deeper connections with cultural communities, and make connections between language
and culture. In addition, learning to write and read through the perspective of genre better
approximates the type of literacy students will need to use outside of the classroom (Tardy, 2009;
Martínez-Lirola, 2015). Some may argue that learning genre or text type in order to make
connections to the cultures of a language community is not possible until the AP or DP level of
language learning. Kern (2008) argues against this, pointing out that without practice in the early
levels, students will be unable to “gradually develop the skills and sensibilities that will allow
them to succeed in dealing with texts by the time they get to advanced level courses” (p. 375).
This is especially important for IB teachers in preparing their students for the Diploma Program
language courses with their intense use of text types. What is the best way to teach genre/text
type at the lower levels of language learning? One way to teach students genre is through “genre
study, or immersion within a certain genre”, which will be examined in the next section
(Bromley, 2014).
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 10
Genre study, or immersion, is often done through “the use of authentic texts as models…
that are successful to accomplish communicative ends in a determined context… [they] introduce
students to the main text types, to their functions and to their textual and grammatical
characteristics” (Martinez-Lirola, 2015, p. 61). Using authentic models in the genre allows for
exposure to “vocabulary and structures that are actually used” and students can focus on the
language that is common in the genre they will write (Polio, 2016, p. 19). The reading of
authentic texts has long been championed within the language learning community, going all the
idea that learners should receive input just above their level in order to learn a language
(Krashen, 1981, as cited in Mishan, 2005). Within general language literacy research, learning
genre through authentic texts used for authentic purposes is linked to better growth in reading
comprehension (Duke & Martin, 2014). In second language literature, findings show that using
authentic texts is incredibly motivating for students because they allow them to see real samples
of real language in action (Gilmore, 2007; Mishan, 2005; Polio, 2016). Authentic texts also help
students develop autonomy and confidence in their language learning skills as they “stimulate
learners to further develop independent discovery and learning” (Mishan, 2005, p. 10). However,
what is an authentic text? Mishan (2005) quotes the definition of Morrow (1997), which is often
used in paraphrase by second language practitioners: “An authentic text is a stretch of real
language, produced by a real speaker or writer for a real audience and designed to convey a real
message of some sort” (Morrow, 1997, p. 13 as cited in Mishan, 2005, p. 11). Mishan (2005)
points out some issues raised by educators when deciding to use authentic texts. First, the
question of whether to simplify or “dumb-down” authentic texts often comes into play when
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 11
considering using them with lower-proficiency second language learners. If a text is too difficult,
students can become frustrated by the vocabulary and language but if it is too simple they may
become annoyed with the lack of content (Welbourn, 2009). In other words, students would be
outside of their ideal learning conditions; outside of what they are able to do on their own with
minimal support, their “zone of proximal development”, the term coined by the psychologist
Vygotsky (Mishan, 2005). Either way, student motivation, a key factor in literacy learning,
suffers (Guthrie, 2014). Teachers have to make informed decisions when choosing and
potentially modifying authentic texts. Some studies show that simplifying authentic texts, or
editing them by changing the vocabulary to make it easier, is not helpful: “far from assisting our
learners, by ‘simplifying’ and shortening texts, we risk not only eliminating elements crucial to
comprehension but also generally impoverishing learners’ input” (Mishan, 2005, p. 24). Other
studies on text modification have mixed results, meaning this method is still up for debate
(Gilmore, 2007). Still, many teachers of lower proficiency learners see the difficulties of
authentic texts as an obstacle rather than a challenge that learners can work to overcome
(Mishan, 2005, p. 61). As Tomlinson (1994) says, there is a need “to respect, stimulate and
challenge the lower levels” (p. 4, as cited in Mishan, 2005, p. 75). To help make authentic texts
more accessible for lower proficiency students, instead of trying to make authentic texts easier to
read, the “texts can be made accessible to learners… by adjusting the demands of the task
involving them (Mishan, 2005, p. 61, emphasis in original). Choosing tasks appropriate to not
only the learner and their level of proficiency but the texts themselves becomes the most
important question for language teachers, with both reading and writing (Gilmore, 2007; Mishan,
2005). As Arnold (1991) states, “‘Use of authentic materials does not imply that tasks will be
authentic… it is what trainees or students DO that counts’” (p. 238, as cited in Mishan, 2005, p.
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 12
16). Designing appropriate tasks to be used with authentic texts in order to teach genre will be a
Students learn to write when given models, clear expectations, room to make mistakes,
feedback, and realistic practice (Bromley, 2014, p. 290). When looking at genre, the approach is
similar. Ellis (2004) presents one model for teaching and learning genre, which he takes from
Martin (1999). In this process, there are three stages: deconstruction, joint construction, and
independent construction (Ellis, 2004, p. 211; see Figure 2 below). Students start by analyzing
model texts from the genre for “content, structure, and language features to provide insight about
the genre and register to the students” (Ellis, 2004, p. 211). Next students jointly construct their
writing together through preparation of the genre text, and finally they produce the text on their
Figure 2: Genre-based teaching and learning model (Martin, 1999, p. 131 as cited in Ellis,
2004, p. 212)
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 13
The first step, deconstruction of text, often involves students reading and analyzing authentic
models of that text type. Some teachers may worry that using models is too much like copying,
but studies have shown that students can use them effectively and they help improve aspects of
writing (Tardy, 2009, p. 49-50; p. 96-97). One example is Henry & Roseberry (1998), where
students received genre-based instruction with model texts. Students that used and learned from
the models had higher gains on writing than those that had not (Henry & Roseberry, 1998, as
cited in Tardy, 2009). “The texts to which they were exposed played a very important role in
developing their knowledge of an unfamiliar genre” (Tardy, 2009, p. 50). These models give
students a stronger understanding of structure and form of the text, and the styles of language
used within that genre (Tardy, 2009). However, input does not always lead to uptake or
processing of that input in a meaningful way without drawing attention to it (Loewen, 2015). The
types of reading and writing activities used with model texts should include some form of
awareness-raising or noticing of text conventions and language (Loewen, 2015; Mishan, 2005).
These consciousness-raising activities may include comparisons between different texts of the
same genre, annotation of the text, or looking at form and language of specific sentences or
passages (Tardy, 2009; Fisher & Frey, 2014). Tardy (2009) also supports the ideas of Swales by
advocating for more explicit treatment of genre: “Activities that ask students to explicitly
understanding of the linguistic and sociorhetorical dimensions of those genres” (p. 102). This
also fits within the general research for reading comprehension with the integration of explicitly
teaching text structures being particularly helpful for struggling learners (Duke, Pearson,
Strachan & Billman, 2011, p. 68). Devitt, Bawarshi, & Reiff (2003) outline some steps in
explicitly analyzing genre: gathering samples, studying the context and situation (i.e. writers,
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 14
readers, why it’s used), identifying patterns in the language and form, and finally analyzing how
In addition to these types of more explicit activities, it is important that reading tasks
remain consistent with the text as far as authenticity (Mishan, 2005). Authentic tasks based on
texts should try to mimic the same communicative purpose as the text itself, be similar to real-
life tasks related to that text type, and involve some communication between learners (Mishan,
2005, p. 75). For example, it makes little sense for students to closely analyze a newspaper
article for specific vocabulary or minute details when most native readers simply skim or read
for main ideas in real life when they look at newspaper stories and then relate what they read to a
friend (Mishan, 2005). There are different types of communicative purposes for texts:
2005, p. 78). Each different communicate purpose best lends itself to different authentic tasks
that imitate what native speakers do when reading or deconstructing a text (Mishan, 2005). For
example, when reading a more instructional blog (i.e. eat healthier with these tips!), a more
authentic task would be reacting to the text or making a response rather than extracting specific
information (Mishan, 2005). When looking at different genres, teachers need to keep the
communicative purposes of that text type in mind when they design activities to engage with the
model texts of that particular genre (see Mishan, 2005, for a full list).
When it comes to the second and third steps in the process of genre-based teaching and
learning, joint construction and independent construction, students are still using model texts but
are more focused on writing (Ellis, 2004). Joint construction involves scaffolding for students to
help them learn how to write that text type on their own, potentially through the use of graphic
organizers or peer feedback (Ellis, 2004; Tardy, 2009; Duke et al., 2011). Model texts also play
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 15
an important part in this stage as students base their own writing off of what they have analyzed.
The authentic models can be used as a springboard for their own writing, especially as it helps
them develop their own interlanguage systems (Ortega, 2009; Gilmore, 2007). This building of
text together fits well with the sociocultural theories of second language acquisition as students
work to jointly create texts with the teacher or peers within their Zone of Proximal Development-
just beyond what they can do on their own (Lantolf, 2012). In addition to group construction of
text, feedback is essential to improve writing but can be incredibly time-consuming for teachers,
especially with large classes or longer writing samples (Bromley, 2014; Polio, 2016). An
alternative is peer feedback, which is usually only effective when students are given very
specific assignment guidelines (Polio, 2016, p. 23). Still, peer feedback can help build learner
autonomy, is more learner-centered, and involves more interaction (Polio, 2016; Benson, 2012;
Mackey, Abbuhl & Gass, 2012). Especially at the lower levels, peer feedback on language
accuracy can lead to more errors or even an avoidance of errors in the future (Polio, 2016).
Instead, a focus on meaning, overall text structure, or a specific language structure (one at a
time) can be helpful (Polio, 2016). Also, in order for feedback to be effective on writing,
“students need to do something with the feedback” (Polio, 2016, p. 24). Examples include having
students rewrite, write about the feedback they received, or keep a sort of “error log” where they
keep track of their errors (Polio, 2016, p. 26). Other alternatives to individual student feedback
include self-correction with a checklist or providing feedback to all students at the same time
after a general overview of all student writing (Polio, 2016, p. 26-27). Scaffolding through model
texts, feedback, peer work, and authentic task activities can help students eventually reach the
third stage of genre-based teaching: writing in the genre independently. These learner writing
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 16
samples of genre can be analyzed for growth or change in various ways. For this particular
practitioner study, corpus analysis was the chosen medium for pre and post writing analysis.
Corpus Analysis
Corpus analysis has recently become a more intense focus of world language teachers
due to its utility in materials development and teaching methods (Xu, 2016). Corpus analysis
involves examining the language of a large collection of texts, also known as a corpus (corpora
in the plural) (Weisser, 2016). “Any collection of texts that has been systematically assembled in
order to investigate one or more linguistic phenomena can be termed a corpus”, however the
texts are usually authentic and consist of written and/or spoken language (i.e. books, news
articles, radio broadcasts) (Weisser, 2016, p. 23). One example of a large Spanish corpora is El
spoken or written words from authentic texts (Davies, 2019). Most corpora are computerized,
allowing for easy analysis through various online and downloadable tools. The major advantage
to using corpora is the ability to look at how language is actually used, particularly the patterns
of words that occur together, phrasing and register, and the frequency of vocabulary or phrases.
Frequency dictionaries are one example of materials that are developed as a result of corpus
analysis (Xu, 2016). These resources show the actual most frequently used words in a language,
helping teachers to develop materials based on these common words to allow students to acquire
the most vital words for communication first (Xu, 2016). The “Super Seven” or seven most
(TeachingComprehensibly, 2019). In other words, corpus analysis can help teachers and students
to “achieve a more realistic learning experience that is at least a little closer to language
investigating different words forms in their ‘natural’ contexts” (Weisser, 2016, p. 80). When it
comes to investigating text types and genre, “corpus linguistic tools and methods have been
instrumental in establishing the links between formulaic language and text structuring” (Paquot
& Granger, 2012, p. 134). The analysis of sample texts of a particular genre, even in a smaller
corpus, can help create links between textual patterns in language and their social context (Tardy,
2009). Corpus linguistics can identify language used to organize texts based on various rhetorical
functions (i.e. introducing a topic, conclusions, summary…) as well as common vocabulary and
phrases for that genre (Paquot & Granger, 2012). Corpus analysis as it pertains to teaching text
type or genre is frequently used in English for Academic Purposes (EAP), but is not as common
for investigation of text types or genre in other languages (Paquot & Granger, 2012).
In second language acquisition, the use of learner corpora is a recent development which
is still not widely used (Gilquin, Granger, & Paquot, 2007). Learner corpora are “electronic
collections of texts produced by foreign or second language (L2) learners” (Paquot & Granger,
2012, p. 130). These learner corpora create opportunities for exploration of L2 writing and
speaking and give researchers and practitioners a window into what is going on inside of
learner’s brains as they acquire a new language (Xu, 2016). More specifically, learner corpora
can be used to look at L2 writer or speaker errors, comparison of use to various vocabulary or
phrase lists, comparison to expert or novice native texts (Contrastive Interlanguage Analysis),
and in the case of data-driven learning (DDL), students can utilize corpora to explore their own
language use in comparison to native texts (Gilquin, Granger & Paquot, 2007; Mishan, 2006; Xu,
2016). Contrastive Interlanguage Analysis (CIA) is a popular method that often involves two
types of comparison: “comparisons of learner language and one or more native speaker reference
corpora (L2 vs. L1), and comparisons of different varieties of learner language (L2 vs. L2)”
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 18
(Gilquin, Granger & Paquot, 2007, 322). In this study, a form of CIA is used to compare learner
writing to native writing (model texts) to investigate the native authentic writing of a specific
text type/genre. This comparison shows the goal that learners are striving for in “expert” texts
(Gilquin, Granger, & Paquot, 2007). It is also used to compare learner writing before exposure to
authentic texts (T1) and after (T2). Corpus analysis tools permit investigation of a large
changes over time. One critique of learner corpora is the limited authenticity of the data due to
the artificial nature of the task (Xu, 2016, p. 47). However, all classroom writing is essentially
unnatural due to the setting. If writing is under “authentic classroom circumstances” and includes
continual discourse for a specific purpose rather than disconnected sentences, it is better able to
fulfill the rhetorical view of genre (Tardy, 2009, p. 12; Xu, 2016, p. 47). In addition, by giving
one task to a group of learners, even though “the learner corpora are rather small… the learner
texts have the same topic as the prompt… thus making it possible to control for the influence of
the topic on lexical choice” (Paquot & Granger, 2012, p. 132). Corpus analysis also means that
learner texts can be examined for use of language borrowed or adapted from the model texts of
that genre, fitting with the idea that when learning how to write in a genre, students may “borrow
explicit textual fragments” and look at “support genres… to learn how to communicate
genre is that it can “highlight linguistic features [and]… patterns that are used by… learners but
which are not (or very rarely) found in native professional writing” (Gilquin, Granger, & Paquot,
2007, p. 327). As a result, educators can use this comparative data to inform their instruction on
what students need to become more “native-like” in their production of a particular genre.
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 19
When investigating genre and whether the analysis of authentic texts can influence
writing, the text type must be chosen wisely. The text type or genre introduced to students in this
study is blogs. A blog, short for “weblog”, “is an online journal or informational website
displaying information… it is a platform where a writer or even a group of writers share their
views on an individual subject” (Djuraskovic & FirstSiteGuide Team, 2018). The IB text types
textbook by Arriagada (2012) defines it as more of an online journal and states that it is “similar
to the personal diary format” (p. 54). However, blogs can serve a variety of purposes and some
can be more professional than others, depending on the topic (Djuraskovic & FirstSiteGuide
Team, 2018; Arriagada, 2012). Blogs were chosen as the text type in this study for a few reasons:
1. Many students already have some background knowledge of blogs from their own use
2. Blogs are extremely varied in their discourse. Some are informal, others formal, and
many include various purposes or intended audiences depending on the content and
1996; Mishan, 2005). This allows for more discourse analysis than some other, more
limited, text types/genres. Blogs include comments as well, which can offer an
3. Blogs are the only genre included in all three of the IB text type categories (personal,
professional, and mass media- see Figure 1), and they sometimes morph these
together or appear to move between them. For example, a blog by a doctor may
appear more professional in its trustworthiness and content, but could use the
4. Many blogs use simple language for the varied audience of online readers and
therefore are more appropriate for the reading level of lower-proficiency students
(Arriagada, 2012).
5. The written samples (T1 and T2) were conducted via Google Docs, making an online
medium such as blogs more appropriate because students have access to online tools
when creating their texts. In addition, more editing options are available when it
comes to the visuals that sometimes accompany blogs (i.e. bold text, different fonts
6. On the Internet there are a wide variety of blogs on the topic of health, the subject
matter covered within the Spanish II curriculum at the time of this research, and
students were already familiar with the base vocabulary necessary to understand a
7. Blogs on health often include the use of the informal tú (you) through the giving of
students had recently practiced in class through other contexts (i.e. giving directions
in the city). It is also the register they are most familiar with using in the Spanish I
This research study fills a void in existing research for a few reasons. First, most existing
studies of genre and particularly writing of genre are at the university or upper-levels of
proficiency, and many involve English (i.e. English for Academic Purposes) (see Tardy, 2009 for
one example). The same is true of corpus analysis of genre (Paquot & Granger, 2012). This study
focuses on the secondary setting in a lower-proficiency course for Spanish rather than upper-
levels in English. It is also an example of learner corpus analysis which is still not as common in
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 21
the field of second language acquisition or linguistics (Gilquin, Granger & Paquot, 2007). This
study involves Contrastive Interlanguage Analysis, which while common in learner corpus
studies, is not always used for both comparison of learner data at different points in time (T1 to
T2) and comparison to model authentic texts. In addition, the inclusion of structural rubric-based
data as well as learner reflections on the experience allow for consideration of not only the
language of the genre, but also the structural aspects of genre and the context of writing (Mishan,
2005). Finally, this study falls into the domain of practitioner research or problem-solving
research (Mitchell, 2009). This growing field of second language acquisition research is
important due to its strong relevance to what teachers are actually doing in the classroom and it
can help drive further research by raising new and important issues (Mitchell, 2009; De Graaff &
Housen, 2009). With teachers and other practitioners as active partners in the research process,
more problem solving can occur within the field and teachers can further develop their own
reflection skills through participation (Mitchell, 2009). One disadvantage of this type of research
is many factors and variables can come into play that may not be present during more
experimental laboratory studies (Mitchell, 2009). One could argue that although these additional
variables can confound results, they also reflect the reality of many classrooms that are often
complex and not at all like the controlled conditions in a laboratory or more structured research
formats. In fact, some argue that some second-language research findings may not always be
generalizable to the classroom context (De Graaff & Housen, 2009). Practitioner research can
help to fill this gap and investigate whether research findings make sense within the classroom
context. By focusing on a practitioner-relevant question (how to best teach genre and how using
authentic texts can affect student writing of genre), this study aims to investigate the teaching of
literacy using genre within the classroom context. This is of particular importance to language
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 22
types/genres in order to best prepare them for the Diploma Program Language B exams. By
looking at a real classroom context in a lower-level course at an IB school, the hope of this
researcher is to shed some light for language educators on potential ways for them to best teach
Research Questions
Based on the review of the literature on genre, authentic texts, literacy instruction, and
corpus analysis, the research questions for this practitioner study are as follows:
1) Does the analysis of authentic texts of a certain genre (blogs) influence lower
sample (T1), post-writing (T2) sample, rubric evaluation, and student reflection?
2) How does student writing compare to model texts (MT) of the same genre?
Methods
Participants
The participants in this study consist of high school students from two classes of foreign-
language learners (n=35 total); ages 14 through 18. The majority of students are freshmen with a
few upper classmen. The students are enrolled in a large public International Baccalaureate (IB)
high school in the Midwestern United States. All students are in the second semester of their
second year of Spanish language study and are from novice mid to intermediate low on the
throughout the school year (ACTFL, 2012). There is a wide variety in learner background with
some students coming from homes where additional languages are spoken and others with only
English as an L1, Spanish language heritage learners, gifted students, and still others with special
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 23
education designations. The classes are integrated, with some students having chosen an
accelerated curriculum (n= 25) and others a regular curriculum (n=10). The differences between
these designations are largely structural, with accelerated students learning slightly more
advanced vocabulary and grammar and with different rubrics for assessment (Phase 2 for regular
students on the IB MYP rubric scale vs. Phase 3 for accelerated) (IB, 2014). Students are
required to take two years of a language through the IB program requirements and graduation
requirements in the state of Michigan. Some students no longer take a language after completion
of the Spanish II course while others will continue on a traditional path (Spanish III, Spanish IV,
AP Spanish). Still others choose the full IB Diploma Program and take Spanish III then DP
Spanish B, a two-year course. While this study focuses on the IB text types, the literacy activities
Context
Within the Spanish classroom at this school, students do not use a textbook as a part of
their curriculum and units are thematic based on topic rather than grammar. The Spanish teachers
within the district determine the units with guidelines from ACTFL standards and IB
recommendations (ACTFL, 2012; IB, 2019). This study took place during the unit on health.
This unit covers healthy diet, exercise habits, visiting the doctor, injuries, and overall wellness.
This is a common and relevant theme across language classrooms, and is a topic on both the AP
Spanish language exam and is a sub-theme for the IB Language B exam (IB, 2018). Prior to the
research, students worked through various communicative activities throughout the unit,
including doctor/patient roleplays and interpretive activities with videos and info-graphs related
to health. Other, more traditional explicit activities were also a part of the unit with students
taking grammar notes on informal commands (the imperative mood) and the present progressive
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 24
tense. Students used these aspects of language out of context (i.e. rote practice with websites
recommendations for what to do when ill, analyzing pictures to say what people are doing). As
far as writing, students complete daily written warm-ups, sometimes with short answers and
paragraphs required. More writing practice is given via explicit worksheet practice and
communicative writing through information gap activities and interpersonal writing such as
messages to other students. Vocabulary is taught in a variety of ways, often through visual,
auditory, tactile, and kinesthetic input (i.e. playing Simón Dice or Simon Says to learn body
parts) and constant repetition of words through both input and student output. Students received
a base list of vocabulary for the health unit, tiered for the two populations (accelerated vs.
regular). Vocabulary lists are created through teacher collaboration, but usually without
consultation of corpus analysis and therefore do not always reflect the real-life usage of the
language (Ellis, 2009). For most units, students complete Integrated Performance Assessments
evaluated on the IB MYP rubrics for Criterion A (listening), B (reading), C (communication) and
D (accuracy) (IB, 2014). These are often district common assessments, meaning all teachers test
in the same or very similar ways. The text-analysis tasks completed in this study were somewhat
unfamiliar to students, but the reading and writing activities were similar in many ways to what
students were used to in class. The use of Google Docs as a medium for collecting writing
samples was also familiar to students, although few had used it before when typing in Spanish.
Class periods at the school are 50 minutes in length and occur every day of the school week (5
days). One day of the study did occur on an early release schedule of a 28-minute class instead of
Procedure
Participants for the study were chosen because they were already enrolled in the Spanish
II course taught by the researcher. Parents and guardians were informed of the study, and
completed consent forms sent to a third party (to avoid bias in class grades). 35 students
consented to participate out of the 41 students total in the two classes. Participants first
completed the T1 writing sample, asking them to write a blog in Spanish based on the topic of
health (see Appendix A for the prompt given) in class over two class days (approx. 100 minutes).
Initially a 50-minute time window (one class period) was anticipated, but many students
struggled to get started, were absent or on a field trip the first day, or did not write enough for
any sort of analysis, so the second day of class was added with more success. It was emphasized
to students that they should focus on the message of their writing and try to write as much as
possible rather than focusing on the accuracy of their writing. Students were not permitted to use
any notes, translators, or other tools, although they did have access to some of their chapter
vocabulary list and other basic grammar and vocabulary via classroom posters/visuals. The T1
writing sample was completed in a Google Document where students typed their responses and
submitted them to the Google Classroom page for the class. Typing in Google Docs was chosen
as a writing medium because using a computer can lead to better writing output for lower-
proficiency learners, it has an automatic spell check, and it results in easier upload to corpus
After the T1 writing, students completed various text analysis activities over the course of
five days in class (approx. 250 minutes, see Table 1). The first set of activities focused on
explicit genre analysis and were meant to raise awareness to the form, context, audience, and
purpose of the blog genre (Tardy, 2009). Prior to the reading and analysis of texts, students were
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 26
given a chart to record vocabulary they learned during their reading, personalized for their own
needs and desires of what they would like to learn or use for the future (see Appendix B). This
was not only a learner-driven activity but also research shows that by choosing their own
vocabulary students challenge themselves more and it can improve motivation and acquisition
(Benson, 2012; Duke & Martin, 2014). Students first received model texts with three different
blogs on the topic of health, chosen for their accessibility of language based on the vocabulary as
well as theme (see Appendix C for links to these blogs). Students explored the blogs online in
their original medium so they could experience some of the more unique aspects of the text that
do not show up on paper (i.e. links within the blog to other entries). The participants were placed
in groups of three based on proficiency level; one higher-proficiency (2AC) student with two low
to mid-proficiency (2 regular) students. This grouping was chosen as some research shows that
mixed groups can be helpful, while others show that lower proficiency students paired with
higher proficiency don’t communicate as much (Lowen, 2015). Each student received a different
blog, with the higher proficiency students receiving the longest to account for their overall
stronger levels of motivation in the face of challenge. Using a graphic organizer, each student
filled in information about the features and context of the genre based on their assigned model
text, including format, register, and language (Ponzio, 2017; see Appendix D for the organizer
students used). The following day students formed their groups and did a jigsaw activity, sharing
what they had learned about their blog and filling in the missing sections of their graphic
organizer based on the information from their groupmates. Printed copies of the blogs were
provided so students could point out key features they had noticed. The third day was joint
construction of a rubric to analyze the features of genre based on the 6 + 1 traits for writing
(Ponzio, 2017). Students first got into their groups of three again and created a rubric for what a
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 27
“typical” blog should look and sound like based on commonalities across all the blogs they had
read. Afterwards the whole class shared out their thoughts and created a blog rubric together
from all of their observations. This allowed for a large degree of learner autonomy and
interaction, although it should be noted that more of the discussion occurred in English rather
than Spanish (Benson, 2012). Students later received a typed copy of their finalized rubric
combining the final rubrics of both classes to be used for feedback and writing analysis (see
Appendix E). The fourth and fifth days of instruction, students analyzed a fitness blog in more
detail (see Appendix F). The blog introduction was analyzed in randomized small groups, with
students answering questions based on the communicative purpose of the text; to inform and give
instructions on fitness activities (Paltridge, 1996; Mishan, 2005). Questions involved grasping
the main idea and making a personal reaction to the text rather than looking at highly specific
details (Mishan, 2005). Some questions did help students notice contextual or linguistic aspects
of the text, such as asking them to point out already known or new vocabulary and asking for
their observations on voice/register (first, second, or third person; formal or informal). After
going over potential answers as a class, students independently analyzed the rest of the text
including the main sections of the body as well as the conclusion. Questions were similar to the
introduction analysis, based more on meaning rather than language or specific details.
Throughout this process participants were strongly encouraged to use their personal vocabulary
lists to keep track of any new words they had learned and wanted to use in the future. The final
day of text analysis involved a previous homework assignment. Students were asked to find a
Spanish-language blog online that involved health in some way but also on a topic that interested
them. This allowed students for more autonomy in learning and was meant to increase
motivation due to learner interest in authentic texts (Guthrie, 2014). The resulting list was posted
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 28
into a Google Document in Google Classroom where all students could access it. On the day this
assignment was due, students went into the document in class and explored the various blogs
their peers had found, gaining more exposure to the genre (see Appendix G for the list of blogs
generated by students). The list included a variety of topics on everything from mental health to
sports to healthy cooking, demonstrating the wide range of student inquiry. After finding a blog
of interest, either the one they posted or one that a peer posted, their assignment was to complete
similar analysis questions to the previous blog reading, then use a graphic organizer to begin
planning their own blog on the topic of health (see Appendix G, second page of linked
document).
The following week of instruction (approx.170 minutes, with time subtracted for other
required curricular activities including preparation for a speaking exam) students worked on
writing and peer editing (see Table 1 for instructional activity sequence). In pairs, students were
asked to use their graphic organizer, their personalized vocabulary lists, and the model texts
studied and found by peers to co-create a short blog (Ponzio, 2017; Mackey, Abbuhl & Gass,
2012). Pairs were determined based on similar proficiency, with a lower-proficiency student
working with an only slightly higher proficiency student when possible to increase
communication and avoid one student taking over when writing (Loewen, 2015). Each student in
a pair received a Chromebook to type in the same document. The use of Google Docs allowed
for monitoring of authorship of various sections through the editing history feature, which
prevented one partner from dominating over another (Ponzio, 2017). Students were encouraged
to use model texts as a basis for their writing, based on research that this sort of “patchwriting”
can be useful for novices to a genre (Tardy, 2009). Another advantage to Google Docs was the
grammar to help them as they wrote (Ponzio, 2017). After completing the co-written mini-blog,
once again in pairs, students peer edited each other’s blogs using the original rubric they created
as a class and a checklist with specific instructions for feedback (see Appendix H for the peer
editing instructions). Due to time constraints, some teacher feedback was given using the
“comments” and “suggestions” features of Google Docs as students were writing for live
feedback, but feedback was not given on every portion of all student writing after completion.
The final day of writing students were given the class period to look over their errors, feedback,
and reflect on them both (see within the document in Appendix H).
The T2 writing sample was conducted in the same manner as the T1 sample with students
receiving two full class periods (100 minutes) to write in Google Docs. The T2 prompt was the
same as the T1 prompt. Students were not allowed to use any notes or other online resources;
however, they were encouraged to study their personal vocabulary list and look over their
writing feedback and practices the night before. Finally, upon completion of the T2 writing,
students were asked to complete reflection questions via Google Forms (including an assent form
for collection of data) to think about their overall experience in the learning of a genre (see
Appendix I). Questions were both on a Likert scale and more open-ended to allow for some
person having analyzed a different blog and create a rubric for blogs
(Appendix D & E)
Homework: Find any blog on health on a topic that interests you,
post link in Google Classroom (due later in week)
Warm-up: correction of some selected errors from T1 writing
DAY 4 Whole-class discussion and creation of rubric (later both classes were
(50 minutes) combined into one rubric and distributed to students)
Read through introduction of another sample blog as a group, discuss
Warm-up: more correction of selected T1 errors
Stations of reading: blog model text divided into sections, students
rotate and read each, answer comprehension questions as they read
DAY 5
(Appendix F)
(50 minutes)
Additional activity: students working on personal dictionary/list of
words as they go, filling in words they read that they want to use
again in the future. Students were permitted to use WordReference
dictionary to help them define unknown words. (Appendix B)
Warm-up: students made their own comments in response to a
DAY 6 comment on a blog
(50 minutes) Continue reading stations of model blog
Homework due (find your own blog)
Warm-up: students made their own comments in response to a
comment on a blog
DAY 7 Student-found blogs activity: students answered comprehension
(50 minutes) questions and discussed blogs they found, could choose which to
read (Appendix G)
Writing preparation- graphic organizer (Appendix G)
Warm-up: students created their own sentences with their personal
vocabulary (words they found from the readings)
DAY 8 Speaking practice (unrelated to research)
(50 minutes) Writing practice: introduce partner blog writing project, assign
partners (Appendix H)
DAY 9 Writing practice: partners work together in Google Docs to write a
(50 minutes) blog using model texts, personal vocabulary, and any other resources
DAY 10 Continue partner writing practice
(28 minutes)
DAY 11 Continue partner writing practice
(50 minutes) When complete, peer review in partners (Appendix H)
T1 and T2 writing samples for students opting to participate in research were downloaded
to a secure password-protected drive and coded with random identification numbers stored in a
different location. Samples were stored both in their original form (Google Docs) to allow for
analysis of structure and formatting as well as Plain Text files (.txt) to allow for analysis using
corpus tools. The plain text files were edited to provide more accurate computer analysis.
Edits made to the writing samples are included in Table 2. Writing sample files were edited
using BBEdit free trial software (Bare Bones Software, Inc., 2019) and the TextEdit software on
the Macintosh operating system. The model texts that students analyzed during classroom
activities were also compiled into a corpus of plain text files. Many were downloaded from the
Internet (see list of links to blogs in Appendix G) as either HTML or plain text files. Edits to
model texts were minimal but included any noticeable spelling errors, removal of HTML coding
when appropriate, and some formatting removal (i.e. large amounts of space, advertisement text).
Titles, headers, and captions for photos were left intact for both student writing and model texts
For the analysis of both model and participant texts in corpus form, the free Voyant Tools
text analysis website was used (Sinclair & Rockwell, 2019) in addition to the tools available
through Laurence Anthony’s website, mainly AntConc (Anthony, 2019). The plain text files
were uploaded as separate corpora into these software tools for analysis and comparison- the T1
Results
Structural Data
To analyze the structure and layout of student produced texts in relation to model blog texts, the
rubric that students co-created during instruction (based on Ponzio, 2017) was used. Students
created these categories of the rubric after analyzing various blogs in groups to compile features
typical of most health blogs. This rubric is reproduced in Figure 3. The two rubric categories that
are under consideration in regards to structure are Presentación (presentation) and Organización
(organization). They consider structural aspects typical to blogs such as use of fonts, text, and
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 33
photos in addition to headers, titles, breaking the text into sections, etc. The total for these two
categories was out of 10 (5 each). Both the T1 writing sample and the T2 writing sample were
analyzed using the original Google Docs files (because the plain text files lose the formatting) to
look at structure. These were graded by the teacher / researcher. As you can see from the
numerical data below (Table 1), there was a huge difference in student text structure from T1 to
27 2 10 8 400.00%
28 2 9 7 350.00%
29 2 10 8 400.00%
30 2 10 8 400.00%
31 2 10 8 400.00%
32 2 10 8 400.00%
33 2 9 7 350.00%
34 2 10 8 400.00%
35 2 9 7 350.00%
AVG 2.37 9.17 6.8 314.52%
In the T1 writing samples, most students wrote their text in a paragraph format with little to no
structure. For T2, nearly all students divided their writing into sections, included titles and
headers as well as visual aspects such as photos and bullet points to organize information. This
made the texts much easier to read and they appeared more similar to the model texts (again, see
Appendix J for visual comparison), however it may have resulted in some interesting changes
across T1 to T2 in the corpus analysis which are discussed in the next section.
The T1 and T2 corpora were first analyzed individually using Voyant Tools (Sinclair &
Rockwell, 2019) to examine number of words (length of texts), type token ratio, vocabulary
density, and average words per sentence. These factors were chosen to show any changes (or
gain/loss) between T1 and T2. The settings used for word definitions and token definitions were
the default settings in Voyant Tools, which cannot be changed. Data is visible in Table 4 and 5.
Based on the data, various trends emerged. First, students appeared to have written much more
on the T2 writing sample as compared to T1 both based on word count and words per sentence.
In addition, the type token ratio (vocabulary density) appears to have decreased from T1 to T2,
which was somewhat unexpected. To analyze this further, the amount of gain/loss (defined here
as change) was calculated for each category, along with percentage of change. A t-test was run
on the changes as well to determine significance of changes from T1 to T2. Statistical analyses
were calculated using the Excel Analysis add-on as well as the Social Science Statistics online
calculators (Stangroom, 2018). All significance values were set at 0.05. In looking at changes
from T1 to T2 word count (Table 6), students had a 71.27% increase, with a two-tailed t-test
value of t=11.023907 for a p-value of less than .00001, making it statistically significant.
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 38
This is especially interesting because both the T1 and T2 writing prompts encouraged students to
write between 250-500 words, with no change in wording between the two prompts. Students
wrote more on the T2 writing anyway, with all students showing positive change except for one
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 39
(#21 dropping from 122 words to 116). Some changes were especially dramatic, such as #15
showing 209.41% growth between the number of words written in T1 and T2.
Similar but less drastic increases are visible in the T1 to T2 data involving sentence length (Table
7- words per sentence), with a two-tailed t-test value of t=2.28017 and a p-value of p=.02899,
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 40
making it barely statistically significant (with significance set at .05). Students overall showed a
11.98% increase, however there were some writing excerpts that showed a decrease in sentence
length (i.e. #18 from 19.00 words to 14.13 words for a -4.87 or 25.61% loss). Others showed
more extreme positive growth (i.e. #34 from 6.95 words to 11.00 words for 4.05 or 58.33%
increase). One potential reason for the various decreases in sentence length could be the fact that
blogs structurally include titles, headers, lists, and bullet points to present information and these
features typically result in shorter “sentence” length. Students noted these textual features on
their co-created rubric and many utilized them in their T2 writing, but not on the T1 pre-write.
These textual features were not removed from data because of their importance in the structure
of the text type of blogs, but they may be influencing the data.
Unexpectedly, the type token ratio or vocabulary density was significantly lower on the T2
writing as compared to the T1 writing, with an overall drop of 14.31% (Table 8). On a two-tailed
t-test, t=-6.436948, with a highly significant p-value of <.00001. There are various potential
explanations for this, one being the use of titles, lists/bullets, and headers in the T2 blog writing
that were not present in the T1 writing. Use of titles and headers could result in some repetition
of words that are not present in simple paragraph writing without such textual features. Another
interesting data point emerges when looking at vocabulary density compared to the word count
and word sentence length. There were a few students that had increases in one but decreases in
another: for example, sample #34 showed a 58.33% increase in sentence length and a 150%
increase in word count between T1 and T2 but a 32.92% drop in vocabulary density. A
correlation was run in Excel Analysis between word count and vocabulary density to investigate
Table 9: Correlation between word count and Vocabulary Density (Type/Token ratio)
There is a negative correlation of -.61 between the word count and the vocabulary density
(“Ratio” in the table) for T1 and a negative correlation of -.46 between the word count and the
vocabulary density for T2. Using a significance value of .05 and analysis through Social Science
Statistics online calculators, both negative correlations are significant at a p-value of .000107 for
T1 and .005097 for T2 (Stangroom, 2018). The T2 correlation is weaker, but it still means that as
students wrote more, their vocabulary density decreased. This could simply be that as they wrote
more, they needed to use more repetitive function words to include more content (see the section
on vocabulary frequency). Another possibility is that in their attempts to write more (or perhaps
to include more information in titles, lists, etc.) they are including less varied vocabulary and
repeating the same vocabulary more frequently. When reading the texts, however, very few
students repeated content or information throughout the text, meaning any repetition was not just
writing the same or similar sentences over and over again. This is discussed more in the
vocabulary frequency section. Overall results for changes from T1 to T2 are compiled in Table
10 below.
T1 T2 Change % Change
The model text corpus, made up of the texts that students read in class, found online, and
generally all texts used for class activities between T1 and T2 (see Appendices D-G), were
analyzed individually using Voyant Tools (Sinclair & Rockwell, 2019). The number of texts was
33, similar to the 35 student-produced texts. Some texts that students found during the student-
found blog activity were removed from analysis, mainly due to their irrelevance (i.e. blogs listing
other health blogs). The texts were analyzed to examine number of words (length of texts), type
token ratio, vocabulary density, and average words per sentence (Table 11). These were the same
factors chosen for T1 and T2 analysis to allow for comparison (Table 12). The settings used for
word definitions and token definitions were the default settings in Voyant Tools.
T1 T2 MT
35 35 33
Total number of texts
6,910 11,261 28,610
Total words (TOKENS)
896 1,260 5,243
Total unique word forms (TYPES)
0.13 0.11 0.18
Type/Token Ratio (TOTAL)
0.49 0.42 0.44
Vocabulary density (AVG)
The model texts, although there are fewer texts overall, include a much higher number of words
and unique word forms in comparison to the student productions, which is to be expected
considering their proficiency level. Student writing for T2 did approach a more similar word
count, type count, and words per sentence as compared to the model texts. The vocabulary
density did decrease from T1 to T2, but both are not incredibly off from the MT data, with .13
overall for T1 (or .49) as compared to .18 (or .44) for the MT data and .11 overall for T2 (or .42)
as compared to the .18 (or .44) for the MT data. As suggested by the negative correlation
between word count and vocabulary density, the more that students wrote, while approaching a
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 45
more “native-like” word count, resulted in less varied vocabulary as compared to native texts.
This is somewhat unexpected based on the growth in other areas between T1 and T2, but does
make sense when considering potential factors such as the inclusion of textual elements like
titles, headers, and lists, or the fact that as students write more they must include more function
words in order to communicate the content. Still, more investigation is needed, particularly for
student production of text types with similar textual features like titles and lists.
Vocabulary Frequency
Vocabulary use in student samples and model texts was investigated further through
frequency analysis. AntConc was chosen for frequency wordlists due to the ability to better
control the stoplist (or lack thereof) manually, easier conversion into Excel and other tables, and
note that the definitions of words (tokens) may be different across Voyant and AntConc due to
the inflexibility of Voyant’s settings. Token definitions and Tool Preferences settings were set to
default in AntConc. A note that Range is the term used by AntConc to show how many texts in
which the word is present (i.e. range of 30 = the word is used in 30 out of 35 texts). Table 13
shows the frequency of the top 50 most frequent words in each corpus: T1, T2, and MT without a
stoplist, meaning both content and function words are included. There is some overlap across
texts, including typical function words like y (and), the preposition para (for), and pero (but) that
students learn in the lower levels of Spanish. Two words that were not in the top 50 in T1 but did
make the top 50 in T2 with overlap of the MT frequency list include ser (to be) and si (if). Si is
especially interesting because students have never explicitly learned it, and it often triggers more
complex tenses to talk about “if… then” clauses, such as conditional + imperfect subjunctive;
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 47
usually taught in the upper levels of Spanish. Si can also be used within the rhetorical function of
a health blog to tell individuals “if you ____ then you ____” when giving recommendations or
suggestions. This is investigated more in the recommendations section. Another function word
that is highly frequent in student writing is muy, the adverb meaning “very”. Using KWIC (key
word in context) within AntConc, MT texts use muy with a wide variety of adjectives including
poca (little), sencillo (simple), útil (useful), and distinta (distinct), to name just a few (see Figure
4). In T2 writing, however, students use muy in a similar fashion, but with a smaller variety of
adjectives. The most common adjectives paired with muy in T2 student writing are bien (good),
bueno (good), beneficio (incorrect, although they are trying to use beneficioso for beneficial),
(important) with over 50+ uses of muy importante. This overuse of importante will be discussed
Students are using the word muy in a similar way to the native texts, but their variety of
adjectives that it modifies is much less diverse. They also appear to have a lower vocabulary
when it comes to adverbs, using muy when the native texts use many more adverbs such as
especialmente (especially), completamente (completely), etc. Another point of note with the
frequency data overall is the increase in frequency of function words (i.e. y, es, de…) from T1 to
T2, which could be a contributing factor in the lower vocabulary density for T2 writing as word
count increased. Perhaps students needed more function words to include more content, and
Next, frequency lists were generated for content words only using a stoplist (see
Appendix K for the full stoplist from Text Tools, 2018). Table 14 shows the results for the top 50
Table 14: T1 & T2 & MT most frequent words (AntConc- content words, with stoplist)
There is a substantial amount of overlap in content words across texts. Nouns like ejercicio
(exercise), salud (health), cuerpo (body), vida (life), and agua (water) occur in all three, typical
of blogs about health. The informal 2nd person register is very common within the blog genre in
Spanish, as students observed when looking at model texts (see Figure 3 for the student-created
rubric). This is clear here for all three texts from terms such as tú (subject pronoun “you”), te
(reflexive pronoun or indirect object pronoun), and tu / tus (possessive adjective). One
interesting note is that the MT corpus uses the terms alimentos and alimentación to refer to food
(or nourishment), while students use the more basic term comida (food or meal). The words are
used in a similar manner, with phrases like alimentación/comida saludable (healthy food)
appearing in the T2 writing and MT texts. The same sort of simplified vocabulary can be seen
with sueño (sleep, dream) which is common in the MT corpus, and the verb dormir (used as a
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 50
noun- sleep), which is more simplistic and more common in the T1 and T2 corpora. In other
words, students are trying to express similar ideas but with less sophisticated vocabulary. It is
also interesting to point out that the word blog is much more prevalent in the T2 writing than the
T1 writing, with students including it in their titles (El blog de salud or El blog de bienestar –
“The health blog”) and more mentions in the comments students wrote to accompany their blogs
(Me gusta el blog- “I like the blog”), both features of the text type. Students appear to be very
In order to compare across texts with different total word counts, normalized frequencies
were calculated, dividing the frequency for the given word by the total number of words (with
the stoplist), and multiplying by 1000 (McEnery & Hardie, 2012, p. 50). In other words, this is
the adjusted frequency of that word per 1000 words in each text (McEnery & Hardie, 2012).
Table 15: Overlap between T1, T2, and MT content words, normalized frequency
The normalized frequencies for words based on the top 50 of the MT corpus are included in
Table 15. The use of some words increased from T1 to T2: for example, beneficios from .03 to
.31, closer to MT texts at .10, but slightly exaggerated. For other words it decreased: i.e. estás
from .52 down to .16, closer to the MT texts frequency of .09. A few words that were not used at
all in T1 writing but were acquired for T2, perhaps through the authentic texts, include hábitos,
alimentación, and dejar, although it should be noted that their use overall is minimal when you
look at range (how many students used them). Some students acquired this new vocabulary while
others did not. In general, the use of certain vocabulary is much, much higher for the T1 and T2
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 52
texts as compared to the MT corpus, with some numbers above 0 (i.e. cuerpo at 1.20), meaning
students are repeatedly using those words. Nearly all of these repetitive words come from the
student vocabulary lists for their health unit (saludable, cuerpo, salud, ejercicio, dieta, comer, &
comida), meaning that they were highly salient for students before the research study took place.
Many classroom activities were centered around their use, resulting in a large amount of prior
exposure. Students likely overused them due to their particular salience and relevance to the
topic of health. A positive takeaway is that even with less variety in vocabulary, most of the
words that students used at a high rate were also words of high frequency on the MT corpus.
Giving Recommendations
One common rhetorical feature of blogs, particularly those about health, is giving
recommendations or suggestions for the reader. Students noticed this in the model texts in
various ways. First, when creating the Voz (voice) section of their blog rubric, they point out the
use of the informal tú (you) in the model texts to refer to the reader (see the student-created
rubric in Figure 3). This could be one reason for the high frequency of both tú (pronoun, you)
and tu (possessive adjective, your) in student writing (refer back to Table 14 and 15). When
looking at the frequency of the subject pronoun tú (you) in AntConc, it is used to refer to the
reader in the MT corpus (i.e. ¿Y tú?), but not very frequently (8 times, range of 6- see figure 5).
Tú does not even make the top 50 most frequent words for the MT corpus, but it is in the top two
for both the T1 and T2 corpora. However, when searching the MT corpus for verbs with the
inflection of present tense 2nd person (tú) form, for example, quieres (you want), puedes (you
can), haces (you do), etc., the frequency within the MT corpus is higher (i.e. 16 for quieres, 39
for puedes, 12 for haces…). Using the regular expression (regex) of \w*(as|es)\b to search for
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 53
other verbs with the 2nd person inflection (as/es ending), there were many, many more1. Going
back to the pronoun tú, while it is infrequent in the MT corpus, in T2 writing tú is used 255
times, often paired with a verb (see a sample of the KWIC output in Figure 5).
This points to potential influence from the L1 for most of the participants in this study: English.
In English, verbs cannot be used without their pronouns in the present tense, yet you can do so in
Spanish. For example, “run” cannot be exchanged for “you run”, yet corres already means “you
run” in Spanish without the pronoun tú included. The inflection (es ending in this case) gives the
1
It should be noted that an exact count of all 2nd person verbs was not possible without a part of speech tagger,
however a manual count of the 3,118 hits was deemed too tedious and a glance at the data showed a large number of
2nd person verbs without further analysis
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 54
lexical information necessary to show it is referring to tú. Students may be overusing tú to refer
to the reader because of influence from English, although they are using it to perform a similar
function as compared to the MT texts. Instead of including tú, the MT texts are simply
conjugating the verb in the 2nd person and leaving the pronoun out. It’s also interesting to note
that students have some errors with tú, overusing the pronoun in the incorrect form (i.e. after a
preposition instead of ti- es importante por tú vs. es importante para ti), which could be creating
a higher frequency. The overuse of tú with verbs for L1 learners of English is something that
needs to be addressed in instruction so that students understand they can use the tú form of verbs
(or any form) in Spanish without the pronoun included, which will result in a closer
Another word with excessively high frequency is the possessive adjective (or determiner)
1.55 in the MT corpus. Tu is used before nouns to denote possession (i.e. your body), and
students use it frequently with a variety of nouns related to personal health (see Figure 6). First,
it should be noted that some errors in accents were visible in the data (i.e. using tu vs. tú), likely
due to researcher error when cleaning data. In addition, there are some examples of students
using tu instead of tus when referring to plural nouns, forgetting to change the possessive
adjective to agree in number. Both of these factors could be influencing the data. While the usage
of tu is similar in the MT corpus, the usage of the definite articles el and la (the) is not (see Table
16). Students may be using tu more frequently in front of nouns to try to address the reader rather
el 2.01 2.46
la 1.38 3.28
tu 2.19 0.69
than the definite articles, which are generally interchangeable in Spanish as long as the possessor
is clear. For example, with te lavas el pelo (you wash (the) hair), the reflexive pronoun te has
already made it clear that you’re washing your own hair so the definite article la can be used
instead of tu. Looking at the web/dialects Corpus del Español with over 2 billion words, in native
Spanish el (85,366,616) and la (76,803,014) are more frequent overall than tu (2,489,705)
(Davies, 2019). This is not the case with the T2 corpus. This is also a possible side effect of a
less nuanced sentence structure without pronouns, where the MT refers to the reader in a more
indirect way through the use of verb inflection or pronouns, for example with the T2 phrase
ejercicio es muy importante para tu salud (exercise if very important for your health) vs. the MT
phrase te proporcionará otros beneficios para la salud (it will supply you other benefits for (the)
health). Students are likely overusing tu in front of nouns because they are attempting to include
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 56
more references to the reader as they see in the model texts. Unfortunately, they do not yet have
the ability to use a more varied sentence structure including other pronouns or verb inflection
within the sentence. If they did, they could use more definite articles in front of nouns rather than
the possessive adjective tu, providing more indirect rather than direct reference to the reader.
Aside from references to the reader using tú or tu, the rhetorical function of
recommendations and suggestions was further investigated using typical phrases or words in
Spanish. Some of the words and phrases under investigation are in Table 17.
Spanish English
Obligation (present/conditional tense)
Deber (i.e. debes, deberías) + verb infinitive Should (i.e. you should)
Poder (i.e. puedes) + verb infinitive Can (i.e. you can)
Tener (que) (i.e. tienes (que)) + verb infinitive Have (i.e. you have (to))
Necesitar (i.e. necesitas) + verb infinitive Need (i.e. you need to)
Imperative mood (informal commands)
Positive commands (duerme, deja) Positive commands (sleep, stop)
Negative commands (no comas, no fumes) Informal commands (don’t eat, don’t smoke)
Impersonal Expressions
Es importante + infinitive It’s important to…
Es necesario + infinitive It’s necessary to…
Hay que + infinitive One must…
se + conjugated verb (se debe) Impersonal se (i.e. one should/must)
Subjunctive mood / “if” clauses / Complex tenses
Si (tú) (quieres)… If (you want)…
Es importante/necesario (que) + subjunctive verb It’s important/necessary (that) + subjunctive
Often in traditional Spanish curricula, students learn to give suggestions and recommendations
first through the present tense (you should sleep more), then the imperative mood (sleep more!),
then impersonal expressions (it’s important to sleep more), and finally the subjunctive mood and
complex tenses (if you were to sleep more then you would feel better / it is necessary that you
sleep more). At the language level of students in this study, they have explicitly been taught the
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 57
present tense options for the above function, the imperative mood (informal tú commands), and
some impersonal expressions, but no subjunctive. Phrases used to fulfill the rhetorical function
of suggestions or recommendations were investigated in all three corpora using the KWIC and
frequency features in AntConc. The present tense verbs of debes (you should) and puedes (you
can) were included due to their high frequency in T2 writing. The word importante (important),
which can signal impersonal expressions or subjunctive, was also highly frequent in T1 and T2
and therefore merited further analysis. Finally, the word si (if) was included, which was more
common in T2 as compared to T1 (refer back to Table 13 for frequency). For each of these
words, the KWIC (key word in context) tool was used in AntConc with settings at default (levels
at 1R, 2R, 3R) and search window of 20. The T2 and MT data were analyzed, leaving T1 out due
to the length of analysis. For the following figures showing KWIC data, it is only a selection of
Debes is frequent in the T2 texts, with 55 concordance hits and a range of 18. The sample in
Figure 7 shows the general use of this word; to give advice to the reader using very concrete,
simple, and health-related action verbs like comer or dormir (you should eat, you should sleep).
The usage of debes is much less in the MT texts at 10 concordance hits. The verbs that co-occur
are also fewer concrete actions, with terms like saber (to know) and tener (to have).
Interestingly, when searching the use of the more impersonal (or more formal) debe (he/she/one
should) in the model texts, there are more examples (i.e. no se debe comer menos) (see Figure 7).
This means that the model texts are still using the verb deber, just in other, more impersonal and
more formal ways and less frequently than the student texts. When looking at puedes (you can),
it is also more frequent in T2 than in the MT corpus, with a normalized frequency of 1.17 for T2
and 0.11 for MT. The usage is similar, with the more impersonal se puede showing up in the
model corpus like se debe, and T2 usage with simpler verbs. It’s also interesting to note the
repeated use of the pronoun tú with debes and puedes here in T2 writing, uncommon in native
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 59
Spanish but more common for those with English as an L1, as noted in the previous section.
Puedes is overall more common than debes, making the top 50 frequency of content words for
the MT corpus. One potential reason is that deber is a more forceful verb indicating more of a
command (debes- you should) versus the suggestion given with poder (puedes- you can) when
referring to the reader. Native writers wishing to refer to the reader and give suggestions in a
blog take the more nuanced route of puedes (you can) rather than a sense of obligation based on
personal opinion with debes (you should), but students likely don’t recognize the difference.
It’s also possible that native texts are using more impersonal expressions to give
suggestions and recommendations as compared to students. The most common version of this in
the T2 texts is importante (important). Students used importante mostly for phrases with a noun
as a subject (i.e. exercise is very important), which is how they are taught the adjective in the
previous Spanish 1 course (see Figure 9). Model texts did the same, but also added more correct
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 60
usage of the subjunctive in the impersonal expression es importante que (it’s important that).
impersonal expression (i.e. es importante tú – it’s importante you), but they did not use the
subjunctive. In addition, while the T2 texts use almost exclusively importante, the MT texts use
other impersonal expressions such as es necesario (it’s necessary- 13 hits), and hay que (one
must- 10 hits) that are essentially absent from student writing. Some students are attempting to
use impersonal expressions, albeit incorrectly, while others resort to the more basic sentence
Finally, it was noted previously that the word si became more frequent in T2 writing as
compared to T1 writing (refer back to Table 13). Si can be used to indicate possibility and
alternatives. In Spanish, it often triggers the future, conditional, or imperfect subjunctive with if
clauses (i.e. “if you eat more fruits, you will feel better” or “if one were to exercise more, one
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 61
would have more energy”). In T2 writing students appear to be using si with the present tense
and occasionally commands (i.e. si eres un adulto, debes dormir 6-8 horas- if you are an adult,
you should sleep 6-8 hours). The MT texts also use si in this manner, (i.e. si tu objetivo es bajar
there are more instances of complex sentence structures using future or conditional tense (i.e. si
tenemos un perro, seguro que le encantará…- if we have a dog, it’s certain that it will love…).
The overall higher usage of si in T2 could indicate influence from the model texts, even if
When looking at all of the data, students were still able to complete the essential
healthy habits; they just used more repetitive, less nuanced, and easier to remember vocabulary
and grammatical phrases in order to do so. For example, students used more expressions of
obligation like puedes and debes rather than impersonal expressions such as hay que, although
they did make attempts to use more complex sentences with si and es importante que. This
observation of more simplistic sentence structure in student writing fits very well with what
particular rhetorical function… they tend to rely on a few items only, which they use over
and over again, to the detriment of other, perhaps less salient expressions. (p. 328)
Reflection Data
Soon after the T2 writing sample was complete, students reflected on the experience
using Google Forms. The form itself included quantitative (Likert Scale) as well as qualitative
data in the form of open-ended questions (to view the form, see Appendix I).
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 62
4. My final Spanish writing sample was better than my first writing sample. 4.51 5
11. I felt more motivated reading authentic texts (made for native Spanish
3.66 3
speakers, by native Spanish speakers) than I do when reading from a textbook.
From the reflection data a few interesting points emerge. First, the majority of students believed
1) overall T2 writing improved compared to T1 writing; 2) The most substantial change was in
the structure of the text based on the models; and 3) they were applying vocabulary and grammar
learned from class in their writing. Areas where students did not show as much enthusiasm were
1) enjoying this type of practice; 2) belief that this was more helpful than grammar notes or a
contrast with the research showing that use of authentic texts can help motivate students, but
there were also many positive comments, as seen in the qualitative reflection questions. Students
were asked two questions, and their short answers were coded into 8 categories based on general
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 63
number of each type of response was recorded, and some sample comments are included in
Table 19. Sample comments included were chosen based on representativeness of the comments
Question 1. How did the process of reading many Question 2. What was difficult about the writing
authentic texts (blogs) in Spanish help you to improve tasks that asked you to write specific types of texts
your writing? Explain what you think you did well (or (blogs) in Spanish? What do you think might help
better than before) and how you felt about the task. you to do these writing tasks better?
It was helpful to see how the blogs were actually
structured and how they use the informal voice
I learned about how a blog should be formatted.
I think that my final blog was much more
cohesive.
I think readibg [sic] the blogs helped me
ubderstabd [sic] the layout and how to actually
Structure/
write and create a blog. Structure/ The difficult part was writing it in a blog
style/
I was… able to learn on how to organize my style/ style, but it was easy once we looked at
organization
information different [sic] and what suggestions organization blogs
(20
(2 comments)
comments) I can include when writing my blog. I think I did
better in organization and including more
information and suggestions that actually relate
to health.
It helps me write text more authentically and
real Other than just texts that I found in a
textbook, which is more basic and less
complex.it helped me form a better blog
I found it was tricky to meet the word
I think as time went on, I got better at writing
requirement even the second time, so I
and reaching the word limit.
had to find a lot more to say about the
different topics
Word count
Word count What was difficult was hitting the 250
(1
(4 comments) minimum word criterion, as I had written
comment) about what I wanted to write in less than
I was able to write a lot more than I could the minim [sic] requirement. I think a
before. good fix to this is either lower the
minimum word requirement or give us
more time to work on it.
I didn’t know some specific vocabulary
Vocabulary I think I did a good job of using a wider variety Vocabulary
that I wanted to use. However, I learned
(9 of vocab words, and could do more things with (11
more vocabulary when reading other
comments) the words comments)
blogs.
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 64
By reading more authentic texts in Spanish I was I also found that incorporating the new
able to expand my vocabulary and learn how to vocabulary and using my past
use different information knowledge was difficult.
I think that the most difficult thing for
me was not knowing all of the
This helped me to improve writing, because I vocabulary words that I would have used
learned many new words that I usually use in my in English.
writing that I didn’t know in Spanish. i found it difficult to remember certain
words and phrases that would help move
my blog along.
I could tell how native speakers actually talk and
how it’s different than texts made for people
learning spanish
Reading/ Reading/
I felt like the activity that we did with finding
writing/ writing/
our own Spanish blog was helpful in giving us I think that more practice with writing
authentic authentic
the authentic Spanish blog that we all needed to instead of just reading would help.
texts (7 texts (1
see.
comments) comment)
It allowed me to be immersed into spanish and
allowed me to better understand spanish and
how to write.
I had trouble making sure that what I
They gave me sentences I could use to improve
was writing was making sense both
and taught me new grammar
grammatically and Spanish-wise
The most difficult thing for me was
spelling and conjugating certain words. I
Grammar (4 Grammar (5 also struggled with accents. I think
comments) comments) practicing conjugating words will help
Reading authentic texts help me shape my me do these better.
sentences.
Nothing was that difficult, but
sometimes I needed to use the future
tense and couldn’t, so I had to find some
way around it.
Because the topic was narrow it was
Because it helped me know what to write about
hard to find a variety of vocab and my
and how to write it.
blog felt repetative [sic]
It was still difficult, because I had to
write a lot of facts about each topic, but
some were a little hard to write for
It gives you many ideas about what to write
Ideas/ Ideas/ (sleeping). What I could do to make this
topics (7 topics (14 a little easier, is to pick topics I can write
comments) comments) a lot about.
I think what was difficult was the fact
that I am not an expert in the subject and
i didn't know specific words in Spanish.
I had more topics to work from.
Thinking of things to write about.
Because some stuff I did not know how
to say in Spanish.
Stress/ I felt the task was a lot more stressful because Stress/ I guess I’m just saying that it would help
difficult/ we had so much time to write something that difficult/ if the unit was more exciting.
boring (2 needed to fit all the criteria given. But overall it boring (4 I don’t really like continuous reading,
comments) wasn’t extremely difficult. comments) it’s kinda hard to focus due to the fact
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 65
Looking at student comments, the most popular coded categories for the question about writing
improvement were structure / style / organization (20 comments), vocabulary (9 comments), and
reading / writing / authentic texts and ideas / topics (7 comments each). Comments were
especially positive in regards to structuring blogs after reading authentic texts. Referring back to
Tardy’s four aspects of genre knowledge, students appear to perceive the most growth in formal
knowledge of the structural elements, which fits with the structural rubric evaluation of T1 vs.
T2 writing (Tardy, 2009). For the question about what was difficult about writing, the most
popular category was ideas / topics (14 comments), with at least 4 students citing that they felt
their writing was “repetitive” because they couldn’t say what they wanted to or they found the
topic too narrow. After that came vocabulary (11 comments), with students voicing the difficulty
themselves in Spanish on the topic. This validates the quantitative data showing a lowering in
vocabulary density as students felt they did not have the vocabulary to write what they wanted to
write in their blogs for the topic. It also shows that students may still be lacking in subject matter
knowledge when it comes to the genre (Tardy, 2009). More focused vocabulary instruction could
have helped, as could have a different, more personal choice in writing topic, or even a change in
word count recommendations. Overall, based on student reflection data it appears that the
majority of students found the experience valuable, although perhaps not exciting. They
perceived the most growth in text structure and felt the least competent in varied vocabulary use,
Conclusion
Pedagogical Implications
Based on this study, there are various pedagogical implications for language teachers.
First, there were many characteristics of the study design, particularly during instruction, that
students (and the teacher) found useful. While it is impossible to tell if any one of these activities
or tools had a stronger influence over student writing due to the quasi-experimental nature of the
study, they still deserve consideration. Google Docs and Google Forms were very helpful for
both peer writing activities, teacher feedback, and submission of consent documentation and are
strongly recommended for other practitioners and researchers. In the reflection comments,
students mention enjoying the ability to search for their own blogs within their personal domain
of interest, which is one way for the teacher to find a large number of model texts without
spending hours, and also give students autonomy and increase motivation. The student genre
rubric creation based on the reading of authentic texts was also an excellent activity for students
to critically consider aspects of the texts they might not otherwise have noticed (Ponzio, 2017).
Still, teachers need to consider how the more simplistic student rubrics could increase saliency of
certain textual elements to influence writing, as may have occurred in this study with the overuse
of the tú form to refer to the reader. Lastly, the student personal vocabulary lists where they kept
track of words they read in the texts that they wanted to use in the future were useful, but likely
needed to be used more intentionally (i.e. students use X number of those words in their writing
As far as the results based on the corpus analysis data, students wrote much more after
the analysis of authentic texts, but their vocabulary density decreased as they wrote more.
Teachers need to consider how the chosen genre’s text structure can influence vocabulary
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 67
density, especially with a text type such as a blog where titles, subtitles, and lists are considered
typical of the genre. For Diploma Program teachers, working with students on creating textual
features such as lists without sacrificing quality and variety of vocabulary will need to be a high
priority to prepare for the Language B exam. In addition, student reflection comments mentioned
the difficultly of avoiding repetition due to the topic chosen and/or some students felt the topic of
health was boring. Practitioners working on writing genre with their students need to ensure the
topics chosen are of high interest to students to generate motivation. Student background
knowledge is also important to avoid repetition of vocabulary due to lack of information on the
topic. In the lower proficiency levels where students do not yet have a wide vocabulary for the
writing of informational texts, personal text types such as diary entries, social media posts, or
personal e-mails could be the best options to introduce the idea of genre because students already
know quite a bit about their most engaging subject- themselves! Finally, choosing a word limit
(or lack thereof) can be important. In this study, students were encouraged to write between 250-
500 words, yet they wrote much more on the T2 writing. In reflections comments, some
participants felt rushed and stretched by this word count while others felt they had too much time
to write. Still, students feeling that they must write more could lead to lower vocabulary
complexity as they attempt to reach the set number of words. The practitioner must consider
his/her goals when designing the writing prompts for genre practice and whether a word count is
appropriate. It is important to note that overall lower vocabulary density is expected at the lower
proficiency levels. This is clear when referring to the ACTFL proficiency guidelines (2012) for
students at the Intermediate Low level, the proficiency level for many of the learners in this
study:
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 68
Most sentences are recombinations of learned vocabulary and structures. These are short
and simple conversational-style sentences with basic word order. They are written almost
exclusively in present time. Writing tends to consist of a few simple sentences, often with
The “repetitive structure” of student writing at this proficiency level is typical, and teachers
should not be discouraged if student vocabulary density is well below that of model texts. Other
features can be considered as factors of student improvement, such as structure and rhetorical
device use. For example, in this study, students showed major growth in the area of structure and
formatting between T1 and T2, and students were able to complete the main rhetorical function
of health blogs in giving recommendations even if they did use more simplistic vocabulary and
grammar.
Another factor to consider when looking at student writing is influence from the L1 and
ensuring to pre-teach aspects of writing that may come under this influence. For this study,
students overused the word tú (you) with verbs to fulfill the rhetorical function of referencing the
reader, but it was likely due to influence from the L1 of English where “you” is necessary before
Vocabulary saliency is yet another aspect of writing that teachers should keep in mind. In
this study, students overused words from their chapter vocabulary list such as cuerpo and salud
at the expense of less salient vocabulary they could have learned from the authentic texts. The
vocabulary that is taught and used regularly in the classroom will likely show up more frequently
in writing, meaning teachers need to make informed decisions about what to include on
vocabulary lists and in classroom activities. In fact, the corpus tools themselves could be an
excellent way for teachers to analyze authentic texts to focus on what vocabulary and phrases to
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 69
teach within a specific genre or text type. In this study, students were not explicitly taught
vocabulary or phrases to help them to write within a specific genre, but instructor text analysis
ahead of time could allow for better instructional decisions when teaching a text type. In other
words, a vocabulary frequency analysis could be run on a corpus of authentic texts to see which
words need to be added to vocabulary lists or used during instruction, or an N-grams / lexical
bundles analysis could be used to look at which phrases are most common. For example, in this
study phrases such as hay que which were used to give suggestions in native texts could have
been pre-taught to students. To go beyond this, teachers could even have their students analyze
the texts themselves with data-driven learning, such as the work done by Kennedy & Miceli
(2001).
Finally, this study demonstrated that the formal knowledge of genre, or the structure of
texts, was greatly improved through the classroom tasks (Tardy, 2009). Still, students did not
always consider the rhetorical function of texts or other types of genre knowledge (Tardy, 2009).
Practitioners teaching writing of a genre may consider having students do other types of text
analysis activities to look at text distribution and how the chosen text type fits within the target
Most of the limitations of this study stem from the fact that it is a practitioner study and
took place within a real language classroom where various factors cannot be controlled. For
example, within the context of the instruction involved in his research students were pulled for
field trips, students fell ill and were absent, technology being used in the classroom was taken for
make-up standardized testing, etc. These sorts of interruptions and issues are familiar to any
teacher and can be the reason educators see a disconnect between controlled research studies and
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 70
the classroom. Individual differences can also place many confounding variables into play when
analyzing the findings. Differences such as age, aptitude, working memory, and learner
background can all have a large influence on language acquisition and could potentially skew
results (Loewen, 2015). These individual background variables were not analyzed as a part of
this study.
Another issue is the model authentic texts chosen for this study were chosen based on
teacher evaluation rather than any particular criteria and no corpus analysis on model texts was
run before the instruction took place. Future studies on genre could use corpus analysis of
authentic texts ahead of time to determine vocabulary for instruction, which could influence
outcomes. Data driven learning is another possibility for future practitioner research, in the vein
of Kennedy & Miceli where students use corpus analysis to look at their own writing and model
texts (2001). Further investigation of the potential lower vocabulary density with higher word
count in student writing could also be interesting, especially with similar text types. Finally, this
study did not look at learner errors in writing although they were plentiful and could have
resulted in noteworthy results. This could be another direction for future studies in learner corpus
analysis.
Another limitation of the current study is its lack of process-oriented tests, for example,
think-aloud tasks and retrospective interviews (De Graaff & Housen, 2009). These types of tests
allow researchers to investigate what participants are thinking during the learning process. This
study did include a reflection portion, but did not include any of these process-oriented tests due
to time constraints and also the added difficulty of a more complex consent process, however
When referring to other limitations, the corpus analysis for this study was conducted by
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 71
an amateur, as were the statistical analyses, meaning there is potential for error in results or
interpretation therein. More studies need to be conducted by experts for conclusive results.
Finally, more research needs to be done using corpus analysis for lower-proficiency learner
writing, particularly in the area of genre and using L2 other than English. There were very few
studies to use as a basis for the design of this one, making it difficult as a practitioner to make
informed study design decisions. More practitioners using corpus analysis tools to look at text
type and genre in other languages can only benefit future researchers, teachers, and students.
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 72
References
American Council on the Teaching of Foreign Languages (2012). ACTFL proficiency guidelines
proficiency-guidelines-2012/
http://www.laurenceanthony.net/software.html
Arriagada, I. E. (2012). Developing Skills for Text Types: A Guide for Students of Spanish.
https://www.barebones.com/products/bbedit/
Bromley, K. (2014). Best Practices in Teaching Writing. In L.B. Gambrell & L. Mandel-Morrow
(Eds.), Best Practices in Literacy Instruction (5th ed.), (pp. 343-364). New York: Guilford
Press.
Benson, P. (2012). Learner-Centered Teaching. In Burns, A. & Richards, J.C. (Eds.), The
Cambridge Guide to Pedagogy and Practice in Second Language Teaching, (pp. 30-38).
The College Board (2015). AP Spanish Language and Culture 2015 Scoring Guidelines.
The College Board (2019). AP Spanish Language and Culture: Course Overview. Retrieved from
https://apstudent.collegeboard.org/apcourse/ap-spanish-language
De Graaff, R. & Housen, A. (2009). Ch 38: Investigating the effects and effectiveness of L2
instruction. In C. Doughty & M. Long (Eds.), The handbook of language teaching (pp.
Devitt, A., Bawarshi, A., & Reiff, M. (2003). Materiality and Genre in the Study of Discourse
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 73
Duke, N.K. & Martin, N.M. (2014). Best Practices in Informational Text Comprehension
Duke, N., Pearson, P.D., Strachan, S. & Billman, A. (2011). The Essential Elements of Fostering
and Teaching Reading Comprehension. In S.J. Samuels & A.E. Farstrup (Eds.), What
Research Has to Say About Reading Instruction (4th ed.), (pp. 51-93). Newark: DE:
Ellis, R. (2004). Supporting genre-based literacy pedagogy with technology- the implications for
the framing and classification of the pedagogy. In L. Ravelli & R. Ellis (Eds.), Analysing
Ellis, N. (2009). Ch 9: Optimizing the input: Frequency and sampling in usage-based and form-
focused learning. In C. Doughty & M. Long (Eds.), The handbook of language teaching
https://www.corpusdelespanol.org/
Djuraskovic, O. & FirstSiteGuide Team (2018, April 8). What is a Blog? – The Definition of
https://www.thoughtco.com/giving-advice-spanish-3079435
Fisher, D. & Frey, N. (2014). Best Practices in Adolescent Literacy Instruction. In L.B. Gambrell
& L. Mandel-Morrow (Eds.), Best Practices in Literacy Instruction (5th ed.), (pp. 149-
Gambrell, L., Malloy, J., Marinak, B., & Mazzoni, S. (2014). Evidence-Based Best Practices in
Practices in Literacy Instruction (5th ed.), (pp. 3-36). New York: Guilford Press.
Gilmore, A. (2007). Authentic materials and authenticity in foreign language learning. Language
Gilquin, G., Granger, S., & Paquot, M. (2007). Learner corpora: The missing link in EAP
https://doi.org/10.1016/j.jeap.2007.09.007.
Guthrie, J. (2014). Best Practices in Motivating Students to Read. In L.B. Gambrell & L.
Mandel-Morrow (Eds.), Best Practices in Literacy Instruction (5th ed.), (pp. 61-82). New
https://www.spps.org/site/handlers/filedownload.ashx?moduleinstanceid=38371&dataid=
21214&FileName=language_acquisition_guide_2014.pdf
http://www.cosmopolitanschool.de/wp-content/uploads/2018/10/dp-language-B-
guide.pdf
https://dx.doi.org/10125/44567
A. Mackey (Eds.), The Routledge handbook of second language acquisition (pp. 57-72).
Lee, D. (2001). Genres, registers, text types, domain, and styles: Clarifying the concepts and
navigating a path through the BNC jungle. Language Learning & Technology, 5(3), 37-
72. https://dx.doi.org/10125/44565
Loewen, S. (2015). Introduction to instructed second language learning. New York: Routledge.
Mackey, A., Abbuhl, R., & Gass, S. (2012). Ch 1: Interactionist approach. In S. Gass & A.
Mackey (Eds.), The Routledge handbook of second language acquisition (pp. 7-23). New
York: Routledge.
Martínez-Lirola, M. (2015). Using Genre Theory for teaching writing at tertiary level: a practical
McEnery, T. & Hardie, A. (2012). Corpus linguistics: method, theory and practice. Cambridge:
Mishan, F. (2005). Designing authenticity into language learning materials. Bristol, UK;
Mitchell, R. (2009). Ch 36: Current trends in classroom research. In C. Doughty & M. Long
Long (Eds.), The handbook of language teaching (pp. 81-105). Singapore: Wiley-
Blackwell.
Paltridge, B. (1996). Genre, text type, and the language learning classroom, ELT Journal,
Amsterdam: Benjamins.
Paquot, M., & Granger, S. (2012). Formulaic language in learner corpora. Annual Review of
doi:http://dx.doi.org.proxy2.cl.msu.edu/10.1017/S0267190512000098
Polio, C. (2016). Teaching Second Language Writing. New York, NY: Routledge.
Ponzio, C. (2017, September 9). Seven Ways to Use Google Docs to Support Bilingual Student
google-docs-to-support-bilingual-student-writers
Sinclair, S., & Rockwell, G. (2019). Voyant Tools. Retrieved from https://voyant-tools.org/
https://www.socscistatistics.com/tests/
Tardy, C. M. (2009). Building genre knowledge. West Lafayette, IN: Parlor Press.
http://teachingcomprehensibly.com/super-seven/
https://countwordsfree.com/stopwords/spanish
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 77
Wingate, U. (2012). Using Academic Literacies and genre-based models for academic writing
instruction: A ‘literacy’ journey. Journal of English for Academic Purposes, 11, 26-37.
Welbourn, M. (2009). Approaching Authentic Texts in the Second Language Classroom: Some
from
https://www.academia.edu/10632377/Approaching_authentic_texts_in_the_second_langu
lan_classroom
Xu, Q. (2016). Application of Learner Corpora to Second Language Learning and Teaching: An
http://dx.doi.org/10.5539/elt.v9n8p46
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 78
Appendix A
Prompt: Create a typed blog post in which you discuss recommended aspects of
diet and/or exercise in order to maintain one's health.
Your goal should be to write between 250-500 words (check the word count above by
highlighting what you wrote and clicking Word Count under “Tools”). You will use
Google Docs to type below, and while you must type up your blog post, please do not
use any online resources to help you (i.e. translators, websites, dictionaries). Do not use
your notes either! You should use your own brain to write :)
(Crea una entrada de blog en lo que describes tus recomendaciones para la dieta y/o
ejercicio para mantener la salud. Debes escribir entre 250-500 palabras. Vas a usar
Google Docs, y aunque necesitas escribirlo a máquina, por favor no uses recursos del
internet como traductores, sitios web, diccionarios).
Nombre: Hora:
Appendix B
Appendix C
Romano, G. (n.d.). 5 hábitos saludables para bajar de peso. Tu Salud Total.com. [web log].
Como evitar lesiones musculares cuando haces deporte. (2015). Blog de Bienestar. [web log].
musculares-cuando-haces-deporte/
Reto Fitness (n.d.). Gimnasio en casa. Reto de 30 días Fitness. [web log]. Retrieved from
http://retode30dias1.blogspot.com/2015/04/gym-en-casa.html
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 81
Appendix D
Appendix E
Appendix F
Original source:
ejercicios-de-cardio-para-acompanar-el-gimnasio/
https://docs.google.com/document/d/1W12EmoQcWMt6D3zxx7gQJL-
ZnPoZ_kyUNPxUem9vlPI/edit?usp=sharing
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 84
Appendix G
https://docs.google.com/document/d/1BTrrLuAsKC2zv0p2LsIfXqWFHLuiSTkg7KG6UIpY_4A
/edit?usp=sharing
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 85
Appendix H
https://docs.google.com/document/d/10L98OOiEbctBWH1DVBQQ28N-
yiwcRzxzMMIGFOghh3A/edit?usp=sharing
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 86
Appendix I
A. Pick which is most true for you for each of the questions based on the scale below.
1 2 3 4 5
(not at all true for me) (very true for me)
4. My final Spanish writing sample was better than my first writing sample.
1 2 3 4 5
(not at all true for me) (very true for me)
5. The structure of my writing in Spanish changed as a result of the readings we did in class.
1 2 3 4 5
(not at all true for me) (very true for me)
1 2 3 4 5
(not at all true for me) (very true for me)
7. Reading these texts helped me with writing more than taking grammar notes or doing a
worksheet.
1 2 3 4 5
(not at all true for me) (very true for me)
8. I applied or included grammar (new verbs or new verb structures) I learned from the
readings in my final Spanish writing sample.
1 2 3 4 5
(not at all true for me) (very true for me)
9. The activities we did in class helped me to better understand the texts we read.
1 2 3 4 5
(not at all true for me) (very true for me)
10. I applied or included grammar or vocabulary I’ve learned earlier this year in my final
writing sample.
1 2 3 4 5
(not at all true for me) (very true for me)
11. I felt more motivated reading authentic texts (made for native Spanish speakers, by native
Spanish speakers) than I do when reading from a textbook.
1 2 3 4 5
(not at all true for me) (very true for me)
B. Answer in English:
1. How did the process of reading many authentic texts (blogs) in Spanish help you to
improve your writing? Explain what you think you did well (or better than before) and
how you felt about the task.
2. What was difficult about the writing tasks that asked you to write specific types of texts
(blogs) in Spanish? What do you think might help you to do these writing tasks better?
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 88
Appendix J
T1 (#23)
T2 (#23)
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 89
T1 (#27)
T2 (#27)
IMPROVING FOREIGN LANGUAGE WRITING WITH AUTHENTIC TEXTS 90
Appendix K
siendo trabajan ya
siete trabajar yo
sigue trabajas
siguiente trabajo
sin tras
sino trata
sobre través
sois tres
sola tuvo
solamente tuyo
solas última
solo últimas
sólo ultimo
solos último
somos últimos
son un
soy una
su unas
sus uno
tal unos
también usa
tampoco usais
tan usamos
tanto usan
tendrá usar
tendrán usas
teneis uso
tenemos usted
tener va
tenga vais
tengo valor
tenía vamos
tenido van
tercera varias
tiempo varios
tiene vaya
tienen veces
toda ver
todas verdad
todavía verdadera
todo verdadero
todos vez
total vosotras
trabaja vosotros
trabajais voy
trabajamos y