The Hypothetical: Between Classroom Lecture and Discussion

A perennial debate among teachers seasoned and new concerns how much lecture versus discussion to feature in class. The debate is especially relevant to teachers of liberal arts subjects, since the content of such courses is not always conducive to rote learning techniques. Liberal arts subjects often require the completion of previously assigned reading, and even if enough students read to engage in fruitful discussion there remains the risk of devolving into debate rather than dialogue. To complicate matters further, lectures and class discussions are frequently, and falsely, pitted against one another, viewed as binary opposites and sometimes filtered through a political lens which codes the former as conservative and the latter as progressive. Consider the image of a professor lecturing to a class of students dutifully taking notes versus a classroom of circled desks with students and teacher alike engaging in a dynamic conversation.

I believe there are more or less appropriate times and subjects for either lectures or discussions, and I don’t buy that either has a particularly salient political character; after all, even the teacher discussing texts in a circle still assigns grades at the end of the term. But there are challenges to both lectures and class discussions that I think frustrate new teachers in particular. Lectures aren’t especially engaging unless done well, which comes with time and practice. On the other hand, discussions can intimidate students to the point of non-participation, especially if the topic of discussion is particularly controversial. Also, the skill of facilitating and nourishing discussions is underrated and quite challenging, another pedagogical virtue that comes with time and practice. Additionally, I find neither mode particularly effective for beginning a class. Opening with a lecture (especially in the early morning) risks students nodding off to sleep, as their engagement with lectured material is entirely determined by their individual commitment. Conversely, jumping into a discussion right away often fails to take off, as any teacher can attest, since students can be reluctant to break the ice.

Enter the hypothetical. The hypothetical is exactly as it sounds: a contrived example or problem related to the day’s concepts is posed to students who must reason through its various frictions. Incorporating hypotheticals into liberal arts classes is particularly effective, as it occupies a middle ground between lecture and discussion, combines the best elements of both, and requires no prior reading. Like a lecture, a hypothetical allows the instructor discretion in guiding students more or less forcefully towards the concepts intended for learning. And like a discussion, it engages students’ creative and critical thinking muscles, prompting them to respond and participate. Hypotheticals are a great icebreaker to boot; completing the reading is not required to participate in the deliberation of the hypothetical, as its contrived parameters present enough content for participation. It also doesn’t risk intimidating students from offering controversial opinions since its hypothetical nature provides a comfortable space for intellectual exploration.

Here is one hypothetical I offer students in one of my first-year composition courses:

Here’s what typically happens when I pose it. The majority of students almost immediately answer “no,” which allows me to oppose the class consensus to encourage deeper thinking. For instance, I point out that only a 1% chance of a wrongful conviction is quite good, especially considering current estimates put the US wrongful conviction rate somewhere between 2-10%. I then ask which kind of evidence would persuade them to convict, to which they usually respond “eyewitness testimony,” “DNA evidence,” and “video footage.” All of which, I suggest, is likely less than 99% accurate. So what gives, I press.

I then switch to another hypothetical, this time involving blood pressure medication, to which the entire class always answers “yes.” (As a side note, a great technique is to juxtapose two structurally-similar hypotheticals that nevertheless induce students to opposite conclusions; the cognitive dissonance is generative.)

“So what’s the difference?” I ask. Students usually reason many of the differences pretty well, in my experience: The stakes are different for sending an innocent person to prison compared to leaving blood pressure untreated; the former involves deciding someone else’s fate and the latter your own; and there’s a matter of “trust” when it comes to doctors that feels distinct from the consequentialism of the judicial system.

The larger point I make is this: The American judicial system (in theory, at least) does not consider the probability of committing a crime as admissible evidence, knowledge of guilt. (Thank God Minority Report is just a movie.) Even if there’s only a 1% chance, you are presumed innocent until proven guilty. In American medical science, however, the probability of a drug’s effectiveness, inferred from carefully controlled experimental trials, is considered knowledge. In fact, inferring the effectiveness of medical treatments from samples is the only way to know anything in medicine at all; it is impossible to “witness” or testify to, at least rigorously and systematically, the evidence of medical efficacy. These hypotheticals therefore elegantly demonstrate the difference between two types of information: observational knowledge, obtained and verified through experience, and probabilistic belief, inferred through studies of manipulated samples.

In the larger context of the course, we are discussing the nature of knowledge, how it is we can say we know something. The goal of this class period is to demonstrate how knowledge is context-dependent; what counts as knowledge in a medical trial is not what counts as knowledge in a court of law. Does that mean knowledge is whatever we say it is? No, it means that certain domains (medicine, law, science) have developed their own rules that govern how knowledge in that domain is understood and counted. Awareness that there are differences in how we derive knowledge is a concept I discuss in my first-year composition courses because I believe the idea is essential to understand before reading and writing academic research at the college level (your mileage may vary). This hypothetical helps students to intuit the primary takeaway of the lesson, the constructed nature of knowledge, without me lecturing at them about it.

The sum of everything

Do you all ever think about infinity? It’s one of those things unique to human cognition and thus sort of unavoidable, like object permanence or episodic foresight. We are forced to confront the infinite as a matter of course. Yet to ponder the infinite is paradoxical, for thinking of it limits the limitless.

The concept of infinity comes out of mathematics of course, but I think it transcends the disciplinary borders typically ascribed to it. Infinity has more in common with faith and religion than math. The infinite and God both contain similar qualities of omnipotence, and like the ontological argument for the existence of God, if we can conceive of infinity in our minds–a watered down version–then it must exist in a purer form elsewhere. To imagine the infinite implies that an even more infinite infinity exists outside of us.

Outside of math class, infinity is historically contingent, repackaged by different cultures to reflect the specific historical conditions surrounding them. In Ancient Greece, when thinkers were just beginning to grapple with the concept, infinity was first inferred from the banal. The infinite presented itself in such mundane tasks as walking from point A to B, a distance with infinite halfway points between, or in the shapeless water of a river, infinitely turning in on itself so that one can “never step in the same river twice.”

For us today, the infinite reveals itself in myriad other ways, consumer choices chief among them. I often find myself mindlessly scrolling through Netflix’s seemingly infinite offerings, squandering two hours of my day. I am similarly paralyzed by the infinite consumer choices aggregated on Amazon or on display at grocery stores, so much so I often don’t purchase whatever it was I needed in the first place. I also experience a flavor of the infinite when I think about voting. While election votes are not technically infinite, there are enough of them relative to the one of mine that political outcomes feel infinitely distant and detached from my singular blue vote cast in a red state, a snowflake in an avalanche as a popular analogy goes.

While there are important mathematical operations involving infinity–I mean, it’s a crucial part of calculus, after all–I think infinity has more impact conceptually than arithmetically. The infinite is, in other words, more rhetorical than numerical. Like a drinking glass, it gives shape to the aqueous nature of existing in an infinitely expanding and timeless universe; it names a sensation commonly felt among individuals living infinitely singular lives in a godless world of billions. Many abstractions short circuit our cognitive console, but naming them cools us down. We can’t really experience the purely infinite, but we can name it. We can know it’s out there.

Knowing something exists is a prerequisite for ignoring it. And maybe it’s time to start ignoring the infinite. Calculus has taught us many things, but perhaps most importantly it resolves part of infinity’s paradoxical nature. We now know that the infinite is only one side of the equation. You can divide a number in half infinitely, but add all those divisions up and you get back to the whole number you started with. In our culture and our politics and our lives, we are too fixated on one side of the equation—the side with infinite divisions. The other side remains a self contained unit, a finite value that the infinitesimally small divisions must add up to, even if there are an infinite number of them. I think we all stand to benefit from a greater focus on the whole rather than the infinitely divided parts.

We will become robots before robots become us

Who STAR TREK's Data Was, and Where He Is Now - Nerdist

Part of the overall argument of my dissertation is that the true threat of automated technology, AI, robots, etc., is not necessarily that robots are becoming dangerously human-like and that we might soon face a Matrix-style machine revolution, but rather that by increasing our interaction with automated, robotic technology we become more robotic.

I have never felt this sensation more acutely than when trying to call any kind of customer service line these days. It’s almost impossible to talk to an actual human person without first wading through a byzantine series of automated voiceover machine prompts. “Please describe your problem in a few words,” demands an affectless voice. I then proceed to ramble incoherently and in a totally unnatural cadence that results in the machine saying “I’m sorry, I don’t understand.” It’s maddening not only because you’re not solving your problem but also because you don’t feel or even sound like yourself. Seriously: next time you’re interacting with a robot voiceover, pay attention to the way you speak. It sounds more like you’re typing words into a search bar than anything resembling human dialogue; it is devoid of grammatical connector words, just a series of keywords rambled off in a frustrated tone. Never mind that the reason for the call in the first place is not easily summarized in simple sentences. Interestingly, a problem I can’t solve by Googling and that requires a phone call somehow never seems to fit into the simplistic categories that the robotic voice asks me to fit it into. Quelle surprise. But understanding poor descriptions of complex problems that defy easy categorization is something humans are very good at!

Speaking with an automated phone directory literally brings us down to the robot’s discursive level and forces us to talk and communicate like a robot does, because as humans our default social mode is to maintain interlocutor equilibrium. We subconsciously find and convert ourselves to the least common denominator in a communicative exchange. But robots can’t change their communicative register, so the only way to achieve mutual intelligibility with a robot is if we ourselves speak robotically. I find the whole experience of automated customer service incredibly frustrating and illustrative of the real threat automation poses, at least in the short term: It’s not that the robots will become like us, it’s that we will become like the robots. And of course, once we have normalized robotic human behavior, it will be much easier for differences between man and machine to cease to exist.

Who are the wise?

The Thinker — About UofL — The Thinker, University of Louisville

Where do we turn for wisdom in contemporary culture? Eclipsed by science, religion no longer wields the authority it once did. Our captains of industry offer nothing in the way of wisdom, as their station in life is not relatable to most people due to their unfathomable wealth. Companies offer no wisdom, only pandering and half-baked apologies for their involvement in various scandals. Celebrities are no more wise than us since they have been exposed, thanks to social media, as just as boring and sad as we are. Who holds wisdom today?

There is something: the wisdom of the crowd. In many ways, crowd wisdom has come to fill the void previously occupied by the authorial wisdom of political leaders and philosophers. In 2020, the “wisdom of the crowd” means the internet. And it’s true: the internet is full of wisdom. However methodologically flawed polling can be, I ultimately find the internet’s ability to aggregate public opinion on a topic and quantify it along a consensus scale a useful (and beautifully modern) kind of wisdom.

It’s important to distinguish wisdom from knowledge. Take user review metrics. Metrics like a movie’s Rotten Tomato score or a restaurant’s average Yelp! star count offer us data of a certain variety. We can’t say these metrics constitute knowledge, however. If the new Batman movie has a 99% rating on Rotten Tomatoes, do we know it is good? In most contexts, knowledge requires firsthand experience or observation. But if I scour dozens of product reviews on Amazon before I purchase a new keyboard, I am equipped with some kind of information about the product I didn’t have before. This information isn’t “knowledge,” but rather wisdom, or judgment, about how to navigate a world oversaturated with too many movies and restaurants and gizmos and gadgets, and the internet does it well.

Yet, this is practical wisdom. The practical wisdom offered by the internet is all good for the low-stakes struggles of discovering new movies and locating hole in the wall restaurants, but what about questions of greater significance? Where, today, might we turn for wisdom about deeper human quandaries? What about existential wisdom? Who has advice on how to live authentically during a once-in-a-century pandemic? How do we derive meaning from life when the entire US west coast is on fire? How should we live in a country whose democratic institutions are crumbling? All the gods are dead, after all, and our president is Donald Trump. Authorities have been demoted. No one is steering the ship.

To be sure, the wisdom of the crowd (read: the internet) has proven capable of addressing some of our anxieties, and could potentially solve future problems. Wikipedia, a testament to the power of collective action and an example dispositive of the omnipotence of the profit motive, functions as a kind of existential pain balm in its radical democratization of knowledge. But the Wikipedia model seems to me the exception and not the rule of the internet. In most other online ventures, the way that crowd wisdom is aggregated and repackaged in order to sell stuff or attract clicks deserves scrutiny.

The practical wisdom of the crowd is also by its nature anonymous, and therefore limited in its ability to address questions of existential significance. The existential wisdom I and I’m sure many others crave right now requires a personal, if individual, character. Crowd wisdom, on the other hand, is depersonalized, extracted by averaging together the opinions of many, meaning that its very process minimizes the impact of individual outlier data points; crowd wisdom is a resistant statistic.

That’s all good, and perhaps even preferential, for yielding an average movie score, but diluting existential wisdom by averaging it neutralizes one person’s wisdom with another’s thoughtlessness. And the challenges facing America today are anything but average; America is quite literally an outlier statistic in many of the most important metrics: gun deaths, covid-19 cases, per-capita health insurance spending, etc. It seems to me that a country determined to persist as an outlier may require outlier wisdom as a counterbalance. At any rate, sound judgment, prudence, and virtue—the ingredients of wisdom—are not traits easily expressed in aggregate.

A lot of things feel like they’re nearing their end these days: the US west coast, in-person education, democracy, the American dream. We know this. But knowledge simply won’t suffice, and neither will aggregate and practical crowd wisdom. Existential wisdom is what we need in such times. We already know how bad things are; the question is what will we do about them?

Dissertation excerpt: the politics of writing education

From Chapter 5: The Age of Automation in The Android English Teacher: Writing Education in the Age of Automation

The unique challenge of teaching writing serves as an instructive test case for education writ large. Writing embodies a series of paradoxes. It is both a science and an art, a technical skillset and a creative outlet. It is essential to every academic field—the medium through which scholarship transpires—yet its teaching is treated as a mere service to the university. Writing is inherently social, a communicative act between author and audience, and also a process that unfolds for long stretches in solitude. Writing is heavily mediated by technology, but also a fundamentally human endeavor. Finally, writing—and especially its teaching—is simultaneously progressive and conservative, associated with both radical expressivist disciplines and a traditional, prescriptivist and civics-oriented education. One of the main challenges for liberal arts scholars and writing educators is successfully balancing these contradictions, which become heightened in the age of automation.

An automated writing education fails to strike a balance between these divisions as human teachers do, and instead aligns with one side in each. With automated writing education, writing is only a science, a skill, and a nonsocial act; it is a lifeless interaction between a writer and a preprogrammed algorithm, a rote reproduction of conventions to be marked right or wrong. At root, writing is not concerned with being “right” or “wrong,” but with effective communication. The automation of writing education reduces the nuance of negotiating effective communication between author and audience to a formulaic transmission of agreed-upon conventions between a word manager and a machine. What portends to happen to writing education in the age of automation is not simply a change in the way writing is taught, but a redefining of what writing is.

Whether automated writing education will come to be defined as politically progressive or conservative remains to be seen. As higher education itself undergoes a redefinition amid public health pandemics and technological progress, its political valence has grown more significant. Pew Research (Parker, 2019) has shown for years a growing partisan divide in views of higher education (Figure 6). Choice of academic majors and course content, as well as the value of a degree, have become politically charged in a way they never have before. Writing occupies a peculiar dual position within this politicization: conservatives believe writing is an essential component of education and decry college students’ alleged declining writing ability, yet simultaneously view the very departments that teach writing as part of a domineering leftwing culture on campuses.

As universities and colleges grapple with remote and virtual learning configurations in the coming years, I fear these growing political fault lines will become ammunition in those debates. Educational technology companies may enlist progressive political rhetoric to push their products, and arguments about virtual learning or automated educational technology may end up being more about political allegiances than the pedagogical effectiveness of the tools. In the event of educational technology companies—sensing an opportunity to get their foot in the door on campuses—invoking progressive political rhetoric to sell their products, we should think critically about the pedagogical repercussions of employing virtual, and potentially automated, educational products and services independent of such rhetoric as best we can.

Impressive-sounding claims of academic personalization and customization, combined with a progressive framing of pandemic-related social distancing and educational “access,” will continue to escalate as education turns more and more virtual. My great fear is that out of a commanding paranoia of being perceived “conservative,” liberal-minded educators will thoughtlessly accept, even advocate for, corporate-led education reforms that are nominally and symbolically progressive but deeply and structurally reactionary. Many of the arguments that preserve our autonomy as writing educators have the potential to sound conservative, and perhaps some of them even are conservative in a definitional sense. “Conservatism” has become so radioactive that people forget there are many things worth “conserving”; I believe the in-person teaching of writing is worth conserving, for instance.

We must not fear being perceived as conservative if we push back against that rhetoric. In fact, much of our jobs and livelihoods as educators revolve around conserving certain elements of the current educational model that would be irreparably disrupted by the unilateral welcoming of endless technocratic reform. If we are so afraid of being perceived as conservative that we align with nominally progressive educational reforms that beget reactionary consequences, that could give technology companies with empty progressive branding the power to significantly redefine for us what higher education looks like.

A theory of educational relativity

The theory of classical relativity understands all motion as relative. No object moves absolutely; an object moves relative only to the motion of other objects. The same can be said about much of learning and education: our educational growth is frequently measured relative to that of others–our classmates, coworkers, friends, family, and so on.

Relative educational growth recalls a concept central to test theory I’ve discussed before: norm referenced assessment. When norm referencing academic achievement, individual students are compared relative to one another and to overall group averages, which are other objects in motion. Norm referenced assessment differs from criterion referenced assessment, which measures individual growth relative to established objective criteria, stationary objects; that is, for criterion referencing, performance relative to peers doesn’t matter. Think of a driving test: you either pass or not, but passing doesn’t depend on you scoring better than your neighbor, but rather on you meeting the state-established criteria required for licensure.

As it were, I would argue most of educational assessment consists of broad norm referencing often masquerading as criterion referencing. As far as I’m concerned this is not really good or bad. “Masquerading” has negative connotations, of course, but I believe the masquerade is less deliberate than inevitable. Any teacher will tell you it’s really really hard not to compare their students to one another, even if subconsciously. Try reading a stack of 20 papers and not keeping an unofficial mental rank of performance.

Although norm referencing students relative to their peers’ class performance is somewhat inevitable, I think with careful attention (and a little statistics) our assessments can prioritize a superior norm referenced comparison than that of students to their peers: the comparison between the student and themselves.

Comparing a student’s performance to themselves recalls the familiar growth vs. proficiency debate in education circles, which our current Secretary of Education is infamously ignorant about. Basically, the argument is that schools should assess growth and not proficiency, since not all students are afforded the same resources and there is incredible variation in individual academic ability and talent. Because not all students start at the same place they therefore cannot all be expected to meet the same proficiency criteria. I agree. (Incidentally, this is why No Child Left Behind utterly failed, since it was predicated on the concept of all children meeting uniform proficiency criteria.)

One way to prioritize the assessment of growth over proficiency in a writing class is to use z-scores (a standardized unit of measurement) to measure how many standard deviations students are growing by during each assignment. Writing classes are particularly conducive to such measures since most writing assignments are naturally administered as “pre” and “post” tests, or, more commonly, rough and final drafts. Such an assignment design allows for growth to be easily captured, since a student provides two reference points for a teacher to assess.

By calculating the whole class’s mean score difference (μ) from rough to final draft, subtracting that number from an individual student’s rough and final draft score difference (x), and dividing by the standard deviation of the class score difference (σ), you obtain an individual z-score for each student, which tells you how many standard deviations their improvement (or decline) from rough to final draft represents.

Why do all this? Why not simply look at each student’s improvement from rough to final draft? Because we should expect some nominal amount of growth given the assignment design of a rough and final draft, so not all improvement could actually be improvement in such a context. Calculating a z-score controls for overall class growth, so a nominal improvement in scores from rough to final draft can be interpreted in the context of an expected, quantified amount.

To assess for an individual student’s growth relative to themselves, then, you can calculate these individual z-scores for each assignment and compare the z-scores for a single student across all assignments, regardless of differing assignment scales or values. This provides a simple way to look at a somewhat controlled (relative to observed class growth) measure of growth for an individual student relative to themselves over the course of the semester. In this way, we are better able to see more carefully the often imperceptible educational “motion” of our students relative to themselves and to peers.

If we transfer what we learn, then we need a map

This past weekend I traveled to the College English Association 2018 conference in St. Pete, Florida, to give a talk about “learning transfer.” Learning transfer, often simply “transfer” in education literature, is the idea that when we talk about education broadly and learning something specifically, what we really mean is the ability to transfer knowledge learned in one context (the classroom, e.g.) to another (the office, e.g.). It’s the idea that we have only really learned something when we can successfully move the learned knowledge into a new context.

As far as theories of learning go, I think transfer is fairly mainstream and intuitive. Of course, the particular metaphor of “transfer” has both affordances and limitations, as all metaphors do. Some critics offer “generalization” or “abstraction” as more appropriate metaphors, and there might be a case to be made for those. But as long as the theory of transfer prevails I think we first need to get some things straight. This is what my talk was about.

If learning is concerned with transferring, and thus moving, between two places like the classroom and the office, then we must have a map to help us navigate that move. In the humanities, the literature on transfer disproportionately focuses on the vehicle of transfer, not a map to guide us through landscape the vehicle traverses. The vehicle is of course literacy–reading and writing. We painstakingly focus on honing and developing the most perfect and best kinds of writing prompts and essay projects, as well as assigning the most thought-provoking reading material. And this is good. If we try to move, or transfer, from point A to B but our vehicle is broken or old or slow or unfit for the terrain, we can’t go anywhere. We like to say that our students don’t just learn to write, but they write to learn as well. Literacy is a vehicle for all knowledge. All disciplines understand this. It’s why we write papers in all our classes, not just English.

But no matter how luxurious our transfer vehicles (our writing assignments and reading requirements) are, if we don’t know where we’re going, how to get there, or what obstacles lie in our way, then it doesn’t much matter. So how do we map the abstract terrain of education and cognition? Here are two simple statistics that can help us: R-Squared and Effect Size, or Cohen’s d.

R-Squared

The R-Squared statistic is commonly reported when using regression analysis. At its simplest, the R-Squared stat is a proportion (that is, a number between 0-1) that tells you how much of the variance, or change, in one quantitative variable can be explained by another. An example: suppose you’re a teacher who assigns four major papers each semester, and you’re interested in which of the essays can best predict, or explain, student performance in the class overall. For this, you’d regress student performances on one of the assignments on their final course grades. Some percentage of the variance in your Y variable (final course grades) will be explained by performance in your X variable (one of your assignments). This can give you a (rough) idea of which of your assignments most comprehensively tests the outcomes for which your whole course is designed to measure. (R-Squared proportions are often low but still instructive.)

If we extend the transfer/map metaphor, R-Squared is like the highways on a map–it can help us find more or less efficient routes, or combinations of routes, to get where we want to go.

Effect Size, or Cohen’s d

Effect Size is a great statistic, because its units are standard deviations. This means Effect Sizes can be reported and compared across studies. Effect Sizes are thus often used in Meta-Analyses, one of the most powerful research techniques we have. At the risk of oversimplification, Effect Size is important because it takes statistical significance one step further. Many fret over whether a result in an experiment, like the test of an educational intervention, is statistically significant. This just means: if we observe a difference between the control and experimental group, what is the likelihood that the difference is simply due to chance? If the likelihood falls below a certain threshold (which we arbitrarily set, usually 5%, but that’s a different discussion), then we say the result is statistically significant and very likely real and not due to chance. However, some difference being statistically real doesn’t tell us much else about it, like how big of a difference it is. This is where Effect Sizes come in.

An Effect Size can tell us how much of a difference our intervention makes. An example: suppose you develop a new homework reading technique for your students and you test it against a control group, which will be based on performance on a later reading comprehension test. If the group means (experimental vs. control group) differ significantly, great! But don’t stop there. You can also calculate the Effect Size to see just how much they differ–and, as mentioned, Effect Size units are standard deviations! So you are able say something like: my new homework reading technique is so effective that a student testing at the 50th percentile in a different class will test about one standard deviation higher in my course, around the 84th percentile.

If we again extend the transfer/map metaphor, Effect Size helps us find the really good places to visit, or transfer to. It’s like a map of the best vacation spots. After all, most of us would rather visit the beach than North Dakota in the winter, so it’s good to know where’s worth going (for certain purposes).

Stats can help teachers

Basically, what I tried to argue in my talk is not that quantitative methods/analysis are superior, but that they can do a lot of cool things, and that they are particularly important for building maps. Maps are, after all, inherently approximate and generalized, just like grades and student performance on tests and any kind of quantitative measure are merely approximations. Maps are indeed limited in many ways: looking at a (Google street) map of Paris, for instance, obviously pales in comparison to staring up at the Palace of Versailles in person. But looking at a map beforehand can help you get around once you do visit. It’s the same with quantitative measures of learning. They can help us get a lay of the land, which then allows us to use our pedagogical experience and expertise to attend to the particular subtleties of each class and each student. Teachers are experimenters, after all, and each class, each semester, is a sample of students whose mean knowledge we hope is significantly and sizably improved by the end of the course.

“I can’t teach writing in only one semester”

One complaint I’ve consistently heard (and myself made) during all my years teaching college writing is that one or two semesters is not enough time to teach writing. One finding I’ve repeatedly come across in all my years poring over educational research is that the largest source of variance in student writing performance is the interaction between student and task. In other words, the fewer the number of tasks assigned in your course (assignments, papers, tests, etc.), the higher the variation in student performance, which means the less reliably (consistently) your class measures whatever it is we conceive of as “writing ability.” This seems intuitive to me, and it is consistently found in the literature. On average two different assignments, in the same course and even for the same student, are basically nonpredictive of performance in one by performance in another. A student can write an amazing editorial and absolutely bomb a research paper. Does that make your course an unreliable measurement of writing ability? Maybe.

The logical response to this is simple: just assign many more and wildly different tasks in your course to better cover the vast domain of writing ability and thus reduce performance variance. But, alas, that’s where the time constraint comes in. It’s virtually impossible to assign more than four major assignments a semester. Are four writing assignments enough to reliably capture writing ability? No way. Not given the infinite amount of genres and writing tasks out there and our evolving definition of writing ability.

So then, what if we increased the number of assignments, but made them smaller and spent less time on each? 10 small writing assignments a semester, instead of four major ones? Would 10 assignments more reliably capture writing ability and minimize our measurement error? Statistically, yes. Intuitively, I think yes, too. But I understand the resistance to this idea. There is value, I think, in longer, more in-depth writing assignments. I bet most freshman college students haven’t written a paper longer than 10 pages, and at some point they absolutely should write one. (I think multiple.)

But I wonder if that value can be realized in a freshman writing class. What is the realistic purpose of a one semester, freshman college writing class after all? If our time, and thus our measurement instrument, is narrowed to one semester, maybe we should break up the cognitive trait we intend to measure into smaller chunks, since the whole construct can never be reliably captured in a semester with four major assignments. It’s like we’re trying to measure several miles with four yardsticks. If we’re only going to get one (or two at most) semesters(s), maybe we should adjust the use of our narrowed instrument accordingly, by using it on more, smaller, varied tasks. Then instead of measuring miles with yardsticks we’ll at least be measuring yards with rulers.

What are grades?

nyc-rest_-grades

Should teachers rank-order their students against one another?

The fancy way to ask this: Do we want norm-referenced grades? As opposed to criterion-referenced grades?

Norm-referenced means grades are assigned relative to a class average, while criterion-referenced means grades are assigned relative to a set of objective outcomes. Put another way: Do we care about what order the runners finish the race in? Or do we just want everyone to finish the race? What is a race?

Norm-referencing is common at the highest levels of educational and professional attainment, like in law or medical school. First in your class. Fifth in your class. Last in your class. Criterion-referencing, meanwhile, is more common in lower-stakes situations, such as a driving test. As long as you meet some level of competence, it doesn’t matter how your driving test score compares to that of your neighbor, you both get driver’s licenses. (Curiously, the Bar Exam and certain Medical Exams are criterion-referenced, even if prior schooling for the students taking them isn’t.)

Aside from law and medicine, school grades represent somewhat of a middle ground between pure ranking and broad licensure or credentialing. School grades reflect performance relative both to established criteria and to the performance of other students in the class. Think about it: we’ve all at times done A work or C work or D work, which means there are objective tiers of performance criteria. Theoretically, nothing prevents multiple, or even all, students from achieving the same grade, if they perform at roughly the same level. Yet this almost never happens. And grades, though tiered, are still ranked; the A tier is obviously better than the C tier. You can’t say you just care about everyone finishing the race if you rank-grade their performance. So what are grades?

My answer? Depends on the class. Some classes are indeed more like driving tests, and ranking students in such classes serves little purpose if we’re just trying to get students driving. But other classes perhaps do benefit from a more granular assessment of performance, like the rank-ordering of law and medical school students. Surgery isn’t driving.

My–and I suspect many others’–gut says ranking is bad, inherently hierarchical, discouraging, and inegalitarian. Yet, at the same time, I fear the service model of education comports all too well with a credentialing-style approach to grades, where students expect to just “finish the race” with little thought to how much learning actually occurs along the way.

Teach metaphor not grammar

At least once a semester since I started grad school (seven semesters now) I find myself embroiled in a familiar debate: to teach grammar or not to teach grammar. The majority of people outside of my field and outside the world of education broadly would laugh that this is even a question, because to most people of a course a writing teacher teaches grammar. Every time my job comes up in conversation with a non-teacher there’s inevitably a side comment made about watching their grammar around me. You can set your watch by it. In fact, to most people, grammar’s all I should teach. This is of course wrong, my field will eagerly point out. Writing is more than grammar; good grammar does not equal good writing; traditional, decontextualized grammar instruction–in the form of grammar handbooks or sentence diagramming–is largely ineffective as a pedagogical approach to teaching writing and usage to young, learning writers.

However, I’ve always felt that because the grammar jokes persist, that because grammar is so strongly associated with writing and teaching in a general, it simply can’t be ignored. In a sense, these people are right. I’ve thus argued every semester I get in this debate that we absolutely we need to teach grammar. It’s irresponsible to ignore something so heavily weighted and so widely rewarded in our culture. Yes, we ought to live in a society that doesn’t disproportionately reward a certain interpretation of correct grammar, but we don’t. The question, then, is not should we teach grammar but how should we teach grammar.

In recent semesters I’ve concluded one effective way to teach grammar is to teach metaphor. The first unit of my course introduces Kenneth Burke’s four “master tropes” of language–metaphor, metonymy, synecdoche, and irony, the latter three each a species of the first. Now, in most composition circles, it’s currently fashionable to critique the teaching of writing at the sentence level; I might be laughed at by serious compositionists for emphasizing such dull linguistic nuances and sentence level tropes as metaphor and irony instead of spending valuable class time on having students explore their identity or something similar through their writing. I happen to believe assignments dealing with personal identity can (but not always) institutionalize identities in a way that doesn’t sit well with me, but that’s a different story. The real resistance to sentence-level writing pedagogy is that people associate it with sentence diagramming and rote grammar drills, which, many claim (and some studies support), are ineffective strategies for teaching writing.

But while sentence diagramming and decontextualized and rote grammar drills do in fact fail to teach writing, we must not confuse the method of pedagogical delivery for its content. Which is to say: the problem with attention to the sentence level is not the sentence level content itself; it’s the way we present the sentence level content. A big part of teaching writing, I’ve learned, is changing students’ perception of what writing and language is and does in the first place, an undoing of the view of language as solely grammatical, a product of the rote grammar exercises that programs us to think of language as merely a set of rules to feverishly follow. Coming at the sentence level from a different angle then–that of metaphor broadly and its various species, metonymy, synecdoche, irony–can help students re-think the content of the sentence level in productive, and, more importantly, novel ways. Language becomes less of a system of rules to anxiously follow and more of a vast toolbox for helping you describe and externalize some feeling inside you. Language becomes fun. Not to mention, teaching language as metaphorical, rather than simply “grammatical,” is a lot more intuitive to students.

To be sure, emphasizing language as metaphorical as opposed to grammatical doesn’t remove the rules of grammar, it just transforms them. Foregrounding writing as metaphorical actually gets at grammatical concepts through a backdoor. It reframes the rules so they’re less about prohibition and more about assisting. It’s like crushing up your medicine and sprinkling it in pudding. Making students write deliberately metaphorical prose forces them to make use of more sophisticated grammatical constructions without them even realizing it. For example, early in the semester I often assign them the task of writing an extended metaphor for attending college, and by asking them to not only describe a complex event (attending college) but in a metaphorical way, they’re forced to reach for new kinds of tools in the toolbox of language to help construct their metaphor. Whether the students are conscious of it or not, these new tools contain various grammatical complexities, which then filter out into their writing more generally. After they write, then we can chat about what kind of grammatical affectations are present, how changing them might change the sentence or the metaphor as a whole, and so on.

I think we often teach grammar with the hope that, if students just memorize all the rules then they can write beautiful and grammatically complex sentences. But the rules of grammar are unintuitive, endless, and inherently restrictive (they are in fact rules.) Meanwhile, metaphors are generative. They’re not prohibitions, but sparks, suggestions. What’s more likely to enable you to write complex sentences, memorizing a bunch of rules or playing a game?

It’s not a perfect system and I’m still tweaking it, but it’s been a good move for me, and it helps resolve some of the anxieties I have about teaching grammar in Freshman Composition.