The problem with text-based discussion questions is that they require students to generate content from their own knowledge and opinions. For lower-level students, or students who find a topic unfamiliar, this blank-page problem can stop a conversation before it starts.
Visual prompts solve this differently. An image always provides content - something is happening, someone is present, a place exists. Students don't need background knowledge or strong opinions to respond to what they see. The visual gives them something to work with immediately, and the conversation grows from there.
This is different from the narrow use of images in IELTS Part 2 preparation (compare these two photographs). Visual prompts can serve far broader purposes across any level, any topic, and any speaking format. YapYapGo is a classroom speaking practice tool for ESL and EFL teachers. Visual activities work well as warm-ups before YapYapGo's structured pair modes. Here's a practical guide.
Why images work differently from text
Interpretation is always possible. You can always say what you see, what you think is happening, what you feel about it. These three responses are available to every student at every level, requiring no background knowledge. Images are polysemous. The same image means different things to different people. This generates natural disagreement without any controversy - students genuinely see different things, prioritise differently, and respond differently. They bypass content generation. Students don't have to think of something to say - the image gives them something. This removes the blank-page anxiety that freezes discussions, especially for lower levels and more anxious students. They trigger personal connection. Images of universal human experiences - joy, struggle, beauty, difficulty - reliably trigger personal memories and associations that fuel conversation.Choosing images that generate good discussion
Not all images work equally well as speaking triggers. The best images have:
Genuine ambiguity. An image that's obviously a tree doesn't generate discussion. An image of a person reading a letter in what might be good news or bad news does. The ambiguity is what requires interpretation - and interpretation is speaking. Human presence or human consequence. Humans looking at humans is endlessly interesting. An abstract landscape can be beautiful but rarely sparks extended discussion. Add a person, a human story, or evidence of human activity, and the conversation deepens. Open emotion. Images of clearly happy or clearly sad people produce simple descriptive responses. Images of ambiguous emotions (is this person tired, reflective, or sad?) produce interpretive conversation. Accessible visual vocabulary. For lower-level students, images with complex visual detail require too much vocabulary. Clear, uncluttered images work better at A2-B1. Complex, layered images work well at B2-C1.Activity formats for visual prompts
The interpretation pair (B1-C1)
Show one image to the class. Each student in a pair interprets it independently for 30 seconds (no sharing). Then they compare: What did you notice first? What do you think is happening? How does it make you feel?
The interpretation comparison is the activity. Students are often genuinely surprised by what their partner saw or felt - which is exactly the kind of genuine communicative surprise that drives extended conversation.
The story behind the image (A2-B2)
Show an image of people in a situation. Students discuss: What happened just before this photo was taken? What happens next? What is the relationship between the people?
The narrative speculation format produces naturally extended speech because there's always more story to tell. Lower levels tell simple stories; higher levels add complexity, character motivation, and narrative consequence.
The compare-and-respond pair (all levels)
Each student in a pair sees a different image on the same theme. Without showing each other, they describe their image. Then they answer together: What do the two images have in common? What is most different? If you had to choose one as a poster for [topic/concept], which would you choose?
This is the direct equivalent of IELTS Part 2 and Cambridge FCE Part 2 - useful for exam preparation, but the format also works for any topic where you want students to practice comparing and contrasting.
Tool tip: After a visual prompt warm-up, YapYapGo takes the topic that emerged from the images and runs structured pair discussion questions on it. The image establishes the emotional context; YapYapGo provides the question sequence. A classroom countdown timer keeps visual activities focused - images can absorb attention indefinitely without a time structure.
The rank and justify (B1-B2)
Five images on a theme (five interpretations of "success," five types of relationship, five moments in a journey). Students rank them individually from most to least [resonant / typical / surprising]. They compare rankings with their partner and justify any differences.
Ranking with justification forces extended speaking. "I ranked this one first because..." requires more than one sentence, especially when partners have ranked differently.
The news photograph discussion (B2-C1)
A news photograph - one with genuine social, political, or human interest. Students discuss: What do you see? What story does this tell? What questions does it raise? What doesn't the image show you?
The final question ("what doesn't the image show?") is particularly generative - it invites students to think about framing, perspective, and what's absent from any single representation of reality. This is sophisticated critical thinking that C1 students find genuinely engaging.
The picture prompt chain (A2-B1)
A sequence of simple images telling a story (available from picture story books, stock image libraries, or comic strips with text removed). Students take turns describing what's happening in each image, alternating. Then they retell the whole story together.
The collaborative retelling produces connected narrative - students have to link their partner's version with their own. For lower levels, this generates more connected discourse than any other format.
Where to find images
Unsplash and Pexels: Free, high-quality photography without licensing concerns. Search by emotion ("contemplation," "conflict," "joy") or situation ("workplace," "family," "urban"). The New York Times Learning Network: Publishes photograph-based discussion prompts for young people - some are directly usable in ESL contexts. Magnum Photos: High-quality photojournalism. Best for B2-C1 critical analysis activities. Your own photos: Personal photos of places you've been, things you've noticed, situations you've found interesting. These carry authenticity and often produce the most interesting discussions.A random student picker and this-or-that generator (for image-based binary choices like "which of these two photos best represents success?") are both useful their interpretation with the class. For more on picture description activities specifically for exam preparation, see our post on picture description activities for ESL speaking.
Sources:
- Ur, P. (1981). Discussions That Work. Cambridge University Press. - Visual triggers as prompts for genuine discussion.
- Nation, I.S.P. & Newton, J. (2009). Teaching ESL/EFL Listening and Speaking. Routledge. - Image-based tasks for lower-level and anxious speakers.
- Kress, G. & van Leeuwen, T. (2006). Reading Images: The Grammar of Visual Design. Routledge. - How different image features generate different interpretive responses.
