Conversation Mining: How to Discover What Your Ideal Clients Say When Nobody’s Watching
TL;DR: Conversation mining means reading what your audience writes in places where they’re not performing for you. Reddit threads, Facebook groups, forums, review sites. You’re not looking for content ideas. You’re looking for language. The exact words and phrases people use at two in the morning when they’re being honest about what they’re going through. A survey might tell you clients want to “feel like themselves again.” The same person on Reddit at 1am writes: “I woke up soaked through again last night and I genuinely don’t know how much longer I can do this job.” Those two sentences describe the same experience, but only one would make someone stop scrolling. Conversation mining finds the second version. This article covers where to look, what to extract, what to ignore, the ethical considerations, and when to automate the process versus doing it by hand.
I found a sentence that changed how I think about audience research
A menopause coach I was working with had sent out a survey. Good survey, thoughtful questions. The responses came back saying things like “I want to feel like myself again” and “I’d love more energy and confidence.” Perfectly reasonable.
That same week, I was reading a Reddit thread in r/Menopause. A woman had posted at 1:14am: “I woke up soaked through again last night and I genuinely don’t know how much longer I can do this job.”
Same person, probably. Same problem, certainly. But the survey answer and the Reddit post don’t sound like they come from the same planet. One is the version she’s comfortable sharing with a coach. The other is what she actually thinks about lying awake at quarter past one on a Tuesday.
The Reddit version is where your content should start. That’s conversation mining.
What is conversation mining?
Conversation mining is the practice of reading what your audience writes in places where they’re being honest, and extracting the language they use to describe their problems.
Not surveys. Not polls. Not “what are you struggling with?” in a Facebook group where everyone knows you’re the coach. Those give you the edited version. Conversation mining gives you the first draft, before anyone’s thought about who might read it.
The term comes from voice-of-customer research. Companies like Amazon and Procter & Gamble have been doing it for decades. What’s changed is access. Reddit, Facebook groups, Quora, Trustpilot, Google Reviews, niche forums. The conversations are public, searchable, and free.
You’re not looking for content ideas. You’re looking for the exact phrases, complaints, and emotional states your audience uses when they’re not talking to you.
Why does this matter for coaches specifically?
Coaches have a vocabulary problem that product businesses don’t.
You’ve trained. You’ve studied. You’ve developed precise language for the transformations you facilitate. “Boundary-setting.” “Nervous system regulation.” “Values alignment.” Accurate terms.
Your clients don’t use any of those words. Not yet. Before they find you, they describe their problems in their own language. Raw, specific, often messy.
A divorce coach writes “navigating the emotional complexities of separation.” Her ideal client, on Reddit at 2am, writes: “He moved out three weeks ago and I still set the table for two.”
Both are about the same experience. One sounds like a professional. One sounds like a person in pain. Content built from the second version stops people mid-scroll because it sounds like someone read their mind.
This is the Language Gap, and conversation mining is the most reliable way to close it.
Where to mine conversations
Not every platform gives you the same quality of honesty. Here’s where to look and what each source is good for.
The richest source for honest audience language. Reddit is anonymous, so people write things they’d never say publicly. It’s also the most cited source in Google’s AI Overviews, which means the language patterns here are the same ones AI systems use to understand a topic.
Where to search:
- Subreddits for the problem, not the solution. If you’re a sleep coach, try r/insomnia, not r/sleephacks.
- Use Reddit’s search bar with phrases your clients might use: “can’t sleep,” “exhausted all the time,” “marriage falling apart.”
- Sort by “new” rather than “top.” The highly upvoted posts tend to be polished. The raw ones are in the newer, lower-traffic threads.
Facebook groups
Less anonymous than Reddit, but still honest when the group is peer-led rather than coach-led. Look for groups where people with the problem support each other.
What to look for: The posts that get 40+ comments. Those are the ones where someone said something that resonated. The comments often contain richer language than the original post.
Review sites
Trustpilot, Google Reviews, and Amazon book reviews for books in your niche. People reviewing a product or service will describe what they hoped it would fix. That’s your audience telling you their problem in the context of trying to solve it.
Example: Amazon reviews for books on anxiety often contain sentences like “I bought this because I was having panic attacks at work and couldn’t tell anyone.” That’s a content headline for an anxiety coach, word for word.
Forums and Quora
Older but still valuable. Niche forums (parenting, health, hobby) contain years of archived conversations. Quora answers tend to be longer and more detailed than Reddit posts, which makes them useful for understanding how people explain their situation to strangers.
Where NOT to look
- Your own comments section (they’re performing for you)
- Coaching industry groups (coach language, not client language)
- Competitor social media (their audience’s public face, not private struggles)
What to look for: the five language patterns
When you’re conversation mining, you’re not reading casually. You’re extracting. Here are the five patterns that matter most.
| Pattern | What it sounds like | Why it matters |
|---|---|---|
| Raw problem language | “I can’t get out of bed most mornings” | The words they use before they learn your terminology |
| Failed solution frustration | “I’ve tried everything and nothing works” | Reveals what they’ve already attempted, so you can position differently |
| Specific trigger moments | “It hit me when my kid asked why I was always tired” | Emotional turning points that make powerful content hooks |
| Identity statements | “I used to be someone who had it together” | How they see themselves in relation to the problem |
| Permission-seeking questions | “Is it normal to feel this way?” | Signals they’re looking for validation before they look for solutions |
1. Raw problem language
The unfiltered descriptions. No clinical terms, no coaching vocabulary, no self-editing. “I snapped at my kids again this morning and then cried in the car for twenty minutes.” That sentence contains more useful information for a parenting coach than any survey response.
2. Failed solution frustration
When someone describes what they’ve already tried, they’re telling you what doesn’t work and what they’re willing to try next. “I’ve done the meditation apps, I’ve done the journaling, I’ve done the cold showers. I still can’t focus for more than ten minutes.” That person is ready for something different.
3. Specific trigger moments
The moments that made someone realise they needed help. Concrete, visual, emotionally immediate. “My boss asked me a question in a meeting and I completely blanked. In front of everyone.” That’s a scene. Scenes are what make people stop scrolling.
4. Identity statements
How someone describes themselves in relation to their problem tells you about their emotional state. “I used to be the organised one” is a grief statement. “I don’t even recognise myself anymore” is a crisis statement. Each requires different content.
5. Permission-seeking questions
“Is it normal to…” and “Does anyone else…” are signals that someone is isolated in their experience. They’re not looking for advice yet. They’re looking for proof that they’re not alone. Content that answers these questions builds trust before you’ve offered anything.
How to do it: a step-by-step process
This is conversation mining done by hand. It works. It takes time, but every minute you spend here saves you hours of guessing later.
Step 1: Identify three to five platforms where your audience talks
Don’t try to cover everything. Pick the platforms where your specific audience is most honest. For most coaching niches, that’s Reddit plus one or two Facebook groups.
Step 2: Search using problem language, not solution language
If you’re a confidence coach, don’t search “confidence coaching.” Search “I feel like a fraud,” “everyone else seems to know what they’re doing,” “scared to speak up in meetings.” Think about how your clients described their problem in their first session, before they had any of your vocabulary.
Step 3: Read at least 30 threads
Not skim. Read. You’re looking for patterns, and patterns don’t show up in five threads. Thirty is the minimum where you’ll start seeing the same phrases and emotional states repeating.
Step 4: Copy exact sentences into a document
Don’t paraphrase. Copy the exact words. “I feel like I’m drowning and nobody can see it” is useful. “Person feels overwhelmed and unsupported” is not. The exact language is the point.
Organise your extracts into columns:
| Their words | What it reveals | Content angle |
|---|---|---|
| “I woke up soaked through again” | Physical symptom as breaking point | The moment it affects your work |
| “I’ve tried everything and nothing works” | Solution fatigue | Position your approach as different |
| “Is it normal to feel this angry?” | Shame about emotional response | Normalising content that builds trust |
Step 5: Look for the three to five phrases that keep repeating
After thirty threads, certain phrases will appear again and again. These are your audience’s core vocabulary. Build a list. These become the foundation of everything you write.
Step 6: Compare their language to yours
Put your last five social media posts next to your extracted sentences. Do they sound like they’re about the same thing? Would the person who wrote that Reddit post at 1am recognise themselves in your content?
If not, you’ve found the Language Gap. And now you can close it.
The ethical bit
I’d be a hypocrite if I didn’t mention this. Conversation mining means reading people’s honest, sometimes vulnerable words. That comes with responsibility.
Do:
- Use language patterns, not specific stories. “I woke up soaked through” is a pattern. Quoting someone’s entire post about their marriage breakdown is something else.
- Treat what you read with respect. These are real people describing real pain.
- Be honest about what you’re doing. Researching how your audience talks so you can communicate better is legitimate work.
Don’t:
- Screenshot or share specific posts, even anonymously.
- Join groups under false pretences to extract data.
- Use someone’s exact personal story as a case study without their knowledge.
The line is: use the language, respect the person.
Manual vs automated: when each makes sense
The conversation mining process I’ve described above works. I did it manually for months before I built anything to automate it. There’s genuine value in doing it by hand at least once, because reading those threads changes how you understand your audience in ways a report can’t.
That said, it takes hours. For a single niche, you might spend a full weekend on it (the Weekend Research Sprint covers how). For ongoing monitoring across multiple subreddits, it becomes a second job.
That’s why I built Pain Point Pulse. It pulls conversations from online sources, extracts the language patterns, maps the pain points, and gives you a report you can create from immediately. It finds the 2am posts you’d never have time to find yourself.
But honestly: if you’ve never done this manually, start there. Read the threads. The tool is faster. The manual version teaches you something the tool can’t.
For a detailed comparison: Manual vs Automated Audience Research.
Common mistakes when conversation mining
Collecting too much, analysing too little. A spreadsheet with 500 copied sentences and no patterns identified is just a spreadsheet. After every session, identify the repeating phrases.
Mining coaching subreddits instead of client subreddits. r/lifecoaching is coaches talking to coaches. r/decidingtobebetter is potential clients talking to each other. The second one is where your data lives.
Ignoring the emotional context. “How do I stop procrastinating?” looks like a productivity question. Read the rest of the post and it’s usually about shame, fear of failure, or feeling like a fraud.
Treating conversation mining as a one-off project. Language shifts. New problems emerge. Even ten minutes a week keeps your understanding current.
Frequently asked questions
How many threads do I need for effective conversation mining?
Thirty is the minimum for a single niche. By that point, you’ll start seeing the same phrases and emotional states repeating. When new threads stop surprising you, your initial map is solid. For ongoing research, five to ten threads a week keeps it current. The Weekend Research Sprint covers how to do a focused burst in two days.
Is it ethical to use language I found in anonymous forums?
Yes, with boundaries. You’re using language patterns, not individual stories. Writing a post that opens with “I woke up soaked through again last night” as a relatable scenario is fine. Screenshotting someone’s post and sharing it, even without their username, crosses a line. Use the words as inspiration for how you communicate. Don’t use specific people’s pain as content. Use the language, respect the person.
What if my coaching niche is too small for Reddit?
Reddit covers more niches than you’d expect. But if yours genuinely isn’t there, the same method works on Facebook groups, Quora, niche forums, Amazon book reviews, Trustpilot reviews, and Google Reviews. Anywhere someone describes their experience to strangers is a valid source. For small niches, book reviews are often the richest source, because people describe what they hoped the book would fix.
How is this different from just reading my DMs?
Your DMs give you language from people who already know you. That’s valuable, but it’s filtered. They’re using your vocabulary back at you. Conversation mining captures people before they’ve found you, before they’ve adopted your language, before they know what coaching even is. That’s the language your marketing needs, because that’s where your next clients are right now. It’s what I call the Invisible Audience.
Can AI tools do this for me?
Partly. ChatGPT and Claude can summarise themes if you paste in thread content. But they can’t browse Reddit for you, they hallucinate pain points that don’t exist, and they flatten emotional language into generic summaries. Pain Point Pulse was built specifically for this. It pulls real conversations, extracts real language, and maps real patterns. But even with automation, reading some threads yourself is worth it. The tool gives you scale. The manual reading gives you empathy.
Conversation mining is basically just reading what people write when they don’t think anyone important is paying attention. I spent years doing it with film audiences before I knew there was a name for it. Watching from the back of the cinema. Listening in the bar after a screening.
Your audience is doing the same thing right now, in a Reddit thread somewhere, at some ungodly hour.
Probably worth a look.
Pat Kelman. Come and look at this.
Image: Photo by KoolShooters on Pexels