From meeting to action.
From meeting to action.
A 3-day research sprint into why Fireflies.ai was losing users to Granola and why the stated problem wasn't the real problem.
A 3-day research sprint into why Fireflies.ai was losing users to Granola and why the stated problem wasn't the real problem.

Before Getting Started
Before Getting Started
Project context
Project context
This is a speculative case study carried out independently, with no affiliation to or commission from Fireflies.ai. The project was developed under real time constraints, prioritizing research quality and focused solutions over comprehensive coverage. All research was conducted using publicly available sources.
This is a speculative case study carried out independently, with no affiliation to or commission from Fireflies.ai. The project was developed under real time constraints, prioritizing research quality and focused solutions over comprehensive coverage. All research was conducted using publicly available sources.
The Problem
The Problem
What was the challenge given?
What was the challenge given?
The problem came framed as a business concern: Fireflies.ai was losing users to Granola.ai. Churned users reported not knowing which features they had, and described Granola's interface as simpler and easier to navigate. Leadership flagged that new users were taking far longer than expected to activate (LTV), meaning they took too long to reach the product's core value: viewing the summary of a meeting
The problem arrived pre-framed as onboarding and discoverability. The first job was to test that framing, not build inside it.
The problem came framed as a business concern: Fireflies.ai was losing users to Granola.ai. Churned users reported not knowing which features they had, and described Granola's interface as simpler and easier to navigate. Leadership flagged that new users were taking far longer than expected to activate (LTV), meaning they took too long to reach the product's core value: viewing the summary of a meeting
The problem arrived pre-framed as onboarding and discoverability. The first job was to test that framing, not build inside it.

Important nuance
Important nuance
A stated problem is not always the real problem. Users rarely report the root cause accurately, they report the symptom closest to their frustration. The research process was designed to find what was actually happening before any design decisions were made.
A stated problem is not always the real problem. Users rarely report the root cause accurately, they report the symptom closest to their frustration. The research process was designed to find what was actually happening before any design decisions were made.
Research Approach
Research Approach
Understanding the market, the users,
and the product
Understanding the market, the users, and the product
Understanding the market, the users, and the product
Given the 3-day time constraint, traditional user interviews were not feasible at scale. The research was built on three layers: structured desk research across comparison sites and product blogs, a deep qualitative analysis of 25 Reddit thread discussions where real users discuss AI meeting tools spontaneously, and a hands-on exploratory walkthrough of the real Fireflies product.
Given the 3-day time constraint, traditional user interviews were not feasible at scale. The research was built on three layers: structured desk research across comparison sites and product blogs, a deep qualitative analysis of 25 Reddit thread discussions where real users discuss AI meeting tools spontaneously, and a hands-on exploratory walkthrough of the real Fireflies product.
Research decision
Research decision
Why Reddit? Spontaneous comments from users explaining, unprompted, why they switched tools, what broke their trust, or what made them stay are qualitatively equivalent to guerrilla interview data. People don't perform for a forum. They vent, they recommend, they warn. That's the signal.
Why Reddit? Spontaneous comments from users explaining, unprompted, why they switched tools, what broke their trust, or what made them stay are qualitatively equivalent to guerrilla interview data. People don't perform for a forum. They vent, they recommend, they warn. That's the signal.



What the Research Revealed
What the Research Revealed
The moment that made the problem concrete
The moment that made the problem concrete
Alex is a product manager. He just finished a 45-minute stakeholder call. Three decisions were made. Two people committed to deliverables. He was fully present, no notes, because Fireflies was supposed to handle it.
The call ends. He switches to Slack. The summary isn't there. He opens Fireflies. It says "processing." He waits. The window for acting on the meeting is closing, in 10 minutes he has another call.
The summary arrives 14 minutes later. It's long. The action items are vague bullets. He spends 8 minutes editing before sending anything to his team.
The job isn't to transcribe the meeting. The job is to make the 5 minutes after the meeting feel effortless.
Alex is a product manager. He just finished a 45-minute stakeholder call. Three decisions were made. Two people committed to deliverables. He was fully present, no notes, because Fireflies was supposed to handle it.
The call ends. He switches to Slack. The summary isn't there. He opens Fireflies. It says "processing." He waits. The window for acting on the meeting is closing, in 10 minutes he has another call.
The summary arrives 14 minutes later. It's long. The action items are vague bullets. He spends 8 minutes editing before sending anything to his team.
The job isn't to transcribe the meeting. The job is to make the 5 minutes after the meeting feel effortless.
What users are actually saying
What users are actually saying
The research uncovered patterns that went well beyond the stated problem. Several of the most critical findings directly challenged the original framing.
The research uncovered patterns that went well beyond the stated problem. Several of the most critical findings directly challenged the original framing.

Eight behavioral categories emerged from the analysis. The most strategically relevant:
Eight behavioral categories emerged from the analysis. The most strategically relevant:
Bot social friction
Bot social friction
Users avoid tools with a visible bot on external or sensitive calls. Discretion is a feature.
Users avoid tools with a visible bot on external or sensitive calls. Discretion is a feature.
Trust gap in summaries
Trust gap in summaries
Users re-read transcripts to verify what the AI said. This eliminates the time savings the product promises.
Users re-read transcripts to verify what the AI said. This eliminates the time savings the product promises.
Post-meeting action gap
Post-meeting action gap
Summaries are consumed but not acted on. The handoff from "what was said" to "what to do next" is broken.
Summaries are consumed but not acted on. The handoff from "what was said" to "what to do next" is broken.
Billing & trust erosion
Billing & trust erosion
Unexpected AI Credit charges cause emotional disconnection from the product, more than any UX friction does.
Unexpected AI Credit charges cause emotional disconnection from the product, more than any UX friction does.
Synthesis
Synthesis
What we know for sure
What we know for sure
After mapping the research, findings were separated into what was confirmed by multiple independent sources and what remained a hypothesis requiring user validation.
Confirmed, no further validation needed
Confirmed, no further validation needed
Billing trust is the primary documented churn trigger.
Billing trust is the primary documented churn trigger.
Integration failures happen silently, users discover a missing Notion summary hours after the meeting, not in real time.
Integration failures happen silently, users discover a missing Notion summary hours after the meeting, not in real time.
Granola also has critical reliability issues (silent crashes, missed meetings) that rarely appear in its "simpler UI" narrative.
Granola also has critical reliability issues (silent crashes, missed meetings) that rarely appear in its "simpler UI" narrative.
Still questioning
Still questioning
Is the activation delay caused by onboarding friction or by the user simply not having another meeting until days after signup?
Is the activation delay caused by onboarding friction or by the user simply not having another meeting until days after signup?
Is "users don't know features exist" a discoverability problem, or do users see the features and not understand their value?
Is "users don't know features exist" a discoverability problem, or do users see the features and not understand their value?
The most important reframe from the research: users are not churning because Fireflies has too many features. They are churning because they never built a habit around the product before hitting a billing surprise or a silent integration failure. Fixing discoverability without addressing these trust moments would solve the wrong problem.
The most important reframe from the research
The most important reframe from the research
Users are not churning because Fireflies has too many features. They are churning because they never built a habit around the product before hitting a billing surprise or a silent integration failure.
Users are not churning because Fireflies has too many features. They are churning because they never built a habit around the product before hitting a billing surprise or a silent integration failure.
Jobs to be Done
Jobs to be Done
What users actually hire Fireflies to do
What users actually hire Fireflies to do
Six core jobs emerged from the research. Each was framed as a job statement using the structure: I want [motivation] when [context], so I can [outcome].
Six core jobs emerged from the research. Each was framed as a job statement using the structure: I want [motivation] when [context], so I can [outcome].

Exploratory Usability Testing
Exploratory Usability Testing
A first-time user walkthrough before testing with real users
A first-time user walkthrough before testing with real users
Before wireframing, the jobs and assumptions above were tested directly, by walking through the real Fireflies product as a true first-time user would, from sign-up through a live meeting capture to the generated summary, documenting every reaction in the moment it happened.
Before wireframing, the jobs and assumptions above were tested directly, by walking through the real Fireflies product as a true first-time user would, from sign-up through a live meeting capture to the generated summary, documenting every reaction in the moment it happened.
01
01
Mental model mismatch: the instinct was "create a bot to bring into the meeting," not "add a bot to a live meeting", even after reading the page headline.
Mental model mismatch: the instinct was "create a bot to bring into the meeting," not "add a bot to a live meeting", even after reading the page headline.
02
02
Confirming Fireflies was actually capturing required leaving the meeting entirely to check a separate "Meeting status" tab. Zoom confirmed recording, but not whether Fireflies was.
Confirming Fireflies was actually capturing required leaving the meeting entirely to check a separate "Meeting status" tab. Zoom confirmed recording, but not whether Fireflies was.
03
03
A real misuse risk surfaced: the Fireflies notetaker bot appears as a selectable option in the native "assign new host" dialog, with no warning about what happens if it's picked by accident.
A real misuse risk surfaced: the Fireflies notetaker bot appears as a selectable option in the native "assign new host" dialog, with no warning about what happens if it's picked by accident.
04
04
The recording wasn't visible for ~2 minutes after leaving the call. Long enough to think the entire meeting was lost.
The recording wasn't visible for ~2 minutes after leaving the call. Long enough to think the entire meeting was lost.
05
05
AskFred and AI Skills felt redundant: AI Skills gave thinner output and cost credits; the same ask in AskFred (free, flexible) gave a fuller result.
AskFred and AI Skills felt redundant: AI Skills gave thinner output and cost credits; the same ask in AskFred (free, flexible) gave a fuller result.
06
06
The sidebar navigation was checked directly: there is no cross-meeting search or "ask across all meetings" entry point today. It isn't hidden, it doesn't exist yet.
The sidebar navigation was checked directly: there is no cross-meeting search or "ask across all meetings" entry point today. It isn't hidden, it doesn't exist yet.
Priorization
Priorization
What to solve, and in what order
What to solve, and in what order
Two frameworks, one purpose: figure out what to build, and in what order.
MoSCoW separated must-solve from nice-to-have. Now–Next–Later turned that into a delivery sequence. One deliberate change: search discovery was moved from Next to Now. The reasoning: surfacing Fireflies' strongest retention driver after onboarding means users decide whether to stay before they've seen the reason to. That's a sequencing error, not a feature gap.
Two frameworks, one purpose: figure out what to build, and in what order.
MoSCoW separated must-solve from nice-to-have. Now–Next–Later turned that into a delivery sequence. One deliberate change: search discovery was moved from Next to Now. The reasoning: surfacing Fireflies' strongest retention driver after onboarding means users decide whether to stay before they've seen the reason to. That's a sequencing error, not a feature gap.


User Flow
User Flow
The current broken path vs.
the desired experience
The current broken path vs. the desired experience
The current flow has six failure points between meeting end and first action taken. The redesigned flow collapses that into one continuous experience: meeting ends 🡪 summary pushed in 2 minutes 🡪 60-second read 🡪 action items created in one tap 🡪 search introduced as the bridge to long-term value
The current flow has six failure points between meeting end and first action taken. The redesigned flow collapses that into one continuous experience: meeting ends 🡪 summary pushed in 2 minutes 🡪 60-second read 🡪 action items created in one tap 🡪 search introduced as the bridge to long-term value

The flow also includes an explicit failure branch: "What happens when the recording fails?". A tool that looks like it's working but isn't is more dangerous than one that fails loudly. The failure state was designed with the same care as the success state.
The flow also includes an explicit failure branch: "What happens when the recording fails?". A tool that looks like it's working but isn't is more dangerous than one that fails loudly. The failure state was designed with the same care as the success state.
Wireframes
Wireframes
Six screens, three directions each,
built on the real interface
Six screens, three directions each,
built on the real interface
Six screens, three directions each, built on the real interface
Every wireframe was built on top of real Fireflies UI screenshots, not invented from scratch. That constraint kept every decision grounded: if the solution didn't work in the context of the actual product, it didn't make the cut.
The exploratory test also changed the scope mid-sprint. Two screens were replaced after direct product use revealed more critical issues than the brief had anticipated.
Every wireframe was built on top of real Fireflies UI screenshots, not invented from scratch. That constraint kept every decision grounded: if the solution didn't work in the context of the actual product, it didn't make the cut.
The exploratory test also changed the scope mid-sprint. Two screens were replaced after direct product use revealed more critical issues than the brief had anticipated.
A note on existing features
A note on existing features
Some of the solutions presented here build on functionality that already exists in Fireflies.
The goal of this project was not to identify missing features, but to evaluate whether existing ones were discoverable, understandable, and usable during a real first-time experience. The findings revealed several usability and trust gaps despite the underlying functionality already being available.
Some of the solutions presented here build on functionality that already exists in Fireflies.
The goal of this project was not to identify missing features, but to evaluate whether existing ones were discoverable, understandable, and usable during a real first-time experience. The findings revealed several usability and trust gaps despite the underlying functionality already being available.
Screen 1
Screen 1
Add to live meeting
Add to live meeting
Direction chosen: replace manual link-pasting with calendar-native meeting selection. The failure mode wasn't a bad error message, it was that pasting a link at all was the wrong interaction model. The calendar is already connected. Use it.
Direction chosen: replace manual link-pasting with calendar-native meeting selection. The failure mode wasn't a bad error message, it was that pasting a link at all was the wrong interaction model. The calendar is already connected. Use it.

Screen 2
Screen 2
Live meeting capture status
Live meeting capture status
The anxiety: "Is Fireflies actually recording?" requires leaving the meeting to check a separate tab. This direction keeps status visible inside the call and adds a guard before the bot can be accidentally assigned as host (a real misuse risk discovered during testing).
The anxiety: "Is Fireflies actually recording?" requires leaving the meeting to check a separate tab. This direction keeps status visible inside the call and adds a guard before the bot can be accidentally assigned as host (a real misuse risk discovered during testing).

Screen 3
Screen 3
The post-meeting wait
The post-meeting wait
Direction chosen: repurpose the existing AskFred panel, already in the product, currently underused, as the post-meeting status messenger. No new UI surface needed. The infrastructure exists. The question was whether users would find it there. Testing would answer that.
Direction chosen: repurpose the existing AskFred panel, already in the product, currently underused, as the post-meeting status messenger. No new UI surface needed. The infrastructure exists. The question was whether users would find it there. Testing would answer that.

Screen 4
Screen 4
Summary view
Summary view
The most-cited research finding: in a long transcript, a decision and a topic look identical. This direction adds an explicit "Decisions made / Action items" split above the notes, anchoring the two things users actually need to act on before they read anything else.
The most-cited research finding: in a long transcript, a decision and a topic look identical. This direction adds an explicit "Decisions made / Action items" split above the notes, anchoring the two things users actually need to act on before they read anything else.

Screen 5
Screen 5
AI Skills vs. AskFred
AI Skills vs. AskFred
Two features, same job, different cost. AskFred is free and flexible. AI Skills costs credits and returns thinner output. When a first-time user is given both side by side, they choose AskFred, every time. This direction explores what consolidation looks like when one of the options is clearly dominant.
Two features, same job, different cost. AskFred is free and flexible. AI Skills costs credits and returns thinner output. When a first-time user is given both side by side, they choose AskFred, every time. This direction explores what consolidation looks like when one of the options is clearly dominant.

Screen 6
Screen 6
Search discovery bridge
Search discovery bridge
Cross-meeting search doesn't exist in Fireflies' nav today. It's not hidden, it's absent. This screen proposes two entry points: a standing nav item, and a contextual introduction right after a user's first completed summary. The second matters more: the moment of highest engagement is the moment to show users why staying is worth it.
Cross-meeting search doesn't exist in Fireflies' nav today. It's not hidden, it's absent. This screen proposes two entry points: a standing nav item, and a contextual introduction right after a user's first completed summary. The second matters more: the moment of highest engagement is the moment to show users why staying is worth it.

View all wireframe versions in Figma
View all wireframe versions in Figma
🡭
Next Steps
Next Steps
Where this goes from here
Where this goes from here
01
01
Usability testing with real users. Each screen has three explored directions,the next step is testing them against real users to see which one actually performs, not which one looks best on a board.
Usability testing with real users. Each screen has three explored directions,the next step is testing them against real users to see which one actually performs, not which one looks best on a board.
02
02
Dot voting with the Fireflies design and product team. Where directions are equally defensible, like AskFred-first vs. a fully unified summary surface, the fastest way to align internally before testing is a quick collaborative vote, not a unilateral design call.
Dot voting with the Fireflies design and product team. Where directions are equally defensible, like AskFred-first vs. a fully unified summary surface, the fastest way to align internally before testing is a quick collaborative vote, not a unilateral design call.
03
03
Validate the activation-time metric. Clarifying whether it measures time-to-first-summary or time-to-return-visit remains an open question worth resolving with the product team directly.
Validate the activation-time metric. Clarifying whether it measures time-to-first-summary or time-to-return-visit remains an open question worth resolving with the product team directly.
04
04
Identify retention-correlated behaviors. If product data shows cross-meeting search predicts long-term retention, Screen 6 should be the first concept tested.
Identify retention-correlated behaviors. If product data shows cross-meeting search predicts long-term retention, Screen 6 should be the first concept tested.
View the full research board on Miro
View the full research board on Miro
Briefing · Desk Research · HMWs · JTBD · Usability Test · MoSCoW · User Flow · Wireframes
Briefing · Desk Research · HMWs · JTBD · Usability Test · MoSCoW · User Flow · Wireframes
🡭
Conclusion
Conclusion
What this case set out to prove
What this case set out to prove
A feature existing and a feature working for someone encountering it for the first time are two different claims. This case focused on that gap, not feature gaps, but usability and trust gaps in what was already built.
Every recommendation connects back to a specific observation, a specific finding, or a decision point where one direction was chosen over another and why. That's the standard this project held itself to.
A feature existing and a feature working for someone encountering it for the first time are two different claims. This case focused on that gap, not feature gaps, but usability and trust gaps in what was already built.
Every recommendation connects back to a specific observation, a specific finding, or a decision point where one direction was chosen over another and why. That's the standard this project held itself to.
Existing doesn't always mean usable
Existing doesn't always mean usable
Many of the opportunities identified in this case relate to functionality that already exists. The research showed that a feature existing and a feature working for someone encountering it for the first time are two different claims. Users often struggled to discover, understand, or trust capabilities that were already available.
Many of the opportunities identified in this case relate to functionality that already exists. The research showed that a feature existing and a feature working for someone encountering it for the first time are two different claims. Users often struggled to discover, understand, or trust capabilities that were already available.
From evidence to direction
From evidence to direction
Rather than focusing on feature gaps, this case focused on usability gaps. Every recommendation is tied back to observed behavior, research findings, and tested assumptions, turning insights into clear, testable design directions.
Rather than focusing on feature gaps, this case focused on usability gaps. Every recommendation is tied back to observed behavior, research findings, and tested assumptions, turning insights into clear, testable design directions.
Thank You!
Thank You!
ROLE
UX Designer (solo)
UX Designer (solo)
YEAR
2026
3 days · 2026
TOOLS
Miro · Figma · Claude
Miro · Figma · Claude
SECTOR
AI Productivity / SaaS
AI Productivity / SaaS
© 2026
Matheus Paiva
© 2026
© 2026
Matheus Paiva



