The problem with "rich" data sources
When we started building Novaseed, we had a list of potential sources to scan: LinkedIn, X, Facebook Groups, public Slack communities, Reddit. Everyone in our network told us to start with LinkedIn. "That's where B2B buyers are." "People share their professional problems there." "Decision-makers are active on it."
We ran the test anyway. Three weeks of scraping, analyzing, counting.
LinkedIn is mostly people announcing they're "excited to start a new chapter" or publishing threads about leadership lessons. X has become a hot-take slot machine. Facebook Groups are solid for certain niches but wildly inconsistent depending on which group you land in.
Reddit surprised us.
What Reddit has that nothing else does
People go to Reddit to ask for help, not to look good. That's a fundamental difference that gets underestimated.
A VP of Sales won't post on LinkedIn: "help, we keep losing deals because our prospecting tool is broken, anyone fixed this?" Too exposed. Too bad for the personal brand. Instead he'll write a brag-wrapped post about "5 things I learned building sales teams" and hope someone reads between the lines.
On Reddit, that same guy goes to r/sales or r/SaaS and writes exactly this: "We're running Apollo + HubSpot but we keep missing warm leads that come in through side channels. Anyone solved this?" He doesn't know you're reading. He just wants an honest answer.
That's the signal. Purchase intent, expressed clearly, with no personal branding filter in the way.
We analyzed 4,000 posts over one month across five sources. The rate of posts containing a genuine, explicit commercial ask (someone actively searching for a product or product category) was 3.4x higher on Reddit than on LinkedIn, adjusted for post volume processed. LinkedIn has more raw volume, but far lower signal density.
The anonymity bias works in your favor
Reddit users aren't identified by their job title and company name. And paradoxically, that makes them more precise in their questions.
When you're not worried your CRO will see the post, you can write: "Our current sales intelligence tool costs $2,400 a month and hasn't sourced a single qualified lead in three months. I'm ready to rip it out entirely." That's commercial gold in a single post. Budget, frustration, timing, competitive context, all there.
We found posts at that level of detail on a regular basis. Not every day, but consistently enough to build a meaningful pipeline of prospects who had self-identified their own intent.
The risk with a source like this is noise. Reddit also has a lot of posts that go nowhere: lurkers asking theoretical questions, MBA students running market research, founders validating ideas with no budget behind them. That's why we invested heavily in intent scoring before scaling to other sources. A post in r/Entrepreneur from a 3-month-old account with 4 comments isn't the same signal as a post in r/sales with 40 replies from someone who's been active for two years. You have to score both the post and the poster.
What we learned about which subreddits actually convert
Not all Reddit communities are equal for B2B prospecting. Obvious when you say it out loud, but when you start crawling, the temptation is to go everywhere at once.
Here's what we observed across the communities that produced real signal:
- r/SaaS: lots of early-stage founders looking for tools, strong signal on acquisition and ops categories
- r/sales: reps and managers comparing stacks, budget usually already approved
- r/startups: weaker on immediate purchase intent, but useful for market context
- r/Entrepreneur: very broad, but high-intent posts are genuinely identifiable with decent filtering
- r/smallbusiness: underrated, direct decision-makers, less corporate noise
We now scan around 40 subreddits. But we started with 8 targeted ones and had enough flow to work with.
Why we didn't just start with LinkedIn anyway
I'll be blunt: LinkedIn scraping in 2024 is a technical nightmare. The official API is nearly useless for what we needed. Unofficial scraping means constant bans, CAPTCHAs, and maintenance overhead that never ends. And the signal you extract is still diluted by what the platform is designed for, which is personal branding, not the honest expression of real business problems.
We did eventually add LinkedIn to Novaseed's sources. But if we'd started there, we would have burned four months on plumbing for a disappointing signal/noise ratio. Reddit gave us readable results in three weeks.
If you're building any kind of intent detection system, start where people say what they actually want. Reddit wins that comparison and it's not close.
Want to see Novaseed in action?
Scan your site, get 20 prospects ready to buy. Free, 30 seconds.