Going Through Multiple Accelerators: The Data Might Surprise You

There's been a lot of debate recently about the quality of startups that go through multiple accelerators. This all kicked-off with Sam Altman's post Getting Into YCombinator which discouraged the practice. Responses included Dave McClure's Plays Well with Others and Calcanis's Incubator Hopping.

"We now have enough data to know that the track record of companies that go through multiple accelerators is much worse than companies that just do YC." - sama

Unfortunately other than the above statement (with undisclosed data) a lot of this debate was driven by personal opinions rather than any firm evidence. I've done a fair amount of work analyzing startup data both for fun and profit so I naturally wanted to get a better answer. I started by pulling together a couple of data sets on seed accelerators, one a personal one I've being building for a while and secondly Jed's excellent Seed-DB and started crunching them.

After filtering out the edge cases such as startups going through multiple cohorts of the same accelerator and startups that have gone  through three or more accelerators I was left with a data set of 132 startups who have been through two accelerators  (16 of which went through YC after another accelerator).

Out of those 132 there are two clear successes (>$100m valuation) - Sphero (which has raised around $100m in investment) and GrabCad (acquired for ~$100m). PagerDuty (around ~$40m raised) likely falls into this group as well. Roughly around 1-2% of all accelerator alumni end up at >$100m so from this initial analysis it doesn't appear going through multiple accelerators is a strong negative signal.

However the problem with this approach is that it doesn't adjust for cohort age (older companies naturally tend to be larger) or more importantly the quality of the accelerator. We'd expect the alumni of top-tier accelerators to be significantly better than those from an average accelerator in any case. Another issue with this approach is that we're talking about a tiny handful of companies so the numbers can be very easily skewed.

I got around these problems by creating a dataset that reasonably represents a group similar to our "double accelerator group". For each startup in our group I randomly picked another startup that was in the same cohort/accelerator to control for that effect and added it to the comparison group. I then generated several hundred comparison groups so I could measure the variance we would expect to get from random luck. I then segmented the groups by bands of how much money they had raised (using that as a proxy for progress) and measured the percentage of startups that fell into each band. 

The green diamonds show how the double accelerator group performed compared to the boxplot showing startups going through their first accelerator. Not only is it not a negative signal, but overall companies that go through multiple accelerators tend to do meaningfully better than average for raising money <$20m and the same as average for raising above that (as these cohorts mature I wouldn't be surprised to see the >$20m raises also increase to above average).

While startups going through a second accelerator might do better in general the same might not apply to YC; after all YC is somewhat unique among accelerators. So I reran my analysis using only the 16 YC companies [1] comparing them against their own cohorts.

While there are some differences we again see that overall startups for whom YC is their second accelerator tend to be more likely to raise at each band then the average YC company. 

In practice almost all of the returns for an accelerator come from a handful of unicorns and we can't know at this stage if any of these companies will become unicorns, but based on the early date there's no evidence that they won't come out of this group. 


Footnotes

This should come with the proviso that YC almost certainly has better data than I do on their portfolio (I'm likely missing a number of YC companies which have been through other accelerators) but I'd encourage YC and any other accelerator looking at this to make sure they're analyzing their data in a reasonable way. I'm more than happy to share my modelling code with any accelerators which would find it useful.

[1] The 16 companies for reference were Bagaveev Corporation, Final, CribSpot, Labdoor, Chariot, Seva Coffee, Nomiku, Valor Water Analytics, Leada, Plate Joy, uBiome, FlightCar, Vayable, chute, MarketBrief and PagerDuty.



Open Source Business Models

I’ve had a few startups ask me recently regarding the pros-and-cons of going the open source route especially regarding the types of business models that are viable for open source based companies.

Given how influential open source has been over the last two decades, surprisingly little has been written in the public domain on the topic so I’ve put together a broad summary.

It focuses on business models that could enable companies to reach billion-dollar+ valuation levels (and so are VC investable) which means it's specifically excluding some models which can work for smaller scale companies (i.e. donations, grants, advertising, etc.)

I’ve illustrated the models with examples, these include companies where the company hasn’t been the primary developer of the underlying open source product but still has the product at the core of their business. 

There are essentially only six open-source business models that either have already produced billion dollar startups or look likely to do so within the next decade:

Support and Services

Also known as the “Red Hat Model” it’s based around the provision of a services business for users of the open source product. Typically this includes support contracts, training and certification, consulting and customisation work. 

Example vendors include RedHat and Hortonworks.

One of the original open source business models it now faces some concerns over the low margins and issues of competitiveness. While the company that creates an open source project has some competitive advantages when selling services (direct access to core engineers and product mangers, brand, lead-gen, etc.) there’s still significant risk in competitors underpricing due to not having to pay for the products development costs (i.e when Oracle significantly undercut RedHat for Linux support work).

Most open source companies will offer some level of services although increasingly startups are now waiting until they reach a level of maturity before building out a significant services (read “consultancy”) business.

Earlier stage startups are tending to offer services limited to specific areas which have additional value-add to the product business (training of third party vendors, support and bug-fixes).
 

Hosted Model 

The hosted or “managed’ model is where the vendor provides a hosted version of the open source product.

These businesses tend to largely fall into traditional SaaS (Wordpress, Odoo) or IAAP (Joyent, Heroku) models. In many cases this is combined with the "Value-Added SaaS" model where proprietary offerings are combined with the hosting service in order to provide a competitive defence against lower cost competitors offering equivalent hosting services.

Open-Core Model 

With open-core the core part of the product is open-source but with the vendor offering an enhanced version as proprietary commercial software.

Typically this is done by having a “core” version of the product which is open source and then an “enterprise” version of the software which includes features such as analytics, security, auditing, etc. which are likely to be required by high-end customers. Vendors using this model include Cloudera, SugarCRM and Mongodb.

In some cases vendors will offer these features as add-on products rather than as a drop-in replacement to the main product (for example ElasticSearch).

This business model is similar to that of traditional proprietary software vendors. However there exists a risk of conflict-of-interest when a vendor is deciding whether a feature should go into the core or enterprise edition of the software which can fuel resentment among users of the core product.

There’s also the risk that other third parties will develop open source alternatives to the enhancements.

Value-Added SaaS

One of the newest models is the Value-Added SaaS model which is similar to the open-core model but where the proprietary software is provided as a SaaS service.

This model is particularly effective where the SaaS service has a network effect as it can create a strong defence against new vendors entering the market.

Vendors taking this route include GitHub and Docker (the later has yet to release their SaaS offering).

Marketplace Model 

The Marketplace Model is where the vendor creates a platform for commercial products or services that relate to the open source product. The platform then charges a commission on transactions.

The most successful use of this model has been Google Android with their Play store, but other vendors have included Themeforest (Themes for Open Source CMS systems) and historically OpenX (ad-exchange integration into their open source ad server).

Again due to the strong network effects implicit in marketplaces we’re likely to see more open source startups with this model in the future.

Dual-Licencing 

Dual-licencing is the model where the vendor relicences the open source project under a traditional commercial propriety licence for a fee. Vendors have included MySQL and Trolltech.

At one time this was considered to the be the future of open source businesses (Gartner once stated “By 2012, at least 70% of the revenue from commercial OSS will come from vendor-centric projects with dual-license business models.”) — however this model is now largely considered dead.

Historically there were two major factors driving this model (1) The dominance of usage-restrictive and viral open source licences which restricted commercial integration (2) Companies being uncomfortable with using open source software.

However most modern open source projects now favour Apache or MIT style licences which are much less commercially restrictive and open source software has now become mainstream and widely accepted into businesses.

Some modern open source companies such as MongoDB still offer dual-licensing but it’s generally regarded as a service to enable customers who would otherwise be unable to use the product rather than a core revenue stream.

Markets for Marketplaces: Future Disruption

"Uber for X" has become commonplace model for startups but often such startups overly-focus on ease-of-delivery rather than what made Uber truly disruptive. Uber built a marketplace in a highly fragmented market that allowed them to change the entire dynamics of an industry. The taxi industry is only one of many industries that are ripe for disruption from online marketplaces.

Much of the money currently spent on mainstream advertising (Google, Facebook, traditional media, etc) is by vendors who would be better served by industry specific marketplaces; and it's only a matter of time before those marketplaces appear.

That naturally raises the question of what makes a market suitable for an online marketplace. To answer that question I’ve been working on a framework to help evaluate markets based on their intrinsic properties.

The framework is based upon analysing a large number of online marketplaces and identifying the key factors that differentiate the successful ones from the failures and understanding why those factors played the role they did. I’ve broken the framework into looking at the properties of the demand-side (i.e the customers/buyers), the supply-side (i.e the vendors/sellers) and the transaction:

Demand-side 

Multiple-transaction behaviour

Do customers make multiple transactions within a reasonable time-frame (i.e. several+ transactions a year) ?

  • If customers are regular buyers it means a marketplace can amortise the cost of customer acquisitions over multiple transactions.
  • Lack of this behaviour means high customers churn (thus lower lifetime value) and a constant need to acquire new ones just to maintain size. 
  • Repeat buying also allows for stronger brand building and building a relationship between the marketplace and consumer.
  • If a marketplace can’t extract more value from a customers acquisition than an individual vendor can then it reduces the incentive for vendors to be on the marketplace.
  • In some cases bundling together of related markets can create multiple-buying behaviour when the individual underlying markets may not have it. For example while the markets for plumbers, electricians, locksmiths, etc. might not exhibit regular purchases the aggregate market for hiring tradesmen might do so.

Low vendor-loyalty

Do customers regularly switch between vendors ?

  • High-vendor loyalty can have much the same impact as a lack of repeat buying as both parties have a strong incentive to take future transactions off-platform .
  • Some markets have intrinsically low vendor loyalty as consumer needs vary between transactions (for example for with AirBnB people switch vendors because they travel to different locations or Just-Eat because people don’t want the same takeaway all the time).
  • High vendor loyalty can be countered with strong on-going value-add beyond the initial connection (Escrow, insurance, ease-of-purchase are common “value-add” techniques) or owning the relationship with the customer (i.e. essentially anonymising the vendor actually providing the service like HomeJoy do).

Large market

Is there customer spending of $10bn+ in the market ?

  • As a rule of thumb the valuation of marketplace startups is roughly equivalent to their gross transaction volume (Fred Wilson has a good article on marketplace valuation)
  • Most VCs are looking for startups with the potential to reach $1bn+ valuation which implies a market an order of magnitude bigger than that. 

Expandable demand

Is there "untapped demand” that can be unleashed by a marketplace ?

  • Some markets have “untapped demand” - demand that exists but doesn’t convert into transactions because of complexity in purchase process. 
  • Common examples include markets where consumers are worried about quality/risk (e.g. peer-to-peer vacation rentals) or where finding the right vendor is hard (e.g. handmade items). 
  • In markets with pricing elasticity demand can be expanded by marketplace reducing cost of transactions (through competition, optimisation, etc. for example UberX) 

Consumer Experience

Are there key problems in the consumer experience that can be solved ?

  • Trust is a key-example; in many markets where marketplaces have succeeded there have been significant trust issues around factors such as personal safety (Uber, AirBnB, etc.) and remote transactions (Ebay, oDesk, etc.) where marketplaces have put in place trust structures (vetting, reviews, monitoring, etc.) that improve customer experience.
  • Lack of transparency/feedback is another common problem that marketplaces solve. GrubHub provides live-tracking of food deliveries; fiverrr shows average response time of service providers and Linkedin benchmarks job applicants against other applicants.
  • Improving consumer experience can allow marketplaces to charge a premium to consumers who would otherwise be incentivised to transact off-platform (Instacart for convenience; StubHub for guaranteeing ticket legitimacy).

Supply Side

Fragmented supply-side

Is there a fragmented supply-side with small vendors controlling a large part of market ?

  • A fragmented market typically means higher customer acquisition costs for vendors. Marketplaces deliver benefits of scale and marketing and give vendors more power to compete based on their service/product rather than on customer acquisition strategy.
  • Decision making for consumers is often complex due to variety of choices and difficulty in comparing vendors. Marketplaces add a lot of inherent value by making finding and comparing vendors easier.
  • Larger vendors have lesser need for marketplaces, have disproportionate negotiating/pricing power and prefer direct relationship with consumers. In many such industries (i.e. insurance, budget airlines) vendors go out of their way to even stop price aggregators so it can be tough environment to operate in.

Low-conversion/High-conversion

Is the market low-conversion for individual vendor but high-conversion for the overall transaction ?

  • Some markets (e.g. housing) are defined by high-levels of shopping around between vendors before a transaction takes place; marketplaces have a significant advantage in such markets because their overall conversion rate is going to be higher than that of any individual vendor giving an advantage for customer acquisition.

Supply-side servicing

Are there unique “add-on” services that a marketplace could deliver to their supply side because of their position ?

  • OpenTable offers restaurant management software, Ebay Motors offers consumer financing and Amazon Marketplace offers warehousing. In all cases they’re services that the marketplaces are in a unique position to offer and cause significant stickiness (discouraging vendors from moving elsewhere) as well as bringing in additional revenue streams.
  • Generally these add-on services are added as the marketplace matures rather than from the start and typically only form a small segment of overall revenue, but in maturing markets can provide significant defensive barrier against new entrants

Expandable Supply

Would a marketplace reduce the barrier-to-entry into the market ?

  • If a marketplace allows new vendors to enter the market or for vendors to service part of the market that was inaccessible to them before this can cause a fundamental economic shift in the market allowing for more varied offerings and more competitive pricing.
  • With some marketplaces it’s not unusual for customers to become vendors (Etsy, Ebay and AirBnB are all examples where this happens)
  • Such vendors can give marketplaces significant boost because they’re often exclusive to their platform and thus have closely aligned interests. 

Transaction

On-Platform Transactions

Can the entire transaction be captured on-platform ?

  • First-generation marketplaces (craigslist, autotrader, yelp, etc) were essentially lead generation tools with the transaction happening off-line; modern marketplaces have tended to focus on markets where the transaction can be captured online.
  • If a marketplace doesn't have access to the whole transaction it forces them to rely on intermediate stages. Typically this results in optimising for lead generation rather than for completed transaction. This can result in misaligned incentives, such as job boards focusing on maximising number of job applications rather than quality of applicants.
  • Without knowing the value of the transaction marketplaces are forced to price generically. They have to choose between over-charging for low-value transactions pushing them off the platform or underpricing for high-value transactions where they can’t capture the value being delivering to the vendor. 
  • In practice successful off-platform marketplaces use flat-fee or subscription models that tend to underprice the value delivered to high-value transactions but operate in high-value markets (housing, autos, recruitment, etc.) so even pricing for the low-end of the market results in decent profits.

Market open to a Marketplace

Are the market participants willing to move to an online marketplace ?

  • Some markets are reluctant to move online. Typical characteristics of such markets are being relationship driven, high-levels of secrecy, price negotiation and high-customisation.
  • A common case is markets where publicly advertising availability causes negative signalling  Employment market for high-profile individuals (actors, sports people, company executives) and the M&A market are examples.
  • Another common case is when there’s significant pre-sales work to customise the transaction. Many consultancy and professional service businesses falls into this category.
  • Very few marketplaces have succeeded in bringing these types of markets online so far. Angellist is a notable exception but it’s not clear if it’s success is generalisable. Solutions may have to be very domain specific.

Willingness to pay

Are the market participants willing to pay for a marketplace ?

  • Either the vendor needs to be willing to give up a cut of their revenue or the consumer needs to be willing to pay a premium.
  • Taking commission from vendors is more common; generally in sophisticated markets where the pay-per-customer  model is the norm (through online advertising, referral fees, etc.) it’s fairly straight-forward to on-board vendors on this basis. In markets where this isn’t the norm and especially low-margin industries this can be much more difficult.
  • Consumers paying a premium is more unusual; having to pay more via a marketplace than they would if they went direct incentivizes the consumer to avoid using the marketplace. Hence the marketplace has to deliver significant added value if this is the approach taken.
Markets don't have to have all of these properties for a marketplace to be successful, but markets without a significant number of these properties have tended to struggle. This framework is very much still a work in progress so feedback would be very welcome.

Interested in Marketplaces ?  - Follow me on Twitter @imranghory - I can also be reached at imranghory@gmail.com

Unicorn Marketplaces

As part of my research into marketplaces I've been digging into the IPO prospectuses of three recent marketplace IPOs:

  • Just-Eat - International takeaway marketplace; IPO'd in April on the LSE at a market cap of $2.46bn 
  • GrubHub - US focused takeaway marketplace; IPO'd in April on the NYSE at a market cap of $2.04bn
  • Zoopla - UK based real-estate listing site; IPO'd in June on the LSE at a market cap of $1.54bn

The three make for an interesting comparison study with Just-Eat and GrubHub being very similar businesses and Zoopla being a very different one, giving some insight into what a modern "unicorn" marketplace looks like and what early-stage marketplace startups should be aiming for in the long term.

(All currency amounts have been converted to USD;  market cap as of 14th August; the ratio is marketcap/revenue rather than PE ratio).

While all three marketplaces have broadly similar revenues it's notable how much the valuations of GrubHub and Just-Eat have diverged post-IPO with GrubHub trading at significantly higher ratio (they have similar expenditure and growth figures). 

It's hard to say if the pricing is "bubbly" but it shows a clear geographic influence in pricing. Although it's not obvious how much of the GrubHub premium is because of it's dominance in the US market (the largest single take-out market in the world) and how much of it is down to the fact it's listed on a US exchange.

Either way the implication is that US marketplaces can enter the unicorn club with lower revenue figures than non-US marketplaces.

On a broader level if we take 12-13x as a lower bound for these ratios then that implies a marketplace needs to reach annual revenues of around $80m to get into the unicorn club.  Although it's worth noting that these ratios are for high-growth marketplaces which are the dominant players in their markets.

The Vendor Side

Although from the consumer-side takeaway and real-estate are very different businesses, the vendor side is much more comparable if we look at number of vendors, average revenue per vendor and revenue per lead:

(The revenue figures are slightly-off from the earlier revenue figures as they exclude revenues from other sources such as third-party advertising) 

An insight worth pulling out is that even though the Just-Eat/GrubHub charge using a commission model (typically charging around 10% per transaction) and Zoopla charge on subscription model (vendors pay a monthly fee) that the revenue/vendor/lead figures work out roughly the same. It's not a case of unicorn marketplaces having to use one model or the other.

Given that the definition of a lead is very different for the takeaway marketplaces (an order being placed) and for real-estate (an expression of interest in seeing a property) the revenue-per-lead being similar might be coincidental;  however there might also be some market dynamics at play that explain the similarity (i.e unicorn marketplaces that charge an order of magnitude less would require vastly more transactions making them much rarer).

The number of vendors and revenue/vendor that early stage marketplaces should be targeting becomes much more obvious if we graph the companies. As all the companies have similar revenues they fall roughly along what we can call the "unicorn line" - if a marketplace is serving a market in which it's hard to cross-above the line (i.e too few vendors; not enough value generated for vendors) then it's likely to be hard space to build a unicorn marketplace.

Interested in Marketplaces ? - follow me on twitter @imranghory

Marketplace Pricing

Marketplace pricing strategies tend to fall into one of three categories: 

  • Flat-rate fees - where a fixed price is charged typically per-listing or per-lead generated.
  • Subscription - where participants on one or both sides of the market pay a regular fee for access to the marketplace.
  • Commission - where a percentage cut of transaction value is taken.

Of these three models the commission approach has largely become the standard with flat-rate and subscription models being relegated to marketplaces where the actual transaction happens off-platform (housing, cars, recruitment, etc.) or where transactions aren't priced (data sharing, dating, etc.). 

This convergence on commission pricing has meant it's possible to broadly compare marketplaces serving different industries and how the level of commission varies with the nature of the transactions. So to do this I pulled together an analysis looking at the commission rates for twenty-five different marketplaces across six categories: 

  • Digital Products - Apple App Store, CreativeMarket, ThemeForest, 99Design (Ready-Made store), Shutterstock
  • Local Services - TaskRabbit, Uber, Homejoy
  • Remote Services - oDesk, Amazon MechanicalTurk, Fiverr
  • Physical Products - Etsy, Ebay, Just-Eat, Seatwave, Vinted, Amazon Marketplace, Asos Marketplace
  • Holiday Rentals - AirBnB, Wimdu, HouseTrip
  • Fundraising - JustGiving, CrowdRise, Kickstarter, Pledge Music

To make the commissions as comparable as possible:

  • Both demand side and supply side commissions are combined
  • Commissions are inclusive of credit-card/payment processing fees
  • Commissions are based on non-exclusivity and don't include volume discounting
  • In cases where fees aren't officially published they're calculated based on other publicly available data

Looking at the data this way it's hard to ignore how high-commission digital product marketplaces are - the low-end of the category being equivalent to the high-end in every other category. It's not clear if it's a case of profit taking or if the digital marketplaces face unique costs (vetting product quality, marketing, etc.) but given that newer marketplaces such as Creative Market are operating at a much lower (although still high) commission the high-end digital marketplaces look distinctly vulnerable to disruption. 

There's also a noticeable gap between the marketplaces for remote and local services with the later commanding a notable premium. While the obvious explanation would be local marketplaces requiring greater local presence and deeper supply side vetting, the holiday rental market faces similar challenges without the same level of markup. It'll be interesting to see if the rates get driven down as competition grows in the local services space.

One factor not taken into account here is transaction value and volume; in theory markets in which those are larger should result in higher revenue from smaller commission. However even once segmented by category it's not obvious that there is a correlation between those factors and commission levels. One reason may be that marketplace startups tend to either be "low-value high-volume"  or "high-value low-volume" with the two factors essentially cancelling each other out (presumably "low-value low-volume" markets are hard to build significant companies in and "high-value high-volume" markets extremely rare).

If you have any comments or are working on a marketplace startup I'd love to hear from you. I can be reached via email <imranghory@gmail.com> or twitter at @imranghory

What Publishers Think: Subscriptions, Self-Publishing and Discovery

This weekend was #FutureBookHack, a hackathon which brought together hackers that were digital leaders from top-tier publishers (Faber, Harper Collins, Pan Macmillan, Penguin Random House, and Simon and Schuster), as well as other industry experts including industry analysts (Nielsen), booksellers (Blackwells) and literary agents.

While The Bookseller has extensive coverage of the event itself, there hasn't been much coverage of the key themes that emerged not only from the formal presentations/workshops, but also from the more informal discussions that took place over the weekend.

Given such a unique collection of experts, there was a lot of industry insight which I've gathered together to share with a wider audience.

I've split it into - the more formal topics; (discoverability, audio books and children's books); the informal ones (Netflix for eBooks, Publishers vs Self-Publishing); and then rounded up some of the more insightful smaller points.

THE INFORMAL

SPOTIFY/NETFLIX FOR EBOOKS

Given how "hot" this topic has been over the last year, it was almost an inevitable that this was bought up a number of times. Almost always via a participant raising it to one of the publishers rather than vice versa.  

The overwhelming view from the publishers, was that the model didn't work for the dynamics of the book market. The fundamental argument being, that most readers aren't restricted to the number of books they read because of cost, but because of time.

While a music listener or movie watcher might significantly increase their consumption as the incremental price drops to zero, book readers are already close to their limit in how many books they can read.

For example, with music a single is only 2-3 minutes long, so with a subscription service the amount of genres to which a consumer listens increases significantly. An average book takes 10-20 hours of reading time and many consumers already purchase more books than they are able to read.

The fact that piracy hasn't occurred for eBooks in the way it has for movies and music, adds weight to the viewpoint that eBook consumption isn't price restricted.

As this is the case, it seems impossible for a pricing model to exist that would both make sense for publishers and readers. Readers can't be given more value for the same amount of money, and publishers can't justify the cannibalization of their sales market if subscriptions can't produce similar revenues. Publishers also lack secondary revenue streams (merchandising, concerts, etc.) that allow other industries to justify the low revenues from services like Spotify.

However, there might be exceptions for particular genres where reading patterns are different, and this has been demonstrated in a few markets. Comic books (via Comixology and Marvel Unlimited), and technical books (Safari) are both examples where readers are likely to increase consumption. They're also both markets where consumers often purchase physical editions of books they already have in electronic form, further reducing the impact of cannibalization.

There may also be other factors that play into market dynamics for specific genres beyond volume of readership. For example with academic books, a subscription model on the reader's side would be of value to students who would no longer have to carry around heavy books, and on the publisher's side it would solve the major problem of secondary market resales.

Some publishers are however looking at alternative ways of segmenting subscription markets. For example, book serialisation and models where consumers can subscribe to a particular author.

PUBLISHERS VS SELF-PUBLISHING

When self-publishing of eBooks first took off, there was significant concern in the industry that the self-publishing market could end up disrupting the publishing industry. This left many publishers in an existential crisis, questioning what they actually did, and how they added value.

As the eBook industry has matured the initial concerns have largely been allayed, with publishers finding that in general, self-published authors would prefer a traditional publisher if given the option.

Publishers now often sign up successful self-published authors, and source new books from services such as WattPad. Taking on such authors reduces the upfront risk of investing in a book, as the authors have already proven demand, so the overall impact has actually been positive for publishers.

Arising from these discussions was also the wider topic of what precisely publishers consider their “key value-add” and why authors still wanted to go with traditional publishers. Reputation, risk and expertise were the three key factors.

  • Reputation: as authors still want the prestige that comes with a reputable publishing house, and getting printed books into bookshops
  • Risk: as the the publishers take on the upfront risk in paying for advances and investing in editorial, marketing, and printing costs.
  • Expertise: not only covers both the services such as - editorial, proofing, and marketing, but also the publishers expertise, and position, in managing the overall process.

While publishers outsource many of the services that they provide to freelancers, they add value by managing those relationships. This both prevents exploitation (as authors aren’t expert buyers of these services), and removes direct commercial incentives from the equation (if the author was paying the editor directly it would impact the author-editor relationship).

THE FORMAL

DISCOVERABILITY

The ability for consumers to discover new books was something that concerned all the publishers, who were keen to see technology formalizing traditional methods of discovery (social recommendation systems, aggregation of professional reviews ala Metacritic), as well as more innovative approaches.

The general feeling, was that current discovery approaches had significant weaknesses. Such as best sellers lists (both online and off) generally being rigged for editorial and commercial reasons, and recommendation engines (typically of the form of “people who bought x often bought y”) being skewed against new and more unusual books.

There was also interest in how improvements in book metadata could result in better discovery (i.e applying SEO techniques to optimize books appearing in the right searches), with most metadata currently being optimized through human expertise, rather than via data-driven approaches.

In part, it felt the concern over discoverability arose from the power that Amazon had over owning the key search, rankings, and book recommendation systems (both via Amazon.com and also via subsidaries such as GoodReads and Audible), and an interest in reducing the dependency on a single vendor.

AUDIO BOOKS

Channels to market was the key topic among the publishers addressing audio books. While consumer demand is growing, there was visible frustration in the difficulty publishers faced in getting their product to customers - both in terms of discovery and delivery (large file sizes, clunky software).

Audio books form a unique set of challenges due to the way the market is fragmented at the moment. A literature student listening to an author reading his/her own work, gets a significantly different take on the work, as against a listener who opts for audio books while driving, or from a parent who uses a children's audio book as a substitute for a bedtime story.

At the moment, the majority of the digital audio book market is core genre adult fiction. This is partially due to the dominance of Audible, who's subscription model and marketing focus is on that group.

Audio book production is expensive to do properly as it requires studio time, a producer, and a voice actor. Due to this cost, base audio books are generally priced on the basis of length, although there is significant variation in how each is priced by the retailer.

So while audio rights to books are typically inexpensive, the high fixed production costs, the nascency of the market, and the fact that audio books sales don't seem to cannibalize the sale of non-audio versions, makes publishers keen to broaden the appeal of this medium.

The overwhelming interest from publishers, was to see new services that made it easy for consumers to access audio books, and many were actively looking into technologies such as streaming, and more consumer-friendly business models.

CHILDREN’S (PICTURE) BOOKS

An unexpectedly hot area was children’s books, one of the few areas of publishing which is seeing a significant growth in print. It is also an area in which publishers are keen to see digital developments that will allow them to reuse their image assets for incremental revenue.

Due to the picture-heavy and interactive nature (pop-ups, pull-tabs, etc.) of these books, publishers are finding existing eBook formats and tools ineffective in bringing these books to digital platforms like the iPad.

AND FINALLY....

There were also number of smaller insights raised that deserve a mention:

  • Several publishers had experimented with building apps tailored around specific books offering enriched content, but found it wasn't successful as it was difficult to build into workflow, and also expensive with costs not being recouped from sales.
  • In general, enriched content (author interviews, extra background, etc.) was hard to sell. There was not much consumer interest in paying for it.
  • Industry experimenting with an eBook first model, where a traditional publisher publishes the eBook first and looks how it performs before deciding if they want to go to a print edition.
  • The creation of a long-tail market by growth of eBooks and print-on-demand is often overlooked. It used to be 100k unique books that were actively selling, now it’s 1m+.
  • Long-tail generates revenue, but not in economically viable amounts for anyone, but is useful for residual revenue from books that have done well in the past and require no further investment.
  • Most publishers make the core of their money from “mid-tail” books with blockbusters being “bonuses”.
  • The ebooks market is shifting to multifunction devices (phones, tablets), and away from eReaders. eReader users often read on other devices as well.
  • On mobile devices eBooks have to compete against not only other eBooks, but also other forms of entertainment (music, games, etc.) available on mobile devices - often for a very low cost.
  • Some print book genres have been decimated not by eBooks, but apps taking their place. For example, maps (replaced by Google Maps) and travel guides (replaced by Tripadvisor).

How YC S12 Companies Make Money

Following on from Y Combinator's Demo Day yesterday I ran an analysis on the companies that presented looking at how they made money. I looked at two factors:

  • Whether they were primarily b2b or b2c
  • What business model they had (if any) 

    The Data

    (this data excludes companies which didn't present "on-the-record" at demo day)

    The definitions I used for classifying startups:

    • Subscription - Startups that charge a regular flat fee (typically monthly) for their services, most typically SaaS services.
    • Marketplace - Startups dealing with multi-sided markets where a commision is taken when participants on different sides of the market transact with each other.
    • Advertising - Startups which generate money through selling advertising space or through lead-gen.
    • Pay-as-you-go - Startups using metred charging models where users are charged based upon usage. 
    • Retail - Startups making one-off sales in the traditional retail transaction model. 

    The Analysis

      The split between startups targeting businesses and consumers seems to be pretty even, in an article last month I covered how AngelPad strongly prefers b2b startups; this doesn't seem to be the case with YC.

      Only a quarter of the startups that presented had no business model to date, although that rises to forty percent if you look purely at the consumer startups which suggests YC is still open to funding consumer startups that have the potential to be massmarket without a clear revenue stream.

      Perhaps most striking was how popular the marketplace model is; historically YC have funded relatively few marketplace startups (presumably on the basis that the differentiator between marketplaces is down to traction and marketing rather than technology). However the huge success of AirBnB and other marketplace startups (Etsy, KickStarter, etc.) in recent years has possibly made them rethink their stance. 

      (This research wouldn't have been possible without the DemoDay coverage provided by both Dan Shapiro and Techcrunch so thanks to them both)

      If you enjoyed this article why not follow me on twitter @imranghory

      Applying to AngelPad

      Last night I attended a poker game sponsored by AngelPad (and kindly hosted by Pusher) who were looking to promote AngelPad as an accelerator to UK based startups. Thomas (founder of AngelPad) kicked off the evening by giving an introductory talk via Skype and holding a short Q&A session.

      As there was a lot of useful information in that session I decided to do a write-up to help anyone thinking of applying who wasn't able to attend. Their deadline for their next season is this sunday and you can apply here.

      I've add my own comments in [square brackets] to distinguish what Thomas said and my own take on what was said. 

      Keypoints

      On applying:

      • They typically get 2000 applications for 12 places
      • 90% of the application comes down to the video, some people who review the application may only see the video and nothing else. It's worth doing multiple takes to get it right.
      • What they expect to understand from the video is: team, business plan and what the startup needs to succeed
      • You'll only hear back if they're interested in you, they don't send rejection emails

      On the startup:

      • AngelPad don't care about revenues or sector per-se, but they care about the core thesis of the business [presumably on the assumption that the actual product can pivot if the basic problem being tackled is significant]
      • Majority of AngelPad companies are b2b
      • Most consumer startups that apply to AngelPad aren't going after big enough markets
      • They're primarily interested in startups where the technology is the distinguishing factor
      • A measure AngelPad use is: Would this be an acquisiton target of a big tech company (linkedin, facebook, google, etc.)
      • They want startups with big potential [I read this as being startups capable of reaching $50m+ valuations given most of their existing startups are raising money in the $5m-$8m valuation range]

      On the team:

      • There needs to be developers on the founding team
      • Team needs to have leadership/management potential

      On what stage startups should apply:

      • They want teams that are already commited to the startup and are already working on it full time (i.e aren't still in day jobs).
      • Under a year old, they worry that startups that are older than that tend to lose momentum and drive.

      On international applicants:

      • For international teams they're only interested in startups that plan to relocate to the US on a permanent basis [note this is different from YC where a number of notable startups like SongKick and Lanyrd have returned to the UK after the program]
      • They think it's too hard for startups to raise investment from US investors if they don't plan to stay in the US
      • They've had 15 non-american teams go through AngelPad so far, 13 of them are still in the US.

      On British applicants specifically:

      • They've had three British startups (Postmates, Vungle, and Rolepoint) [they seem to have missed Buffer from their list]
      • They've found British startups do better than other international startups, possibly due to not having to deal with a language barrier.
      • They like British founders as they tend to have a greater focus on revenue

       

      Seed Stage VCs in London

      One complaint I hear from startup founders raising seed rounds is that it's often hard to know which VCs are open to doing early stage seed round investments and what size of investments they make.

      Hence I've been doing some research and the following is a list of all significant VC funds who've made angel investments in London over the last year and a list of the companies they've invested in - in many cases the VCs in question have done a mix of seed and later rounds; the companies I've listed are specifically ones they've invested in at a seed level.

      Where a fund has explicity stated the size of investments they make I've used those numbers, in other cases the numbers are estimates based upon the reported size of investments they've made in their previous portfolio companies.

      Two VC funds in particular stand-out from the crowd in terms of the number and nature of the investments made: Passion Capital (who primarily focus on early stage investment) and Index - both of whom are definitely worth considering for any startups seeking to raise significant seed rounds.

      The List

      Passion Capital (typical seed investment size: £300k-£500k)

      2011 Investments: Luluvise (female social network), WireWAX (video tagging), EyeEm (photo sharing), Adzuna (classified ad aggregator), Pusher (realtime web infrastructure), GoCardless (payment system)

      Other notable investments: Mendeley (academic social network), smarkets (gambling), flattr (micropayments)

      Accel (typical seed investment size $500k+)

      2011 Investments: FantasyShopper (social shopping game), QRiously (real time sentiment analysis)

      Amadeus (typical seed investment size: ~1m)

      2011 Investments: TrialReach (clinical trial recruitment), oneDrum (Document collaboration)

      Atomico (typical seed investment size: £300k-£550k)

      2011 Investments: Ge.tt (file sharing), Silk (structured content), Siine (virtual keyboard), Fashiolista (fashion discovery), Hailo (taxi app)

      Index (typical seed investment size: $300k-$2m)

      2011 Investments: Reality Jockey (augmented music), EDITD (fashion trend analytics), Lightbox (photo app), Geckoboard (information dashboard), Lanyrd (conference discovery)

      Other notable investments: SnapTalent (job ad network), netvibes (social media dashboard)

      Octopus Ventures (typical seed investment size: £250k-£800k)

      2011 Investments: CertiVox (secure content control)

      Previous investments: TouchType (virtual keyboard), Secret Escapes (luxury travel), True Knowledge (natural language expert system) and Graze (postal snacks) (I've been told the last two were later than seed stage)

      Profounders Capital (typical seed investment size: £500k+)

      2011 Investments: Luluvise (female social network), Applifier (ad network), Lanyrd (conference discovery)

      Wellington Partners

      2011 Investments: EyeEm (photo sharing), Hailo (taxi app)

      I've not included the following VC funds because I've been unable to identify any seed round investments they've made since the start of 2011: Balderton Capital, Dawn Capital and Eden Ventures, The Accelerator Group.

      Apologies to anyone I've missed out or any mistakes in the above; feel free to contact me with corrections/additions at imranghory@gmail.com or @imranghory on twitter.

      Why Seedhack Missed Its Mark

      Last weekend Seedcamp ran their inaugural seedhack, a weekend hackathon style event bring together industry specialists and developers to form new startups. Seedhack was pitched as a much more "serious" event compared to similar events such as Startup Weekend and Launch48, with the aim of having real startups emerge from it. 

      Startup Weekend and Launch48 are much more pushed towards having fun, learning and networking. That some real startups such as Zaarly have emerged out of them has been more in the way of a bonus for these events.

      Seedhack bought in industry speakers, encouraged small team sizes and tackling of "real" problems, had paperwork to deal with ownership issues, and filtered the people attending (asking potential attendees if they were in a position to create a new startup, etc.). They also had two themes Healthcare and Big Data to help focus ideas on real problems for the weekend.

      Unfortunately it doesn't seem to have worked, the products built were mostly similar to those found at any other event (apps for dating and coordinating meetups with your friends) and the follow-on rate (the number actually becoming startups) is likely to be similar.

      What seedhack could have done better

      • The Themes

      The themes (healthcare and big data) were announced after most people had already signed up.  It was clear many of the attendees had little or no interest in these themes. 

      There was also a clear lack of inspiration when it came down to ideas for healthcare startups because there were so few people (either on the developer side or the business side) who had worked in healthcare.  It would have made much more sense to have the themes decided upfront and the event promoted it on that basis (as for example the Education themed Startup Weekends are doing). It would have produced much more "aligned" interest among the crowd and likely result in far more innovative ideas.

      • The Speakers

      The theme speakers while very interesting, were probably at a bit too high level for the event. As someone asked the healthcare speakers in the Q&A "but what are the problems you want us to solve?". There were also several vague references to healthcare APIs but it would have worked much better if speakers had gone into details of the APIs. Also it would have been better if the theme speakers went before the more general technical speakers as it would have allowed the audience to be thinking about theme related ideas that could be tackled with the technologies being talked about. 

      • The logistics of team creation

      The logistics of creating teams didn't work very well. Seedhack took the approach of having a forum for people to post their ideas and online voting. While the concept was sound, in practice this really didn't work with only a handful of people even voting on the forum. The flakey wifi in the room and lack of 3g signal probably made it difficult even for those who wanted to vote to do so. Team formation was done in breaks without any real co-ordination and it wasn't really clear what was going on.

      Launch48 and Startup Weekend both use a similar format which works much better: Anyone who wants can standup in front of everyone and deliver their pitch, followed by a manual voting process, typically either a show of hand or by use of post-it notes. The top voted pitchers then move to separate parts of the room holding signs indicating what they pitched and attendees can go and find the teams they want to join.

      In fairness L48 and SW have both run dozens of events and know from experience that online voting tends not to work (not just because of technical difficulties but offline voting also makes attendees more invested in the ideas they voted for - an important psychological aid to team building), but seedhack should draw on the expertise of these other events.

      • Focus on business development

      From speaking to a lot of the attendees business development and product management fundamentals was one area that many were lacking. While this was partially made up for by the mentors, it seemed many of the teams failed to ask fundamental questions such as "why is this better than the other solutions to the same problem?". While this doesn't really matter for weekend projects, if seedhack is seeking to inspire real startups out of it they should encourage participants to think about these issues.

      While I don't want to seem overly harsh about seedhack especially as this was their first time running it, I do think their concept of a more serious version of L48/SW makes a lot of sense and has potential, so hopefully they'll iterate in true startup style and the next event will be closer to its mark !