NoSQL and CMS – a Match made in Heaven?

As anyone who’s visited planet Earth in the last year or so knows, the NoSQL (“Not Only SQL”) movement is rapidly gaining both momentum and mind share, despite a number of prominent detractors. Rather than entering into a lengthy debate on the general pros and cons of NoSQL technologies, I’d like to reflect on the possible applications of these technologies to the specific problems of content management, a use case that (to my mind) it seems particularly well suited to. I briefly scraped the surface of this topic in a prior post.

As discussed previously what is meant when we refer to a “CMS” varies quite significantly depending on the use case, and my initial focus will be on the impact of NoSQL on CMS’s that target the Web Content Management (WCM) use case, followed with a post on the impact of NoSQL on the Document Management (DM) use case.

Even within the seemingly narrow confines of WCM, we’re discussing (at least!) two different problem domains (Content Production Systems and Presentation Management Systems) that have markedly different requirements (and are arguably unrelated), and I’ll discuss the impact of NoSQL on each of these areas in turn.

Let’s begin!

Share via Share via Digg Share via Facebook Bookmark in Google Share via MySpace Share via Reddit Share via StumbleUpon Favourite in Technorati Share via Twitter

Published in: on 2010-06-24 at 10:19 pm  Comments (5)  
Tags: ,


While deconstruction is serious business for the curmudgeon, I occasionally like to take a break from the rigours of sowing chaos and discord by presenting some more constructive observations.

In this post I’d like to capture the mental picture I have of how Content Management fits together, neatly putting all of the pieces of the CM puzzle (DM, WCM, RM, AA, etc.) in their rightful place. As a bonus we will also learn how and why various products (including our good friend WordPress) fit into the Content Management menagerie.

I consider AA to be part of CM, as beer consumption appears to be an increasingly important part of the Content Professional’s technical proficiency.

A Hierarchy of CM Problem Domains

In my previous post I introduced the “Reversi Rule” and noted that for CM we came up with the rather broad definition of “the management of content”. To me this generality is a large part of the appeal of the term (particularly when compared to ECM, which is just downright confusing) – it generously includes the diverse array of human endeavours that could conceivably be classified as “Content Management”, it doesn’t say anything about what those specific problem domains look like (beyond requiring that they involve the “management of content”, for some reasonable definition of “management” and “content”) and it doesn’t exclude any of the broad range of actors who face these problems (including, but not limited to, enterprises).

So what specific value, then, does such a broad definition for Content Management provide us?

Perhaps I’m betraying my technologist background, but to me Content Management clearly forms the root of a hierarchy of increasingly specialised problem domains – in graphical format, this hierarchy might start to look something like this:


Note: this diagram does not attempt to capture all possible CM problem domains, although doing so would be an illuminating exercise.

This diagram clearly illustrates a couple of important points:

  1. A vast array of activities can be referred to as “Content Management”.
  2. Many of these use cases have unique and highly specialised requirements, particularly as we get closer to the tips of the tree.
  3. Some of the management activities we think of as being common across the hierarchy actually have quite different semantics depending on the specific problem domain (versioning requirements are very different between Docroot Revision Control and Records Management, for example).
  4. File / folder-centric definitions of content are only part of the content management picture.

A graphical treatment also helps to highlight part of the reason why we’re all having so much trouble agreeing on what “Content Management” really is – we all tend to operate down at different tips of the tree, yet throw around our specific problem domain as The One True Form of Content ManagementTM!

I think this gets to the root of Pie’s earlier loss of composure, yet he is arguably guilty of the same sin, albeit while standing on a different soap box.

What About the Technology?

Typically software products are a trailing indicator of business problems, so it’s no surprise to find that there are systems for almost all of the use cases identified on the diagram. In fact adding the word “System” or “Software” to most of the labels on the diagram will result in an extant product classification. There are a few exceptions (“Docroot Revision Control System” and “Structured Content Production System”, for example), however there are products on the market today that are admirably described by these two terms.

The Bonus Round

Going back to our (by now somewhat fatigued) example of WordPress, it clearly falls into the node labeled “Blogs”, and by adding “System” to the label we get “Blog System”. Sounds fair – I doubt anyone would dispute that WordPress is indeed a Blog System.

Now by looking at the diagram we can see that a Blog System is a specialised form of Presentation Management System, which itself is a specialised form of Web Content Management System, which is finally a specialised form of Content Management System. I can hear some incredulous voices: “are you asserting that WordPress is all of these things?”. Absolutely!

Let’s pick some more examples, to see if we can break this model:

  • Alfresco RM – clearly a Records Management System therefore also a Document Management System, therefore also a Content Management System.
  • Virage MediaBin – this is an easy one: the web site explicitly touts it as Digital Asset Management, so only one step and we arrive at Content Management System. NEXT!
  • Ektron eWebEditPro (here’s a potentially contentious one!) – again the web site tells us it’s HTML Editing Software, therefore a Web Content Management System and a Content Management System.

Interesting eh? All these vastly different systems (we’ve just picked 4 that are completely different from one another), yet all of them provide specialised facilities for the management of content central to various different problem domains. They’re all Content Management Systems!

To paraphrase Drew Carey, next time you’re at a social event without companionship or sustenance, I’d encourage you to play “pin the CMS tail on the product donkey” (allowing yourself the ability to extend the hierarchy above with categories that I left out) – I think you’ll mostly find it a trivial exercise.

In Conclusion

At this point you might still be asking yourself what all this means and whether there is any real value in such a broad definition for Content Management.

My answer to that would be that an inclusive definition such as this one comes closest to the true meanings of the words “Content” and “Management”, without requiring us to open the can of worms that would be involved in trying to define these two words in detail (which is impossible anyway, since their precise definitions depend on the specific problem domain).

More importantly, by not requiring us to come to some global agreement about what “content” and “management” mean, this definition can help us move beyond the historical divides within the profession (notably the divide between the Web Content Management and Document Management camps), by giving us common terminology that is compatible with how these terms are used today by all camps, while also being sufficiently well defined that everyone knows what’s implied (and just as importantly, not implied) when someone make an assertion such as “Microsoft Word is a Content Management System”.

Published in: on 2010-05-07 at 5:58 pm  Comments (3)  
Tags: , , , , , , , , , , , ,

Bottom 10 Predictions for 2010

In mid December, Justin Cormack (of Technology of Content fame) and I agreed to do a little retrospective on the worst things we’d seen during 2009.  While Justin wasted no time in submitting a great post on the topic, I blinked on or around December 15th and when I opened my eyes it was January 2010 already, so in an attempt to regain Justin’s respect I offer a humble “Bottom 10 Predictions for 2010”, in no particular order:

  1. Content Management RFPs will continue to be liberally strewn hither and yon like confetti at a WASP wedding, with zero regard for their efficacy or otherwise.
  2. There will be at least one spectacular† flameout when an otherwise solid WCMS is shoehorned into a completely inappropriate use case, and then explodes in a shower of shiny sparks. This will be a direct result of WCM continuing to be misidentified as a single commoditised use case – an issue that has been beaten to death in the past, but still has no remedy in sight.
  3. Map / reduce will be touted as the One True Answer to Everything, while RDBMS stalwarts struggle in vain to calm the hysteria and remind everyone that it’s “merely” a handy way to crunch large data sets.
  4. Suffering indigestion, OpenText will divest themselves of at least one of their product portfolio.  My money is on the former Tower products acquired via the Vignette purchase – they were a going concern before Vignette acquired them, and have remained relatively independent since then.
  5. SMB / CIFS will remain the most popular remote filesystem protocol in existence, despite being the worst (except for all the others).  (with apologies to Winston Churchill)
  6. JCR will continue to be ignored by all of the “top shelfCMS vendors – rapid adoption of CMIS on the server side will amplify the collective yawn that JCR evokes amongst CMS vendors.
  7. Momentum will continue to build on the CMIS client side – I humbly disagree with Pie’s #3 prediction for 2010 on this point, as there’s already been significant CMIS client development, and that’s before the spec has even been ratified.  (I realise this is more of a “top” prediction, than a “bottom” prediction, but even a curmudgeon has his sunny moments!)
  8. The Cloud will be the teenage sex of 2010 – everyone says they’re doing it, but only a small subset really understands what’s involved and an even smaller minority of those are actually doing it “for real”.  (note: The Cloud narrowly beat out Enterprise 2.0 for this title – I expect to see a great return match next year!).
  9. (speculation) Ektron will be acquired by Microsoft in order to jump start Sharepoint’s WCM capabilities.
  10. (super speculation!) Duke Nukem Forever will be released and will cure all those evil Wiiborgs of their sordid, physically demanding “gaming”.

And there you have it – my bottom 10 list for 2010.  I hope this is enough to regain Justin’s respect, although I may have gone a little overboard with one or two items on this list.  😉

† “Spectacular” meaning it will become a cautionary tale in CMS circles.

Share via Share via Digg Share via Facebook Bookmark in Google Share via MySpace Share via Reddit Share via StumbleUpon Favourite in Technorati Share via Twitter

Published in: on 2010-01-07 at 4:32 pm  Comments (3)  
Tags: , , , , , ,