• Cassandra

    Analysis and discussion of the open source data management project Cassandra. Related subjects include:

    August 22, 2017

    Imanis Data

    I talked recently with the folks at Imanis Data. For starters:

    Read more

    August 7, 2016

    Notes on DataStax and Cassandra

    I visited DataStax on my recent trip. That was a tipping point leading to my recent discussions of NoSQL DBAs and misplaced fear of vendor lock-in. But of course I also learned some things about DataStax and Cassandra themselves.

    On the customer side:

    Customers in large numbers want cloud capabilities, as a potential future if not a current need.

    One customer example was a large retailer, who in the past was awful at providing accurate inventory information online, but now uses Cassandra for that. DataStax brags that its queries come back in 20 milliseconds, but that strikes me as a bit beside the point; what really matters is that data accuracy has gone from “batch” to some version of real-time. Also, Microsoft is a DataStax customer, using Cassandra (and Spark) for the Office 365 backend, or at least for the associated analytics.

    Per Patrick McFadin, the four biggest things in DataStax Enterprise 5 are: Read more

    July 19, 2016

    Notes on vendor lock-in

    Vendor lock-in is an important subject. Everybody knows that. But few of us realize just how complicated the subject is, nor how riddled it is with paradoxes. Truth be told, I wasn’t fully aware either. But when I set out to write this post, I found that it just kept growing longer.

    1. The most basic form of lock-in is:

    2. Enterprise vendor standardization is closely associated with lock-in. The core idea is that you have a mandate or strong bias toward having different apps run over the same platforms, because:

    3. That last point is double-edged; you have more power over suppliers to whom you give more business, but they also have more power over you. The upshot is often an ELA (Enterprise License Agreement), which commonly works:

    Read more

    July 19, 2016

    Notes from a long trip, July 19, 2016

    For starters:

    A running list of recent posts is:

    Subjects I’d like to add to that list include:

    Read more

    October 15, 2015

    Cassandra and privacy requirements

    For starters:

    But when I made that connection and checked in accordingly with my client Patrick McFadin at DataStax, I discovered that I’d been a little confused about how multi-data-center Cassandra works. The basic idea holds water, but the details are not quite what I was envisioning.

    The story starts:

    In particular, a remote replication factor for Cassandra can = 0. When that happens, then you have data sitting in one geographical location that is absent from another geographical location; i.e., you can be in compliance with laws forbidding the export of certain data. To be clear (and this contradicts what I previously believed and hence also implied in this blog):

    Read more
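
    To make the replication-factor point concrete, here is a minimal sketch in Python. It assumes a keyspace using Cassandra's NetworkTopologyStrategy, and it treats a data center with a replication factor of 0 as one that is simply left out of the keyspace definition. The keyspace and data center names (eu_customers, eu_frankfurt, us_east) and both helper functions are hypothetical, not taken from the post or from DataStax.

```python
def replicas_by_datacenter(replication):
    """Return the data centers that would hold at least one copy of the data."""
    return sorted(dc for dc, rf in replication.items() if rf > 0)


def keyspace_cql(name, replication):
    """Build a CREATE KEYSPACE statement that lists only data centers with replicas.

    Assumption: a data center whose replication factor is 0 is omitted from the
    statement, so no copies of the keyspace's data are placed there.
    """
    options = {"class": "NetworkTopologyStrategy"}
    options.update({dc: rf for dc, rf in replication.items() if rf > 0})
    body = ", ".join(f"'{key}': {value!r}" for key, value in options.items())
    return f"CREATE KEYSPACE {name} WITH replication = {{{body}}};"


# Hypothetical layout: three copies in a European data center, none in a US one.
eu_only = {"eu_frankfurt": 3, "us_east": 0}

print(replicas_by_datacenter(eu_only))
print(keyspace_cql("eu_customers", eu_only))
```

    The sketch only builds the statement as text; it does not connect to a cluster, and it says nothing about which queries can then be served from the data center that holds no replicas.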

    October 15, 2015

    Basho and Riak

    Basho was on my (very short) blacklist of companies with whom I refuse to speak, because they have lied about the contents of previous conversations. But Tony Falco et al. are long gone from the company. So when Basho’s new management team reached out, I took the meeting.

    For starters:

    Basho’s product line has gotten a bit confusing, but as best I understand things the story is:

    Technical notes on some of that include: Read more

    October 11, 2015

    Notes on privacy and surveillance, October 11, 2015

    1. European Union data sovereignty laws have long had a “Safe Harbour” rule stating it was OK to ship data to the US. Per the case Maximilian Schrems v Data Protection Commissioner, this rule is now held to be invalid. Angst has ensued, and rightly so.

    The core technical issues are roughly:

    Facebook’s estimate of billions of dollars in added costs is not easy to refute.

    My next set of technical thoughts starts: Read more

    September 14, 2015

    DataStax and Cassandra update

    MongoDB isn’t the only company I reached out to recently for an update. Another is DataStax. I chatted mainly with Patrick McFadin, somebody with whom I’ve had strong consulting relationships at both a user and a vendor. But Rachel Pedreschi contributed the marvelous phrase “twinkling dashboard”.

    It seems fair to say that in most cases:

    Those generalities, in my opinion, make good technical sense. Even so, there are some edge cases or counterexamples, such as:

    *And so a gas company is doing lightweight analysis on boiler temperatures, which it regards as hot data.

    While most of the specifics are different, I’d say similar things about MongoDB, Cassandra, or any other NoSQL DBMS that comes to mind: Read more

    May 26, 2015

    IT-centric notes on the future of health care

    It’s difficult to project the rate of IT change in health care, because:

    Timing aside, it is clear that health care change will be drastic. The IT part of that starts with vastly comprehensive electronic health records, which will be accessible (in part or whole as the case may be) by patients, care givers, care payers and researchers alike. I expect elements of such records to include:

    These vastly greater amounts of data cited above will allow for greatly changed analytics.
    Read more

    March 15, 2015

    BI for NoSQL — some very early comments

    Over the past couple years, there have been various quick comments and vague press releases about “BI for NoSQL”. I’ve had trouble, however, imagining what it could amount to that was particularly interesting, with my confusion boiling down to “Just what are you aggregating over what?” Recently I raised the subject with a few leading NoSQL companies. The result is that my confusion was expanded. Here’s the small amount that I have actually figured out.

    As I noted in a recent post about data models, many databases — in particular SQL and NoSQL ones — can be viewed as collections of <name, value> pairs.

    Consequently, a NoSQL database can often be viewed as a table or a collection of tables, except that:

    That’s all straightforward to deal with if you’re willing to write scripts to extract the NoSQL data and transform or aggregate it as needed. But things get tricky when you try to insist on some kind of point-and-click. And by the way, that last comment pertains to BI and ETL (Extract/Transform/Load) alike. Indeed, multiple people I talked with on this subject conflated BI and ETL, and they were probably right to do so.
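
    As an illustration of the script-it-yourself route, here is a minimal Python sketch that turns a handful of <name, value> documents into a single table. It assumes that nested values and fields missing from some records are among the things a script has to handle; that assumption, the sample records, and the function names flatten and to_table are all hypothetical rather than drawn from any particular NoSQL product.

```python
def flatten(document, prefix=""):
    """Flatten nested <name, value> pairs into one level, joining names with dots."""
    row = {}
    for name, value in document.items():
        column = f"{prefix}{name}"
        if isinstance(value, dict):
            # A nested document becomes several columns, e.g. "address.city".
            row.update(flatten(value, prefix=f"{column}."))
        else:
            row[column] = value
    return row


def to_table(documents):
    """Return (columns, rows); a field absent from a document becomes None."""
    flat = [flatten(doc) for doc in documents]
    columns = sorted({column for row in flat for column in row})
    rows = [[row.get(column) for column in columns] for row in flat]
    return columns, rows


# Hypothetical records with differing fields and one nested value.
documents = [
    {"id": 1, "address": {"city": "Boston"}},
    {"id": 2, "name": "Pat"},
]

columns, rows = to_table(documents)
print(columns)
for row in rows:
    print(row)
```

    Once the data is in this shape, aggregating it is ordinary tabular work. The hard part for a point-and-click tool is choosing the flattening rules, which the script above simply hard-codes.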

    Read more


