Companies, products, and technologies included in the Big Data Landscape:

- Splunk, Loggly, Sumo Logic
- Predictive Policing, BloomReach
- Gnip, DataSift, SpaceCurve, Inrix
- Oracle Hyperion, SAP BusinessObjects, Microsoft Business Intelligence, IBM Cognos, SAS, MicroStrategy, GoodData
- Tableau Software, Palantir, MetaMarkets, Teradata Aster, Visual.ly, Karmasphere, EMC Greenplum, Platfora, ClearStory
- Hortonworks, Cloudera, MapR, Vertica
- Couchbase, Teradata, 10gen, Hadapt
- Amazon Web Services Elastic MapReduce, Infochimps, Microsoft Windows Azure
- Oracle, Microsoft SQL Server, MySQL, PostgreSQL, MemSQL
- Hadoop, MapReduce, HBase, Cassandra

 


Comments

06/27/2012 14:30

Cassandra is a NoSQL database management system, designed to handle very large amounts of data spread out across many commodity servers while providing a highly available service with no single point of failure.

Cassandra has Hadoop integration, with MapReduce support. There is also support for Apache Pig and Apache Hive.

CQL (Cassandra Query Language), an SQL-like alternative to the traditional RPC interface, has been introduced. Language drivers are available for Java (JDBC) and Python (DBAPI2).

Cassandra is in use at Netflix, Twitter, Urban Airship, Constant Contact, Reddit, Cisco, OpenX, Digg, CloudKick, Ooyala, and more companies that have large, active data sets. The largest known Cassandra cluster has over 300 TB of data in over 400 machines.

http://cassandra.apache.org/

Reply
06/27/2012 14:40

Sumo Logic is a cloud-based log management and analytics service that leverages Big Data to deliver real-time IT insights.

Sumo Logic’s architecture features an elastic petabyte scale platform that collects, manages and analyzes enterprise log data, reducing millions of log lines into valuable operational insights in real time. The cloud-based approach overcomes the inherent problems of premise-based solutions, including limits on scalability, inefficient or haphazard analysis, and uncontrolled costs.

The basic architecture of Sumo Logic's cloud log management system revolves around the best practice of divide-and-conquer by composing many small and independent services in order to build a scalable, flexible log management platform. Major components of the Sumo Logic system, such as the log data intake and collection management facilities, the full-text indexing pipeline, and the interactive analytics platform, are encapsulated into a set of decoupled services. Each service employs a set of shared lower-level modules to make use of common functionality. Each service runs as an independent executable, and each service can scale independently of the others in the cloud based on demand and the specific CPU and I/O requirements of each service.

http://www.sumologic.com/
support@sumologic.com

Reply
06/27/2012 15:36

Today Teradata leads the industry not only in active data warehousing technology that provides a single view of the enterprise in real time, but also in business intelligence analytics that help you use data in imaginative new ways to gain maximum value.

We offer a powerful suite of business intelligence technology platforms and solutions, a wide range of data access and management applications and robust data mining capabilities.
Regardless of the size of your organization or the complexity of your analytic needs, a Teradata solution can generate actionable business intelligence and help you achieve a competitive edge.

With the strength of the industry’s largest and most experienced force of consultants, Teradata will partner with you to effectively design, implement and maintain your system and train your staff. That's our end-to-end data warehousing solution.

Teradata was named “one of the world’s most ethical companies” by The Ethisphere Institute.

http://www.teradata.com

U.S. and Canada
1-866-548-8348

For International Callers:
(937) 242-4030

World Headquarters
10000 Innovation Drive
Dayton, OH 45342

Reply
06/27/2012 15:43

Splunk indexes and makes searchable data from any app, server or network device in real time including logs, config files, messages, alerts, scripts and metrics.

Your IT infrastructure generates massive amounts of machine data: data generated by websites, applications, servers, networks, mobile devices and the like.

By monitoring and analyzing everything from customer clickstreams and transactions to network activity to call records, Splunk turns your machine data into valuable insights.

Troubleshoot problems and investigate security incidents in minutes (not hours, or days). Monitor your end-to-end infrastructure to avoid service degradation or outages. And gain real-time visibility into customer experience, transactions and behavior.

http://www.splunk.com/

250 Brannan Street, 1st Floor,
San Francisco, CA 94107

phone: +1 415.848.8400

Reply
06/27/2012 15:57

Greenplum is a Big Data analytics company whose products harness the skills of data science teams to help global organizations realize the full promise of business agility and become data-driven, predictive enterprises.

Greenplum Unified Analytics Platform (UAP) combines the co-processing of structured and unstructured data with a productivity engine that enables collaboration among your data science team. Greenplum UAP includes Greenplum Database, Greenplum HD, and Greenplum Chorus.

Greenplum has over 500 customers, including Silver Spring Networks, Zions Bancorporation, Reliance Communications, NYSE Euronext, Bakrie Telecom, Orbitz, Havas Digital, China Unicom, ClickFox, Franklin Templeton Investments and Tagged.

http://www.greenplum.com/

Reply
06/27/2012 16:05

Vertica Systems is an analytic database management software company: http://www.vertica.com

The grid-based, column-oriented Vertica Analytic Database is designed to manage large, fast-growing volumes of data and provide very fast query performance when used for data warehouses and other query-intensive applications. Its design features include:

Column-oriented storage organization, which increases performance of sequential record access at the expense of common transactional operations such as single record retrieval, updates, and deletes.

Out-of-place updates and hybrid storage organization, which increase the performance of queries, insertions, and loads, but at the expense of updates and deletes.

Compression, which reduces storage costs and I/O bandwidth. High compression is possible because columns of homogeneous datatype are stored together and because updates to the main store are batched.

Shared nothing architecture, which reduces system contention for shared resources and allows gradual degradation of performance in the face of hardware failure.

Vertica's specialized approach aims to significantly increase query performance in data warehouses. One example of a use case detailed in a research paper shows a performance improvement of hundreds of times with Vertica in a specific application due to the use of the vertical DBMS approach.
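
The trade-offs listed above can be sketched in a few lines of Python. This is only an illustration of column-oriented storage and run-length compression in general, not Vertica's actual storage format; the sample table, the column names and the `run_length_encode` helper are all assumptions made for the example.

```python
from itertools import groupby

# Hypothetical sample table: (region, status, amount) rows.
rows = [
    ("east", "ok", 10),
    ("east", "ok", 12),
    ("east", "fail", 7),
    ("west", "ok", 3),
    ("west", "ok", 9),
]

# Column-oriented layout: the values of each column are stored together.
column_store = {
    "region": [r[0] for r in rows],
    "status": [r[1] for r in rows],
    "amount": [r[2] for r in rows],
}

def run_length_encode(values):
    """Collapse consecutive repeats into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]

# An analytic query such as SUM(amount) reads a single column.
total = sum(column_store["amount"])

# A column of one datatype with long runs of equal values compresses well.
encoded_region = run_length_encode(column_store["region"])

# Retrieving one full record has to visit every column, which is the
# cost the feature list describes for single-record operations.
record_2 = tuple(column_store[name][2] for name in ("region", "status", "amount"))

print(total)
print(encoded_region)
print(record_2)
```

The aggregate touches one list, while rebuilding a single record touches all three, which mirrors the stated bias toward sequential, query-intensive workloads.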

HP Vertica
150 Cambridgepark Dr
Cambridge, MA 02140
Phone: 617-386-4400

EMEA OFFICE
Phone: +44 02070 783 299
Fax: 978-600-1001

ASIA-PACIFIC OFFICE
Phone: +614 57 732 880
Fax: 978-600-1001

Reply
06/27/2012 16:17

Benefits of Data Virtualization

Data virtualization is the process of offering data consumers a data access interface that hides the technical aspects of stored data, such as location, storage structure, API, access language, and storage technology. Consuming applications may include: business intelligence, analytics, CRM, enterprise resource planning, and more across both cloud computing platforms and on-premises.

Data Virtualization Benefits:

● Decision makers gain fast access to reliable information
● Improve operational efficiency - flexibility and agility of integration due to the short cycle creation of virtual data stores without the need to touch underlying sources
● Improved data quality due to a reduction in physical copies
● Improved usage through creation of subject-oriented, business-friendly data objects
● Increases revenues
● Lowers costs
● Reduces risks

Data virtualization abstracts, transforms, federates and delivers data from a variety of sources and presents itself as a single access point to a consumer regardless of the physical location or nature of the various data sources.

Data virtualization is based on the premise of abstracting the data contained within a variety of data sources (databases, applications, file repositories, websites, data services vendors, etc.) for the purpose of providing single-point access to the data. Its architecture is based on a shared semantic abstraction layer, as opposed to limited-visibility semantic metadata confined to a single data source.

Data Virtualization software is an enabling technology which provides the following capabilities:

• Abstraction – Abstract the technical aspects of stored data, such as location, storage structure, API, access language, and storage technology.

• Virtualized Data Access – Connect to different data sources and make them accessible from one logical place

• Transformation / Integration – Transform, improve quality, and integrate data based on need across multiple sources

• Data Federation – Combine result sets from across multiple source systems.

• Flexible Data Delivery – Publish result sets as views and/or data services executed by consuming applications or users when requested

In delivering these capabilities, data virtualization also addresses requirements for data security, data quality, data governance, query optimization, caching, etc. Data virtualization software includes functions for development, operation and management.

http://www.rosebt.com/uploads/8/1/8/1/8181762/benefitsofdatavirtualization.pdf
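
As a rough illustration of the abstraction and federation capabilities described in this comment, the sketch below puts one "virtual view" in front of two differently stored sources. Everything in it is an assumption for the example: the `customers` table, the CSV of orders, and the function names do not come from any data virtualization product.

```python
import csv
import io
import sqlite3

# Source 1 (hypothetical): a relational database holding customers.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
db.executemany("INSERT INTO customers VALUES (?, ?)", [(1, "Acme"), (2, "Globex")])

# Source 2 (hypothetical): a CSV export holding orders.
ORDERS_CSV = "customer_id,total\n1,250\n2,100\n1,50\n"

def read_customers():
    """Adapter that hides the SQL access language from the consumer."""
    return {row[0]: row[1] for row in db.execute("SELECT id, name FROM customers")}

def read_orders():
    """Adapter that hides the CSV storage format from the consumer."""
    reader = csv.DictReader(io.StringIO(ORDERS_CSV))
    return [(int(r["customer_id"]), int(r["total"])) for r in reader]

def customer_totals():
    """Virtual view: federates both sources on request without copying them."""
    names = read_customers()
    totals = {}
    for customer_id, total in read_orders():
        name = names[customer_id]
        totals[name] = totals.get(name, 0) + total
    return totals

print(customer_totals())
```

The consumer calls `customer_totals()` and never learns that one source speaks SQL and the other is a flat file, which is the single access point idea in miniature.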

Reply
06/27/2012 17:50

How Many Computers to Identify a Cat?

THE NEW YORK TIMES 6/27/2012 – Inside Google’s secretive X laboratory, known for inventing self-driving cars and augmented reality glasses, a small group of researchers began working several years ago on a simulation of the human brain. There Google scientists created one of the largest neural networks for machine learning by connecting 16,000 computer processors, which they turned loose on the Internet to learn on its own.

http://www.nytimes.com/2012/06/26/technology/in-a-big-network-of-computers-evidence-of-machine-learning.html?pagewanted=all

Reply
06/27/2012 18:05

Platfora makes the data that lives inside Hadoop clusters understandable and easier to use.

We bring business intelligence into the 21st century, giving business analysts the intuitive and richly interactive tools to explore and produce business insights from massive and rapidly evolving datasets.

Whether a company has Gigabytes, Terabytes or Petabytes of data, our platform eliminates the need for traditional data warehouses, ETL tools and the legacy BI products of the past. We replace complexity and scaling pain with simplicity and beauty.

PLATFORA’S BREAKTHROUGH SOLUTION is a combination of server technology, user experience innovation, and data science.

Our platform works with existing Hadoop clusters (Cloudera, MapR, Amazon EMR, etc.), and automatically turns the questions of business users into Hadoop jobs that synthesize and distill Hadoop datasets into dimensional and predictive dashboards, reports and insights.

The system intelligently drives Hadoop to create and maintain ‘work products’ — highly compressed partial results that are refined at the click of a button to achieve subsecond report delivery, analytics overlay, and drilldown performance.

http://www.platfora.com

100 S Ellsworth Ave, Suite 400
San Mateo, CA, 94401
650-918-1100

Reply
06/27/2012 20:45

Business Intelligence and Analytic Applications Market Share, Worldwide, 2008-2010

http://www.rosebt.com/1/post/2012/01/business-intelligence-and-analytic-applications-market-share-worldwide-2008-2010.html

Reply
06/27/2012 21:03

DataSift is the world's most powerful and scalable platform for managing large volumes of information from a variety of social data sources.

Social Data is more complicated to process and analyze because it is unstructured. DataSift's platform has been built specifically to process large volumes of this unstructured data and derive value from it.

DataSift lets you access the full Twitter firehose in a cost-effective way. Pick between pay-as-you-go and subscription models, so you pay just for the Tweets that you use.

DataSift was architected to handle large volumes of real-time data and immense numbers of sophisticated queries. This infrastructure supports billions of interactions and queries with regular response times of less than 200 ms.

http://datasift.com/

Reply

NuoDB is the first database that is 100% SQL compliant, guarantees ACID transactions, and scales out elastically on cloud-based computing resources.

NuoDB is a NewSQL cloud database. It looks and behaves like a traditional SQL database from the outside but under the covers it's a revolutionary database solution. It is a new class of database for a new class of datacenter.

Created by an experienced team of software experts with the support of a visionary investment team, NuoDB is redefining the world of transactional databases.

Announcing Beta 7!

NuoDB is pleased to announce today the much anticipated release of lucky Beta 7. Major areas of development include:

Performance and scalability-- Beta 7 now supports up to 50 nodes enabling the database to elastically scale to tens of thousands of transactions per second on Amazon EC2 or local commodity servers.

Product hardening-- Beta 7 provides additional hardening and increased stability.

SQL -- Beta 7 now supports SQL-92 with SQL-99 extensions, incorporates several querying and indexing enhancements, and improves SQL standards compliance to cover a majority of application requirements.

NuoDB Administrative Console -- Functional enhancements to the graphical console, which was introduced in Beta 6.

New language drivers -- Incorporates new Ruby, PHP/PDO, and Perl drivers to address growing Web development needs. These drivers were developed by the NuoDB GitHub community.

Try it. Download @ https://www.nuodb.com/download.php

http://www.nuodb.com

info@nuodb.com

NuoDB, Inc.
215 First Street
Suite 005
Cambridge, MA 02142
P: +1 (617) 500-0001

Reply
06/27/2012 22:25

MongoDB is a scalable, high-performance, open source NoSQL database system written in C++.

Instead of storing data in tables as is done in a "classical" relational database, MongoDB stores structured data as JSON-like documents with dynamic schemas (MongoDB calls the format BSON), making the integration of data in certain types of applications easier and faster.
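
To make "JSON-like documents with dynamic schemas" concrete, here is a small sketch that uses plain Python dictionaries as stand-ins for documents. It does not use MongoDB or its drivers: the `collection` list and the `find` helper are hypothetical, and JSON is used for printing only because BSON is a binary format.

```python
import json

# A "collection" here is just a Python list standing in for a real one.
collection = []

# Documents in the same collection need not share the same fields.
collection.append({"name": "Ada", "email": "ada@example.com"})
collection.append({
    "name": "Grace",
    "languages": ["COBOL", "FLOW-MATIC"],
    "address": {"city": "Arlington"},
})

def find(docs, **criteria):
    """Return documents whose top-level fields equal every given criterion."""
    return [d for d in docs if all(d.get(k) == v for k, v in criteria.items())]

matches = find(collection, name="Grace")
print(json.dumps(matches[0], sort_keys=True))
```

Adding the nested `address` field to one document required no table alteration, which is the flexibility the comment attributes to the document model.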

http://www.mongodb.org

Reply
06/28/2012 07:23

10gen develops MongoDB, and offers production support, training, and consulting for the open source database.

http://www.10gen.com

US: (866) 237-8815
International: (650) 440-4474

Reply
06/28/2012 07:59

Hadapt offers a Hadoop-based adaptive analytical platform for performing complex analytics on structured and unstructured data.

The Hadapt Adaptive Analytic Platform is the first big data platform to combine the benefits of Apache Hadoop and relational DBMS technology into a single system for applications that rely on multi-structured data analytics.

Hadapt was designed for the cloud, and is optimized for virtualized environments. In addition to providing the full power of MapReduce, Hadapt offers enhanced SQL support and the ability to work with all of your data within one platform.

http://www.hadapt.com

614 Massachusetts Ave.
4th Floor
Cambridge, MA 02139
(617) 539-6110
info@hadapt.com

Reply
06/28/2012 08:23

We connect business people with the data they need to make better decisions.

Traditional BI solutions are monolithic; they don’t evolve at the speed of business. GoodData makes BI an on-demand service. You can adopt it quickly and evolve it simply without the usual broken promises and soul-crushing failures you’ve experienced with BI. Start small, grow with your success and become more metrics-driven with each new dashboard.

For business users, that means operational dashboards that put key metrics and reports front-and-center in your daily business life.

For the technical people, it’s an entirely new way to deliver BI for your company.

http://www.gooddata.com

(415) 200-0186
info@gooddata.com

Reply
06/28/2012 08:36

Hadoop has both good and bad points. It is one tool among many and is not the right tool for every organization.

Hadoop has a high latency (long delay), which makes it bad for traditional business intelligence functions that require a low latency (small delay). It also comes with a steep learning curve and a workload that can significantly vary.

Hadoop does some things really well: analytical capabilities, support for ad-hoc queries on conventional databases, high degree of processing complexity, support for high volumes of unstructured data.

A better way to view Hadoop is to add and integrate Hadoop to the traditional data warehouse / BI systems, embracing it as a companion technology.
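
For readers who have not seen the batch model behind Hadoop, the following single-process sketch shows the map, shuffle and reduce phases on a word count. It is a toy illustration of the MapReduce pattern only, with hypothetical function names and input; it does not use Hadoop's API. Each phase must finish over the whole input before the next begins, which is one reason this style of processing suits large batch jobs better than low-latency queries.

```python
from collections import defaultdict

def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]

def shuffle(pairs):
    """Group all emitted values by key, as happens between the two phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(key, values):
    """Combine all values collected for one key into a single result."""
    return key, sum(values)

lines = ["big data big insights", "data warehouse"]
pairs = [pair for line in lines for pair in map_phase(line)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
print(counts)
```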

Reply
06/28/2012 08:42

Starcounter: more than 100x faster than traditional databases. Its database technology processes live, fast data instantaneously with full ACID compliance.

Starcounter's VMDBMS technology is designed to improve database performance. The VMDBMS is more than 100 times faster than traditional databases and 10 times faster than high performance databases, and integrates the Virtual Machine and the Database Management System.

As data is physically stored only in one place, the VMDBMS eliminates costly data transfers between application and database as well as transformation between different data formats. This is possible as the application can access data in the database just as fast as its internal temporary data.

http://www.starcounter.com

Tel. +46 8 410 282 10
info@starcounter.com

Reply
06/28/2012 08:56

Hybrid RDBMS Systems Now Coined NewSQL Databases

When you look at the evolution of NoSQL, it has primarily evolved to solve specific problems that were difficult, not impossible, to solve using a traditional RDBMS. In other words, NoSQL became perhaps a "better" or "different" way to solve a problem. A lot of NoSQL solutions achieve their scalability by choosing to make compromises on things like consistency and transactions.

But what if you didn't want to compromise? What if you wanted scalability and things like ACID-compliant transactions? Well, a new crop of database startups started to emerge: solutions like VoltDB, Clustrix and ScaleDB. Essentially, they were hybrid databases offering a traditional RDBMS that could scale. Well, now that group of databases has a new name: NewSQL!
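
To show what an ACID transaction guarantees, the sketch below uses Python's built-in `sqlite3` module. SQLite is a single-node database, not one of the NewSQL systems named above, and the `accounts` table and `transfer` function are assumptions for the example; the point is only the all-or-nothing behavior that these products aim to keep while scaling out.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts (name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 20)])
conn.commit()

def transfer(conn, source, target, amount):
    """Move funds atomically: both updates apply, or neither does."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, target),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, source),
            )
        return True
    except sqlite3.IntegrityError:
        return False

def balances(conn):
    """Return the current balances as a dictionary."""
    return dict(conn.execute("SELECT name, balance FROM accounts"))

ok = transfer(conn, "alice", "bob", 30)       # succeeds
failed = transfer(conn, "alice", "bob", 500)  # violates the CHECK constraint
print(ok, failed, balances(conn))
```

In the failed transfer the credit to `bob` is applied first and then undone by the rollback, so no partial update is ever visible.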

The 451 Group published a blog post in which they described what the term meant along with the players they felt fit within the group. Here's what they had to say about the term:

“NewSQL” is our shorthand for the various new scalable/high performance SQL database vendors. We have previously referred to these products as ‘ScalableSQL’ to differentiate them from the incumbent relational database products. Since this implies horizontal scalability, which is not necessarily a feature of all the products, we adopted the term ‘NewSQL’ in the new report.

The reaction so far by many of the companies included in the group has been extremely positive. In the end it's just a name, a way to categorize a group of similar solutions. It does bring a sense of legitimacy to the product group, as well as a name to rally around as these solutions grow. And so the NewSQL era begins!

Reply
06/28/2012 10:03

OrientDB is an open source NoSQL DBMS with the features of both document and graph DBMSs. It's written in Java and it's amazingly fast: it can store up to 150,000 records per second on common hardware.

It supports schema-less, schema-full and schema-mixed modes. It has a strong security profiling system based on users and roles and supports SQL as a query language. OrientDB uses a new indexing algorithm called MVRB-Tree, derived from the Red-Black Tree and from the B+Tree; this has benefits of having both fast insertions and ultra fast lookups.

http://orientdb.org

Reply
06/28/2012 10:24

Luca Garulli provides a slide presentation about using OrientDB as the primary data store for a web application.

A couple of interesting facts from the presentation:

100% compliant with TinkerPop's Blueprints
Multiple run modes i.e. embedded, in-memory, client/server
Supports transactions
Supports SQL queries

http://www.slideshare.net/lvca/orientdb-nosqlday

Reply
06/28/2012 10:45

Citrusleaf is a NoSQL database that uniquely delivers reliability, linear scalability and exceptional performance for high volume, data intensive, web-scale and mobile businesses.

Citrusleaf provides the following:

Non-Stop Transactions
ACID (Atomic, Consistent, Isolated, Durable)
Immediate consistency
Automated clustering / Shared nothing
Flexible data storage supporting both Flash/SSD and rotational disk
Balanced Read/Writes with high throughput (200K+ TPS per node) and low latency (sub millisecond)
Support for billions of objects and terabytes of data

http://citrusleaf.net

+1 650-336-5323 (336-LEAF)
info@citrusleaf.com

Citrusleaf, Inc.
2525 Charleston Road
Mountain View, CA 94043

Cross Datacenter Replication Datasheet

http://citrusleaf.com/_docs/Citrusleaf_XDR_Datasheet.pdf

Reply
06/28/2012 11:00

ScaleDB is a pluggable storage engine for MySQL. It turns your MySQL application into an enterprise-class, highly-available, clustered database that scales elastically in a public cloud, private cloud, or on premise. ScaleDB is a NewSQL pioneer, delivering the advantages of both SQL and NoSQL, while targeting the cloud.

Benefits

Elasticity & Scalability

Cloud Elasticity: add/remove database nodes on the fly

Database Virtualization

Runs on any cloud, SaaS infrastructure, or on premise

Leading NewSQL database solution

High-Availability

Automatic recovery of failed nodes

No interruption to the application

Ease of Use

Scale your application without partitioning/sharding

No slaves, replication or slave promotion

Setup time: about 10 minutes

100% compatible with MySQL applications & tools

Data is consistent on a real-time basis

Storage/Caching tier provides HA using PC-based storage

Dramatically simplifies set-up, maintenance & tuning, resulting in lower TCO

http://www.scaledb.com

3723 Haven Ave, Suite 114
Menlo Park, CA 94025
Phone: (650) 587-8787

Reply
06/28/2012 11:13

We are now getting disruption in the database market.

The two primary DBMS architectures are shared-nothing and shared-disk. Shared-nothing databases split or partition the data so that each database server exclusively processes and maintains its own piece of the database. Shared-disk is analogous to a single large trough of data, where any number of database nodes can process any portion of that data. Based upon traditional computing constraints, the shared-nothing architecture has been the price-performance leader.
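
The difference between the two architectures can be sketched as follows. This is a simplified, in-memory illustration under stated assumptions: the three-node cluster, the CRC32-based `owner` function and the sample records are hypothetical and do not describe any particular product.

```python
import zlib

NODE_COUNT = 3  # hypothetical cluster size

def owner(key, node_count=NODE_COUNT):
    """Shared-nothing: a stable hash assigns each key to exactly one node."""
    return zlib.crc32(key.encode("utf-8")) % node_count

# Shared-nothing: every node holds only its own partition of the data.
partitions = [dict() for _ in range(NODE_COUNT)]
# Shared-disk: one common store that any node may read or write.
shared_store = {}

records = {"user:1": "Ada", "user:2": "Grace", "user:3": "Linus", "user:4": "Margaret"}
for key, value in records.items():
    partitions[owner(key)][key] = value
    shared_store[key] = value

def read_shared_nothing(key):
    """Route the request to the single node that owns the key."""
    return partitions[owner(key)][key]

def read_shared_disk(key, node_id):
    """Any node can serve the request; node_id does not affect the lookup."""
    return shared_store[key]

print([len(p) for p in partitions])
print(read_shared_nothing("user:2"), read_shared_disk("user:2", node_id=0))
```

Adding a node to the shared-nothing layout changes `owner` and forces data to move between partitions, whereas the shared-disk layout leaves the store untouched; that difference is what the argument in this comment turns on.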

That is now changing. The shared-nothing DBMS architecture gained widespread adoption on the basis of performance and cost advantages that no longer exist.

Shared storage, with the help of extremely fast interconnects, now delivers data to the CPU several times faster than a local disk.

Cloud computing economics (leveraging the power of multi-tenancy) delivers extremely fast shared storage at a dramatically reduced cost.

Virtualization then compounds these advantages by enabling users to scale elastically and to pay only for the resources they use. The cost/performance advantages have decisively shifted in favor of the shared-disk DBMS.

It was just a matter of time before the shared-disk DBMS established dominance in the cloud.

We are now seeing some significant disruption in the database landscape for enterprises and private clouds as well.

About time!

Reply
06/28/2012 11:32

Clustrix is the leader in NewSQL databases for transactional big-data applications.

The Sierra Clustered Database Engine, the technology at the heart of the Clustrix solution, is a shared-nothing environment that includes the Sierra Parallel Planner and the Sierra Distributed Execution Engine.

Sierra provides an entirely new architectural approach to query resolution. It moves the query to the data, not the data to the query.
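
The idea of moving the query to the data can be illustrated generically. The sketch below is not the Sierra engine: the three-node layout, the `orders` rows and both function names are assumptions, and the second value each function returns simply counts how many items would cross the network.

```python
# Hypothetical cluster: each node holds a slice of an "orders" table
# as (region, amount) rows.
nodes = [
    [("east", 10), ("west", 5)],
    [("east", 7), ("east", 3)],
    [("west", 20)],
]

def ship_data_to_query(nodes, region):
    """Pull every row to a coordinator, then filter and sum there."""
    moved = [row for node in nodes for row in node]
    total = sum(amount for r, amount in moved if r == region)
    return total, len(moved)

def ship_query_to_data(nodes, region):
    """Run the filter and sum on each node; only partial sums travel."""
    partials = [sum(amount for r, amount in node if r == region) for node in nodes]
    return sum(partials), len(partials)

print(ship_data_to_query(nodes, "east"))
print(ship_query_to_data(nodes, "east"))
```

Both approaches return the same total, but the second moves one number per node instead of every row, and that gap widens as the table grows.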

This revolutionary database technology makes it possible to scale a single database across nodes, and still support massive concurrency and deliver high performance, full relational functionality, transactional consistency (ACID), and seamless deployment.

Expect optimal and predictable performance from Clustrix’s unique, performant and highly stable architecture, designed for the scale challenges of big-data applications. The shared-nothing, low-latency compute platform—built on SSD and industry standard building blocks—has automatic data redistribution that maintains performance as the database changes and grows.

Clustrix is a 1U server housing 7 SSD drives, 1 spinning disk, dual processors, 48 GB of RAM, and an InfiniBand connection for communications between servers. The minimum configuration includes three 1U servers and provides one server's worth of redundancy so that a failed server does not bring down the database.

The system is a drop-in replacement for MySQL designed to overcome MySQL scalability issues with a minimum of disruption to an enterprise's production activities.

http://www.clustrix.com

Clustrix, Inc.
201 Mission St., Ste 800
San Francisco, CA 94105
+1 415-501-9560
info@clustrix.com

Reply

Karmasphere is a Big Data Intelligence Software company bringing Apache Hadoop power to the desktop.

The first step in analyzing Big Data is to connect to your Hadoop cluster and distribution, which is not an easy task if you are a business analyst and all you have is a Linux prompt. Karmasphere Analyst lets you connect to any data on any Hadoop cluster, in a private data center, or private or public cloud, even behind firewalls, from your Windows, MacOS or Linux desktop. We make it look as if you were working on your local file system, all from a unified graphical interface with wizards and point-and-click features.

The second step in analyzing Big Data is to assemble a variety of files and formats containing large volumes of data so that they can be explored for insight. The goal is to work on all the raw data and all its detail so that discoveries are not limited to aggregations and predefined structures and functions as in the traditional business intelligence world. Karmasphere Analyst guides the user with wizards to gather, organize and prepare any kind of raw data on-the-fly and then create Hive tables in preparation for analytics. This bypasses the lengthy aggregation and modeling steps usually taken to prepare business intelligence data warehouses.

The third step in analyzing Big Data is the analyze phase. This is when the analyst role changes to data miner and discoverer, navigating through the raw data to find the patterns, trends and insights that can transform the business. With Karmasphere Analyst, the data professional performs iterative ad hoc queries with the SQL-like Hive query language and interactive formatting, filtering, sorting and charting wizards to visualize the results in different ways. Over 150 predefined functions can be used in queries or you can add your own analytic functions. Once insight is discovered, the path the analyst took to that learning is retained so that it can be repeated and operationalized.

In order for the insights attained from Big Data to transform your business they must find their way into the hands of other individuals or integrated into business processes and applications. So the final phase for the Big Data analyst is to make sure insights incite action within the business. Karmasphere Analyst provides facilities for the business to “Act” on the Big Data insights created by the data professional. Results can be published and shared with other users and applications, and intellectual capital can be retained, shared, and operationalized. For example Karmasphere Analyst can send insights to enterprise data warehouses, Excel, Tableau, or other reporting and Business Intelligence tools.

https://karmasphere.com

19200 Stevens Creek Blvd., Suite 130
Cupertino, CA 95014
650.292.6100

Reply

https://karmasphere.com/karmasphere-features

Karmasphere 2.0 changes the Big Data landscape, ushering in the next generation of actionable, self-service Big Data insights with collaborative, social and unconstrained analytics. It's designed for the data-driven business and will be available in July 2012.

Reply
06/28/2012 12:32

Get started using Hadoop today with an SGI Hadoop Starter Kit.

SGI delivers all you need in one integrated package:

Pre-configured Hadoop clusters optimized for performance and capacity

SGI Rackable servers, storage, and networking delivered racked, cabled and ready for production

Dedicated applications server

Pre-installed Hadoop software platform - Red Hat Enterprise Linux, Cloudera Distribution including Apache Hadoop (CDH) and SGI Management Center

Pre-installed BI out of the box - full editions of Datameer, Kitenga, Pentaho and Quantum4D

Built with the latest Intel® Xeon® processors

Available in 2 complete, high-density, performance-optimized options:

176 TB in 20 servers (half rack)
336 TB full rack

Multi-rack systems are also available - tell us your needed data capacity and SGI will provide additional options.

http://www.sgi.com/products/hadoop/starter_kit.html?gclid=CIWqiZTQ8bACFQ8CQAodDXjtug

46600 Landing Parkway
Fremont, CA 94538
510-933-8300

Reply
06/28/2012 12:46

VoltDB is an in-memory, fast NewSQL database system. It is specifically designed to run on modern scale-out architectures - fast, inexpensive servers connected via high-speed data networks.

VoltDB is aimed at a new generation of database applications – real-time feeds, sensor-driven data streams, micro-transactions, low-latency trading systems – requiring database throughput that can reach millions of operations per second. What’s more, the applications that use this data must scale on demand, provide flawless fault tolerance and enable real-time visibility into the data that drives business value.

VoltDB is more than an ultra-fast database. All product distributions come complete with developer productivity tools, sample apps and reference implementations to get your project off the ground quickly. And pre-packaged images for Amazon EC2 and VMware allow you to try VoltDB without making significant infrastructure investments.

http://voltdb.com

6 Fortune Drive
Billerica, MA 01821 USA
978.528.4660
info@voltdb.com

Reply
06/28/2012 13:07

The Hortonworks Data Platform is open source data management software powered by Hadoop. Together with the Apache community, we are making Hadoop more robust, extensible and easier to use for enterprises and solution providers.

We believe that Apache Hadoop will process half of the world’s data within the next five years. To make this happen, we are addressing the technical and knowledge gaps that exist today. We have dedicated significant engineering resources to make Apache Hadoop more robust and easier to integrate, extend, deploy and use. This has resulted in Hortonworks Data Platform, a completely open source and tightly integrated and tested distribution of Apache Hadoop.

http://hortonworks.com

455 W. Maude Avenue
Suite 200
Sunnyvale, CA 94085
(408) 916-4121

Reply
06/28/2012 13:14

MemSQL places data into memory and translates SQL into C++ for the utmost optimization in query execution. This enables MemSQL to write and read data at incredible speeds, and by offering a relational interface, you can unify the data you’d normally store in a short-lived medium—cache or key-value store—and place it directly into a database along with your existing data.

Get rid of custom code and caches that slow down your engineering teams.

MemSQL is the fastest way to ingest large volumes of data while simultaneously analyzing that data in real time. With the power to arrive at answers and actionable insights in seconds, you can adapt and adjust your business processes in concert with fluctuating conditions.

http://memsql.com

380 10th St, Ste 25
San Francisco, CA 94103
info@memsql.com

Reply

Datameer specializes in analysis of large volumes of data for business users of Hadoop. Datameer is a single application for data analytics.

Datameer is the only analytics application that scales with your needs. Empower users to work independently, in a group, or across the company. Develop on your laptop, test with your work group, deploy to your company and scale with your needs: on your laptop, server or cluster.

Datameer Analytics Solution (DAS) is a BI platform for Hadoop that includes data source integration, an analytics engine with a spreadsheet interface designed for business users with over 180 analytic functions, and visualization including reports, charts and dashboards. DAS is available for all major Hadoop distributions including Apache, Cloudera, EMC Greenplum HD, IBM BigInsights, MapR, Yahoo!, and Amazon.

http://www.datameer.com

2040 Pioneer Court
San Mateo, CA 94403 USA
650.286.9100

Reply

SpaceCurve geospatial-temporal database and graph analysis tools enable application developers and organizations to leverage the real-time models required for more powerful geospatial and other classes of applications and to extend existing applications.

SpaceCurve stores, manipulates and analyzes large quantities of geospatial, temporal, sensor network and social graph data in real time to empower a new class of applications, and extend existing applications in new ways.

SpaceCurve will deliver instantaneous intelligence for location-based services, commodities, defense, emergency services and other markets.

The company is developing cloud-based Big Data solutions that continuously store and immediately analyze massive amounts of multidimensional geospatial, temporal, sensor network and social graph data.

http://spacecurve.com
info@spacecurve.com

101 Yesler Way
Suite 507
Seattle, WA 98104

Reply
01/07/2013 22:22

First of all, I would like to thank you for the great and informative entry. I have to admit that I had never heard about this information, and I have noticed many facts that are new to me. Thanks a lot for sharing this useful and attractive information; I will be waiting for other interesting posts from you in the near future. Keep it up.


Reply

Bloggers like you are very few on the World Wide Web, and I am happy to have found you. It’s like finding a pearl in the sea, tough but fruitful. Best wishes and regards.

Reply


