Cloud Computing Resources Directory

  the Basics  
  for Buyers & Users  
  for Vendors  
  for Developers  
  by Industry  
  Analysis  
  Custom View  
  Research & Development  
  Applications  
  Platforms  
  Infrastructure  
  Security  
  Standards & Interoperability  
Cloud Platforms Research Projects
Cloud Computing Technology with BPOS and Windows Azure
Cloud Computing Technology with BPOS and Windows Azure
by Nidhi Bansal
We see cloud computing offerings today that are suitable to host enterprise architectures. But while these offerings provide clear benefits to corporations by providing capabilities complementary to what they have, the fact that they can help to elastically scale enterprise architectures should not be understood to also mean that simply scaling in this way will meet twenty-first-century computing requirements.
read the full story >>
Infrastructure Management and Monitoring in Hybrid Cloud Environment PDF
Komal Gohil - SSE at iGATE

In today’s scenario, enterprises are using diverse technology platforms for their application portfolios, and as part of their Cloud journey, they are adopting multiple Clouds. This paper provides an approach for unified management and monitoring of enterprise’s infrastructure, platforms, and applications deployed in a hybrid environment comprising of Public and Private Cloud platforms offered by different vendors.


The Pop-Up Cloud
Joey Widener - Sr. Product Evangelist at AT&T Hosting & Cloud Services

How the cloud can support your limited life application needs!


Applying ITIL to Cloud Operations
Komal Gohil - SSE at iGATE

Modern Enterprises entail many challenges like scalability, availability, agility, shorter time to market, increasing complexity, etc. Enterprises are adopting cloud to handle these challenges as well as derive business benefits by cost transformation and increased revenue. With cloud gaining momentum enterprise now have to manage applications in different environments namely on-premise and cloud. ITIL is a standard process that organizations follow in managing their data centers. They need to now see how ITIL can be applied to cloud operations without impacting the current on-premise set-up. This paper is our experience report on making ITIL services cloud aware. We have come up with a platform for managing hybrid environment. We have configured ITSM tool to add cloud elements as configuration items and implemented the complete life cycle for various manageable items. We close this paper with a note on related work like Continuous delivery, DevOps and Behavior Driven Operations and how this platform is a step towards achieving these goals and helping business become more agile.


Scaling Out PDF
Tinniam V Ganesh - Founder & Owner at INWARDi Technologies

This article describes some key issues in scaling on the cloud and the solutions to this problem.


PHP’s Forecast: Partly Cloudy
Wil Sinclair - Cloud Evangelist Consultant at Makara

You’ve probably heard about “the cloud”. It’s the latest buzzword circulating in Silicon Valley and beyond. In some ways, “cloud” is a fitting name - everyone sees something different when they look at it. For example, offerings as dissimilar as force.com and Amazon’s MapReduce service have worked their way in to “the cloud” moniker at some point. Even the cloud’s biggest advocates can’t agree on what “the cloud” is. But in teasing the facts out of all the hype, you’ll find some services that could change how you write and deploy your PHP applications forever.


NASA's Nebula Rolls Out in the Cloud
Chris C Kemp - Chief Technology Officer of IT at NASA

Agencies across the federal government are exploring cloud computing, but NASA's Nebula Cloud Computing Platform could become the model for its use. The Chief Information Officer at NASA Ames Research Center, explains the benefits of the Nebula platform.


Some Thoughts on Cloud Adoption Methodology
Jason Carolan - Director of Cloud Solution Development at VMware, Inc

Cloud data centers focus on two main areas: time to market and cost management. Technically it provides an abstract platform to deploy services and APIs to manipulate those services. The enabler is automation and the ability to systematically optimize the platforms and services. Granted the focus here is moving forward with a transformational platform but its useful to look backwards to address other challenges beyond technology.


Don't Pass on PaaS in 2010
Sam Charrington - VP, Product Management & Marketing at Appistry, Inc

There is no question that enterprises today are looking to the clouds to create efficiencies, gain competitive advantage and realize significant cost savings. Where the debate often emerges is around the cloud approach selected and employed - namely, Infrastructure as a Service (IaaS) or Platform as a Service (PaaS). While IaaS solutions are interesting, they fail to fully deliver on the scalability, elasticity and cost improvements enterprises seek to gain by moving applications to the cloud. This is due to the fact that the tru costs of delivering applications and services in a cloud environment relate not to the racking and stacking of infrastructure, but rather to the development, deployment and management of applications - and only PaaS solutions can address these costs. Here is a look at what exactly a platform-oriented cloud solution provides.


Application Packaging for Cloud Computing: A Proposal
James Urquhart - Manager, Cloud Computing & Virtualized Data Centers Marketing at Cisco Systems, Inc

Here is a vision of what software packaging architectures in cloud computing environments may look like. Specifically, Infrastructure-as-a-Service (IaaS) and Platform as a Service (PaaS) offerings, and the enabling infrastructure that will handle application deployment to these services in the future. How they may possibly evolve to make deployment and operations as easy as possible.


Overcast Show: Focus on PaaS
Geva Perry - Founder & Blogger at Thinking Out Cloud
James Urquhart - Manager, Cloud Computing & Virtualized Data Centers Marketing at Cisco Systems, Inc

James and Geva catch up on some of the latest developments in cloud computing, with a particular emphasis on Platform-as-a-Service. Some of the companies, products and technologies mentioned include: Amazon, Salesforce.com, Engine Yard, Heroku, Canonical, Eucalyptus, Chef, Sauce Labs and Microsoft Azure.


Optimizing Applications for Cloud Computing Environments
Jason Carolan - Director of Cloud Solution Development at VMware, Inc

Cloud Computing can help you reduce costs, increase flexibility, and reduce risk. You can leverage the cloud to host applications ranging from the business critical to the experimental. But not all applications are suited for cloud computing environments. When deciding whether and how to move an application to the cloud, you must first assess the expected risks and rewards. Once you've determined that an application can run in the cloud, you next should determine whether it can be further optimized to harness the energy of the cloud. This paper presents a methodology for determining when and how to refactor applications for cloud computing environments.


A Performance and Usability Comparison of Hadoop and Relational Database Systems PDF
Company Profile: Massachusetts Institute of Technology (MIT)

A Performance and Usability Comparison of Hadoop and Relational Database Systems


HadooDB An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads PDF
Company Profile: Massachusetts Institute of Technology (MIT)

HadooDB An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads


A Comparison of Approaches to Large-Scale Data Analysis PDF
Company Profile: Massachusetts Institute of Technology (MIT)

A Comparison of Approaches to Large-Scale Data Analysis


Large-Scale Data Cleaning Using Hadoop PDF
Company Profile: University of California, Irvine

Large-Scale Data Cleaning Using Hadoop


Relaxed Synchronization and Eager Scheduling in MapReduce PDF
Company Profile: Purdue University

Relaxed Synchronization and Eager Scheduling in MapReduse


The Purpose Driven Cloud Part 2 - Technology Framework
Geva Perry - Founder & Blogger at Thinking Out Cloud

Framework based cloud platforms are designed to be used with a specific programming language, product or technology stack. They enable developers who subscribe to that particular programming model to ficus on the innovative aspect of their application and let the platform handle the problems of scalability, reliability, and performance (and in some cases, integration).


ElasTraS: An Elastic Transactional Data Store in the Cloud PDF
Company Profile: University of California Santa Barbara

By Sudipto Das, Divyakant Agrawal, and Amr El Abbadi. Abstract: Over the last couple of years, “Cloud Computing” or “Elastic Computing” has emerged as a compelling and successful paradigm for internet scale computing. One of the major contributing factors to this success is the elasticity of resources. In spite of the elasticity provided by the infrastructure and the scalable design of the applications, the elephant (or the underlying database), which drives most of these web-based applications, is not very elastic and scalable, and hence limits scalability. In this paper, we propose ElasTraS which addresses this issue of scalability and elasticity of the data store in a cloud computing environment to leverage from the elastic nature of the underlying infrastructure, while providing scalable transactional data access. This paper aims at providing the design of a system in progress, highlighting the major design choices, analyzing the different guarantees provided by the system, and identifying several important challenges for the research community striving for computing in the cloud.


Towards Optimizing Hadoop Provisioning in the Cloud PDF
Company Profile: Purdue University

By Karthik Kambatla and Abhinav Pathak, Purdue University; Himabindu Pucha, IBM Research Almaden. Abstract: Data analytics is becoming increasingly prominent in a variety of application areas ranging from extracting business intelligence to processing data from scientific studies. MapReduce programming paradigm lends itself well to these data-intensive analytics jobs, given its ability to scale-out and leverage several machines to parallely process data. In this work we argue that such MapReduce-based analytics are particularly synergistic with the pay-as-you-go model of a cloud platform. However, a key challenge facing end-users in this environment is the ability to provision MapReduce applications to minimize the incurred cost, while obtaining the best performance. This paper firstmotivates the importance of optimally provisioning a MapReduce job, and demonstrates that existing approaches can result in far from optimal provisioning. We then present a preliminary approach that improves MapReduce provisioning by analyzing and comparing resource consumption of the application at hand with a database of similar resource consumption signatures of other applications.


In Search of an API for Scalable File Systems: Under the Table or Above It? PDF
Company Profile: Carnegie Mellon University

By Swapnil Patil, Garth A Gibson, Gregory R Ganger, Julio Lopez, Milo Polte, Wittawat Tantisiroj, and Lin Xiao. Abstract: “Big Data” is everywhere – both the IT industry and the scientific computing community are routinely handling terabytes to petabytes of data. This preponderance of data has fueled the development of data-intensive scalable computing (DISC) systems that manage, process and store massive data-sets in a distributed manner. For example, Google and Yahoo have built their respective Internet services stack to distribute processing (MapReduce and Hadoop), to program computation (Sawzall and Pig) and to store the structured output data (Bigtable and HBase). Both these stacks are layered on their respective distributed file systems, GoogleFS and Hadoop distributed FS, that are designed “from scratch” to deliver high performance primarily for their anticipated DISC workloads. However, cluster file systems have been used by the high performance computing (HPC) community at even larger scales for more than a decade. These cluster file systems, including IBM GPFS, Panasas PanFS, PVFS and Lustre, are required to meet the scalability demands of highly parallel I/O access patterns generated by scientific applications that execute simultaneously on tens to hundreds of thousands of nodes. Thus, given the importance of scalable storage to both the DISC and the HPC world, we take a step back and ask ourselves if we are at a point where we can distill the key commonalities of these scalable file systems. This is not a paper about engineering yet another “right” file system or database, but rather about how do we evolve the most dominant data storage API – the file system interface – to provide the right abstraction for both DISC and HPC applications. What structures should be added to the file system to enable highly scalable and highly concurrent storage? Our goal is not to define the API calls per se, but to identify the file system abstractions that should be exposed to programmers to make their applications more powerful and portable. This paper highlights two such abstractions. First, we show how commodity large-scale file systems can support distributed data processing enabled by the Hadoop/MapReduce style of parallel programming frameworks. And second, we argue for an abstraction that supports indexing and searching based on extensible attributes, by interpreting BigTable as a file system with a filtered directory scan interface.


Contributions Results for Developers: Cloud Platforms

Showing 1 - 20 of 33 Next > Last >>

Run Your Core Business in the Cloud: SuiteCloud
Zach Nelson - President & CEO at NetSuite Inc
Company Profile: NetSuite Inc

NetSuite CEO Zach Nelson provides an introduction to SuiteCloud.


Learn How to Connect Java to the Cloud with jclouds
Adrian Cole - Founder at JClouds

The founder of jclouds, Adrian Cole, explains how to connect Java apps to the cloud. His open-source framework makes it possible to write cloud-portable apps for Amazon, VMWare, Azure, and Rackspace.


An Introduction to Heroku
James Lindenbaum - Co-Founder at Heroku

Heroku Co-Founder, James Lindenbaum, introduces us to Heroku, and Heroku's focus as an application platform.


Cloud Abstractions
James Lindenbaum - Co-Founder at Heroku

James Lindenbaum, CEO and Co-Founder of Heroku, discusses various distractions at play in the Cloud Computing market, including platform abstraction.


Heroku Developer Example
James Lindenbaum - Co-Founder at Heroku

Heroku Co-Founder, James Lindenbaum, provides an example of how a developer can utilize Heroku's application platform.


Ecosystems are the New Killer Apps
Chris Keene - CEO at WaveMaker Software, Inc

The CEO of Wavemaker Software discusses Ecosystems as the New Killer Apps at SIIA OnDemand.


Owen O'Malley on the Future of MapReduce
Owen O'Malley - VP of Hadoop & Software Architect at Yahoo

Owen O'Malley, Hadoop architect at Yahoo, describes the current state of affairs for MapReduce jobs in Hadoop, and walks through a variety of approaches that are being implemented to ensure better forward and backward compatibility.


Microsoft on the Windows Azure Platform
Doug Hauger - GM, Cloud Infrastructure Services at Microsoft Corporation

Doug Hauger, GM for Windows Azure at Microsoft, shares details of the business and channel model and pricing for windows Azure, SQL Azure, and .NET Services at WPC09.


Talking with Joyent's Director of Platform Strategy
James Duncan - VP of Product Development at Joyent, Inc

Dell's Cloud Computing Evangelist, Barton George, talks to James Duncan, Director of Platform Strategy at Joyent.


Joyent's JavaScript Smart Platform as a Service at the JSConf
James Duncan - VP of Product Development at Joyent, Inc

Joyent's Smart Platform is an open source Platform-as-a-Service, letting developers build scalable, complex web applications in JavaScript that runs on the server, and then deploy those applications into the Cloud with Git. James outlines some of the economic imperatives for the rise of Platform-as-a-Service, discusses the need for not only open standards but open source and demonstrates some of the Smart Platform's capabilities.


Contributions Results for Developers: Cloud Platforms

Showing 1 - 10 of 19 Next > Last >>

Infrastructure Management and Monitoring in Hybrid Cloud Environment PDF
Komal Gohil - SSE at iGATE

In today’s scenario, enterprises are using diverse technology platforms for their application portfolios, and as part of their Cloud journey, they are adopting multiple Clouds. This paper provides an approach for unified management and monitoring of enterprise’s infrastructure, platforms, and applications deployed in a hybrid environment comprising of Public and Private Cloud platforms offered by different vendors.


Applying ITIL to Cloud Operations
Komal Gohil - SSE at iGATE

Modern Enterprises entail many challenges like scalability, availability, agility, shorter time to market, increasing complexity, etc. Enterprises are adopting cloud to handle these challenges as well as derive business benefits by cost transformation and increased revenue. With cloud gaining momentum enterprise now have to manage applications in different environments namely on-premise and cloud. ITIL is a standard process that organizations follow in managing their data centers. They need to now see how ITIL can be applied to cloud operations without impacting the current on-premise set-up. This paper is our experience report on making ITIL services cloud aware. We have come up with a platform for managing hybrid environment. We have configured ITSM tool to add cloud elements as configuration items and implemented the complete life cycle for various manageable items. We close this paper with a note on related work like Continuous delivery, DevOps and Behavior Driven Operations and how this platform is a step towards achieving these goals and helping business become more agile.


Optimizing Applications for Cloud Computing Environments
Jason Carolan - Director of Cloud Solution Development at VMware, Inc

Cloud Computing can help you reduce costs, increase flexibility, and reduce risk. You can leverage the cloud to host applications ranging from the business critical to the experimental. But not all applications are suited for cloud computing environments. When deciding whether and how to move an application to the cloud, you must first assess the expected risks and rewards. Once you've determined that an application can run in the cloud, you next should determine whether it can be further optimized to harness the energy of the cloud. This paper presents a methodology for determining when and how to refactor applications for cloud computing environments.


A Comparison of Approaches to Large-Scale Data Analysis PDF
Company Profile: Massachusetts Institute of Technology (MIT)

A Comparison of Approaches to Large-Scale Data Analysis


ElasTraS: An Elastic Transactional Data Store in the Cloud PDF
Company Profile: University of California Santa Barbara

By Sudipto Das, Divyakant Agrawal, and Amr El Abbadi. Abstract: Over the last couple of years, “Cloud Computing” or “Elastic Computing” has emerged as a compelling and successful paradigm for internet scale computing. One of the major contributing factors to this success is the elasticity of resources. In spite of the elasticity provided by the infrastructure and the scalable design of the applications, the elephant (or the underlying database), which drives most of these web-based applications, is not very elastic and scalable, and hence limits scalability. In this paper, we propose ElasTraS which addresses this issue of scalability and elasticity of the data store in a cloud computing environment to leverage from the elastic nature of the underlying infrastructure, while providing scalable transactional data access. This paper aims at providing the design of a system in progress, highlighting the major design choices, analyzing the different guarantees provided by the system, and identifying several important challenges for the research community striving for computing in the cloud.


Towards Optimizing Hadoop Provisioning in the Cloud PDF
Company Profile: Purdue University

By Karthik Kambatla and Abhinav Pathak, Purdue University; Himabindu Pucha, IBM Research Almaden. Abstract: Data analytics is becoming increasingly prominent in a variety of application areas ranging from extracting business intelligence to processing data from scientific studies. MapReduce programming paradigm lends itself well to these data-intensive analytics jobs, given its ability to scale-out and leverage several machines to parallely process data. In this work we argue that such MapReduce-based analytics are particularly synergistic with the pay-as-you-go model of a cloud platform. However, a key challenge facing end-users in this environment is the ability to provision MapReduce applications to minimize the incurred cost, while obtaining the best performance. This paper firstmotivates the importance of optimally provisioning a MapReduce job, and demonstrates that existing approaches can result in far from optimal provisioning. We then present a preliminary approach that improves MapReduce provisioning by analyzing and comparing resource consumption of the application at hand with a database of similar resource consumption signatures of other applications.


In Search of an API for Scalable File Systems: Under the Table or Above It? PDF
Company Profile: Carnegie Mellon University

By Swapnil Patil, Garth A Gibson, Gregory R Ganger, Julio Lopez, Milo Polte, Wittawat Tantisiroj, and Lin Xiao. Abstract: “Big Data” is everywhere – both the IT industry and the scientific computing community are routinely handling terabytes to petabytes of data. This preponderance of data has fueled the development of data-intensive scalable computing (DISC) systems that manage, process and store massive data-sets in a distributed manner. For example, Google and Yahoo have built their respective Internet services stack to distribute processing (MapReduce and Hadoop), to program computation (Sawzall and Pig) and to store the structured output data (Bigtable and HBase). Both these stacks are layered on their respective distributed file systems, GoogleFS and Hadoop distributed FS, that are designed “from scratch” to deliver high performance primarily for their anticipated DISC workloads. However, cluster file systems have been used by the high performance computing (HPC) community at even larger scales for more than a decade. These cluster file systems, including IBM GPFS, Panasas PanFS, PVFS and Lustre, are required to meet the scalability demands of highly parallel I/O access patterns generated by scientific applications that execute simultaneously on tens to hundreds of thousands of nodes. Thus, given the importance of scalable storage to both the DISC and the HPC world, we take a step back and ask ourselves if we are at a point where we can distill the key commonalities of these scalable file systems. This is not a paper about engineering yet another “right” file system or database, but rather about how do we evolve the most dominant data storage API – the file system interface – to provide the right abstraction for both DISC and HPC applications. What structures should be added to the file system to enable highly scalable and highly concurrent storage? Our goal is not to define the API calls per se, but to identify the file system abstractions that should be exposed to programmers to make their applications more powerful and portable. This paper highlights two such abstractions. First, we show how commodity large-scale file systems can support distributed data processing enabled by the Hadoop/MapReduce style of parallel programming frameworks. And second, we argue for an abstraction that supports indexing and searching based on extensible attributes, by interpreting BigTable as a file system with a filtered directory scan interface.


Mochi: Visual Log-Analysis Based Tools for Debugging Hadoop PDF
Company Profile: Carnegie Mellon University

By Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi, and Priya Narasimhan. Abstract: Mochi, a new visual, log-analysis based debugging tool correlates Hadoop’s behavior in space, time and volume, and extracts a causal, unified control- and data- flow model of Hadoop across the nodes of a cluster. Mochi’s analysis produces visualizations of Hadoop’s behavior using which users can reason about and debug performance issues. We provide examples of Mochi’s value in revealing a Hadoop job’s structure, in optimizing real-world workloads, and in identifying anomalous Hadoop behavior, on the Yahoo! M45 Hadoop cluster.


Cloud Servers Developer Guide PDF
Erik Carlin - Sr Architect, The Rackspace Cloud at Rackspace Hosting, Inc

This document is intended for software developers interested in developing applications using the Cloud Servers Application Programming Interface (API).


Cloud Computing Infrastructure and Architecture PDF
Jason Carolan - Director of Cloud Solution Development at VMware, Inc

Cloud computing promises to speed application deployment, increase innovation, and lower costs, all while increasing business agility. But it also transforms the way we design, build, and deliver applications. What are the architectural considerations that enterprises must make when adopting cloud computing technology? This white paper discusses the nature of cloud computing and how it is transforming the way that enterprises everywhere build and deploy applications. It proceeds to discuss the architectural considerations that cloud architects must make when designing cloud-based applications, and concludes with a discussion of Sun's technologies that support cloud computing. Also Available in PDF Format


A Comparative Study of Approaches to Cluster-Based Large Scale Data Analysis
Company Profile: Massachusetts Institute of Technology (MIT)

This is a collaborative study being conducted by MIT, University of Wisconsin, and Yale University. These three universities are using a National Science Foundation CLuE grants for a comparative study of approaches to cluster-based, large-scale data analysis. Both MapReduce and parallel database systems provide scalable data processing over hundreds to thousands of nodes, yet it's important for researchers to know the differences in performance and scalability of these two approaches to know which is more suitable when designing new data-intensive computing applications. This project is engaged in systems research, much of which requires the ability to change the operating environment. Since this is not possible on the IBM/Google cluster, the project is also hosted on the Cloud Computi ....
A Hadoop Toolkit for Distributed Text Retrieval
Company Profile: University of Maryland College Park

Text search is a technology that is vital for modern information-based societies. Today's systems face the daunting challenge of handling quantities of text previously unimaginable. Cluster computing is the only practical solution for addressing the issue of scale. This project leverages the MapReduce framework (via the open-source Hadoop implementation) to tackle issues of robustness and scalability in processing large amounts of data for information retrieval applications.
Aneka
Company Profile: University of Melbourne

Aneka is a market oriented Cloud development and management platform with rapid application development and workload distribution capabilities. Aneka is an integrated middleware package which allows you to seamlessly build and manage an interconnected network in addition to accelerating development, deployment and management of distributed applications using Microsoft .NET frameworks on these networks. It is market oriented since it allows you to build, schedule, provision and monitor results using pricing, accounting, QoS/SLA services in private and/or public (leased) network environments.
AppScale
Company Profile: University of California Santa Barbara

AppScale is an open-source implementation of the Google AppEngine cloud computing interface from the RACELab at UC Santa Barbara. AppScale enables execution of GAE applications on virtualized cluster systems. In Particular, AppScale enables users to execute GAE applications using their own clusters with greater scalability and reliability than the GAE SDK provides. Moreover, AppScale executes automatically and transparently over cloud infrastructures such as the Amazon Web Services (AWS) Elastic Compute Cloud (EC2) and Eucalyptus, the open-source implementation of the AWS interfaces. ....
Eager Maps and Lazy Folds for Graph Structured Applications
Company Profile: Purdue University

This project is investigating linguistic extensions to MapReduce abstractions for programming modern, large-scale systems, with special focus on applications that manipulate large, unstructured graphs. This will impact a broad class of scientific applications.
The FLAMINGO Project on Data Cleaning
Company Profile: University of California, Irvine

Abstract: In many applications, data-quality issues resulting from a variety of errors create inconsistencies in structures, representations or semantics. Dealing with these issues is becoming increasingly important as the value of data being processed increases. This project is providing support for efficient fuzzy queries on large text repositories. Supporting fuzzy queries can ultimately help applications mitigate their data quality issues because entities with different representations can be matched. ....
Cloud computing underwhelms PHP developers
By: Paul Krill
"While technology vendors continue to pound home the message of cloud computing, PHP developers Tuesday viewed the concept as overhyped and were not in agreement on its benefits. Developers at the ZendCon 2010 PHP conference in Santa, Clara, Calif., heard Zend Technologies CEO Andi Gutmans tout the company's cloud computing plans, which involve developing Zend PHP Cloud Platform. During his presentation, however, developers appeared mostly underwhelmed when Gutmans asked if cloud computing was game-changing or just hype. Afterward, developers gave cloud computing mixed reviews."
read the full article >>