Open Source

By danberke

Puppet

 
 
 
Put simply, Puppet is a system for automating system administration tasks. To learn more, read our big picture overview of Puppet, or take a deeper look at what Puppet can do with the Puppet Introduction. There’s also an about Puppet page which gives the highlights of Puppet’s functionality.
 
 

Pig

 
 
 
We are creating infrastructure to support ad-hoc analysis of very large data sets. Parallel processing is the name of the game. Our system runs on a cluster computing architecture, on top of which sit several layers of abstraction that ultimately bring the power of parallel computing into the hands of ordinary users. The layers in between automatically translate user queries into efficient parallel evaluation plans, and orchestrate their execution on the raw cluster hardware.
 
 

Nimbus

 
 
 
The University of Chicago Science Cloud, codenamed “Nimbus”, provides compute capability in the form of Xen virtual machines (VMs) that are deployed on physical nodes of the University of Chicago TeraPort cluster (currently 16 nodes) using the workspace service.

Nimbus is available to all members of scientific community wanting to run in the cloud. To obtain access you will need to provide a justification (a few sentences explaining your science project) and a valid grid credential (If you don’t have a credential, email us. We can help). Based on the project, you will be given an allocation on the cloud.

 
 

Hadoop

 
 
     
 

Eucalyptus

 
 
 
EUCALYPTUS – Elastic Utility Computing Architecture for Linking Your Programs To Useful Systems – is an open-source software infrastructure for implementing “cloud computing” on clusters. The current interface to EUCALYPTUS is compatible with Amazon’s EC2 interface, but the infrastructure is designed to support multiple client-side interfaces. EUCALYPTUS is implemented using commonly-available Linux tools and basic Web-service technologies making it easy to install and maintain.
 
 

OpenNebula

 
 
     
 

Enomalism

 
 
 
Enomalism is an open source web-based virtual infrastructure platform. Designed to answer the complexity of managing globally disperse virtual server environments. Enomalism helps to automate the transition to a cloud computing environment by reducing an IT organizations overall workload. The easy to use dashboard can help with issues including deployment planning, load balancing, automatic VM migration, configuration management, capacity diagnosis and resource monitoring/metering.

Gridgain

 
 
 
GridGain is focused on doing one thing – providing the computational grid platform for Java.
 
 

Globus Toolkit

 
 
 
The open source Globus Toolkit is a fundamental enabling technology for the “Grid,” letting people share computing power, databases, and other tools securely online across corporate, institutional, and geographic boundaries without sacrificing local autonomy. The toolkit includes software services and libraries for resource monitoring, discovery, and management, plus security and file management.
 
 

Mosix

 
 
 
MOSIX is a management system that allows a Linux cluster or a Grid of clusters to perform like a single computer with multiple processors. It is particularly suitable to run intensive computing and applications with moderate amounts of I/O.
 
 

Jini

 
 
 
Jini.org is a central place and resource for the Jini CommunitySM. It is a site to discover new information, discuss, collaborate, exchange source code and ideas, and advance Jini™ network technology.
Jini network technology is an open software architecture that enables the creation of network-centric solutions which are highly adaptive to change.
 
 

SUN Grid Engine

 
 
 
The Grid Engine project is an open source community effort to facilitate the adoption of distributed computing solutions. Sponsored by Sun Microsystems and hosted by CollabNet, the Grid Engine project provides enabling distributed resource management software for wide ranging requirements from compute farms to grid computing.
 
 

Unicore

 
 
 
UNICORE (Uniform Interface to Computing Resources) offers a ready-to-run Grid system including client and server software. UNICORE makes distributed computing and data resources available in a seamless and secure way in intranets and the internet.
 
 

Open MPI

 
 
 
A High Performance Message Passing Library
Open MPI is a project combining technologies and resources from several other projects (FT-MPI, LA-MPI, LAM/MPI, and PACX-MPI) in order to build the best MPI library available. A completely new MPI-2 compliant implementation, Open MPI offers advantages for system and software vendors, application developers and computer science researchers.
 
 

OSCAR

 
 
 
OSCAR (Open Source Cluster Application Resources) is a snapshot of the best known methods for building, programming, and using HPC clusters. It consists of a fully integrated and easy to install software bundle designed for high performance cluster computing. Everything needed to install, build, maintain, and use a Linux cluster is included in the suite, making it unnecessary to download or even install any individual software packages on your cluster.
 
 

Xen

 
 
 
Modern computers are sufficiently powerful to use virtualization to present the illusion of many smaller virtual machines (VMs), each running a separate operating system instance. Successful partitioning of a machine to support the concurrent execution of multiple operating systems poses several challenges. Firstly, virtual machines must be isolated from one another: it is not acceptable for the execution of one to adversely affect the performance of another. This is particularly true when virtual machines are owned by mutually untrusting users. Secondly, it is necessary to support a variety of different operating systems to accommodate the heterogeneity of popular applications. Thirdly, the performance overhead introduced by virtualization should be small.
 
 

OGSA-DAI

 
 
 
The aim of the OGSA-DAI project is to develop middleware to assist with access and integration of data from separate sources via the grid. The project was conceived by the UK Database Task Force and is working closely with the Global Grid Forum DAIS-WG, the OMII and the Globus team.
 
 

OpenVZ

 
 
 
OpenVZ is an Operating System-level server virtualization solution, built on Linux. OpenVZ creates isolated, secure virtual private servers (VPSs) or virtual environments on a single physical server enabling better server utilization and ensuring that applications do not conflict. Each VPS performs and executes exactly like a stand-alone server; VPSs can be rebooted independently and have root access, users, IP addresses, memory, processes, files, applications, system libraries and configuration files.
 
 

openQRM

 
 
 
openQRM is designed to deal with all sorts of failures automatically, thus preventing interrupts because of unexpected events.
Implementing openQRM greatly improves the reliability of the x86 data-center.
openQRM is an open source systems management platform which integrates with existing components in enterprise data centers to create scalable, highly available and customizable infrastructures.
 
 

Gridsphere

 
 
 
The GridSphere portal framework provides an open-source portlet based Web portal. GridSphere enables developers to quickly develop and package third-party portlet web applications that can be run and administered within the GridSphere portlet container. Here you will find the GridSphere portal framework available for download and documentation related to the installation and development of portlets using GridSphere.
 
 

GAT – Grid Application Toolkit

 
 
 
The objective of this workpackage is to design and build a Grid Application Toolkit (GAT) and to plug-in to this GAT the services developed in other GridLab workpackages.
GAT is a set of coordinated, generic and flexible APIs for accessing Grid services from e.g. generic application codes, portals, data managements systems, together with working implementations provided by the tools developed in the Grid Lab project (See the figure below). GAT is designed in a modular plug-and-play manner, such that tools developed anywhere can be plugged into GAT.
 
 

Mandriva

 
 
 
Mandriva is a worldwide Linux and Open Source leader providing easy-to-use solutions to individuals and organizations.
 
 

Alchemi

 
 
 
Alchemi is an open source software framework that allows you to painlessly aggregate the computing power of networked machines into a virtual supercomputer (desktop grid) and to develop applications to run on the grid.

It has been designed with the primary goal of being easy to use without sacrificing power and flexibility.

Alchemi includes:
The runtime machinery (Windows executables) to construct computational grids.
A .NET API and tools to develop .NET grid applications and grid-enable legacy applications.

 
 

NGRID

 
 
 
NGrid is an open source (LGPL) grid computing framework written in C#. NGrid aims to be platform independent via the Mono project. NGrid aims to provide

a transparent multithread programming model for grid programming.
a physical grid framework & some grid implementations.
common utilities both for grid programming or grid implementations.

 
 

Crossbow – Network Virtualization and Resource Control

 
 
 
Crossbow provides the building blocks for network virtualization and resource control by virtualizing the stack and NIC around any service (HTTP, HTTPS, FTP, NFS, etc.), protocol or Virtual machine.

Each virtual stack can be assigned its own priority and bandwidth on a shared NIC without causing any performance degradation. The architecture dynamically manages priority and bandwidth resources, and can provide better defense against denial-of-service attacks directed at a particular service or virtual machine by isolating the impact just to that entity. The virtual stacks are separated by means of H/W classification engine such that traffic for one stack does not impact other virtual stacks.

Project Crossbow is next step in the evolution of Solaris networking stack and brings bandwidth resource control and virtualization as part of the architecture itself instead of the usual add-on layers which have heavy overheads and complexity.

 
 

ProActive

 
 
 
ProActive is a GRID middleware (a Java library with Open Source code under LGPL license) for parallel, distributed, and concurrent computing, also featuring mobility and security in a uniform framework. With a reduced set of simple primitives, ProActive provides a comprehensive API to simplify the programming of Grid Computing applications: distributed on Local Area Network (LAN), on clusters of workstations, or on Internet Grids. Portability, Openness, Agility: Write Once, Deploy Everywhere !
 
 

Solr

 
 
 
Solr is an open source enterprise search server based on the Lucene Java search library, with XML/HTTP and JSON APIs, hit highlighting, faceted search, caching, replication, and a web administration interface. It runs in a Java servlet container such as Tomcat.
 
 

Distributed Search for Solr

 
 
     
 

Hadoop

 
 
 
Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data.
 
 

dCache

 
 
     
 

Linux Virtual Server

 
 
     
 

VirtualBox

 
 
 
innotek VirtualBox is a family of powerful x86 virtualization products for enterprise as well as home use. Not only is VirtualBox an extremely feature rich, high performance product for enterprise customers, it is also the only professional solution that is freely available as Open Source Software under the terms of the GNU General Public License (GPL). See “About VirtualBox” for an introduction; see “innotek” for more about our company.
 
 

Talend Open Studio – open source data integration solution

 
 
 
Talend Open Studio, the industry’s first pure open source data integration solution, combines metadata-driven design and execution, with an easy-to-use graphical development environment, to deliver better scalability at a lower total cost of ownership than traditional data integration or Extract, Transform and Load (ETL) solutions. Talend’s technology and business vision shatters the traditional proprietary model and provides the flexibility required to meet the needs of all organizations – regardless of their size, level of expertise or budgetary constraints. To download, please visit

Leave a Reply