Our open source project information is based on the Black Duck KnowledgeBase. Black Duck Software continuously spiders the Internet collecting information on open source code and binary files into a vast and growing KnowledgeBase. The Black Duck KnowledgeBase is the industry's most comprehensive database of open source software and associated license and other information. It covers over 200,000 projects collected from over 4,300 code forges and repositories. Since our founding in 2002, Black Duck has collected tens of billions of lines of code. We don't delete old files/versions and therefore have amassed a significant amount of information on code that is no longer available anywhere else.
This page offers public information about the language use of open source projects. Please keep checking back for regular updates. The data on this page is refreshed regularly.
An open source cloud platform for creating a private cloud infrastructure inside an organization’s firewall. It supports different hypervisors and can deploy instances in public clouds.
A data warehouse system for Terabyte and Petabyte scale analytics, built on top of Map-Reduce architecture. The current code has been developed to Hadoop’s map-reduce implementation.
An API that abstracts the differences between clouds; protects your apps from cloud API changes and incompatibilities, so you can concentrate on managing cloud instances the way you want.
An Infrastructure as a Service (IaaS) software framework for private cloud computing that provides users with the ability to run and control isolated collections of virtual machine instances.
An open source web desktop following the cloud computing concept that leverages collaboration and communication among users. It is mainly written in PHP, XML, and JavaScript. It acts as a platform for web applications written using the eyeOS Toolkit. It includes a Desktop environment with 67 applications and system utilities. Accessible by portable devices via a mobile front end.
An open source mobile cloud sync project that provides address book, calendar (PIM), email synchronization using the SyncML (OMA) protocol. It offers a sync server and clients for most mobile platforms.
A software framework that supports data-intensive distributed applications, enabling applications to work with thousands of nodes and petabytes of data.
An open source framework that helps you get started in the cloud and reuse your java and clojure development skills. The API gives you the freedom to use portable abstractions or cloud-specific features. Supports many clouds including Amazon, VMWare, Azure, and Rackspace.
An open-source, enterprise-oriented cloud management tool developed at NYTimes.com. The Nimbul UI helps you manage and monitor cloud provider accounts, server clusters and instances, and user accounts. Nimbul also offers a publisher framework and command-line tools.
A toolkit for wrapping scientific applications as Web services on Grid and cloud resources without writing a single line of code. Users can access scientific codes using simple Web service APIs from a multitude of languages and platforms.
An open source fork of the Enomaly Elastic Computing Platform (ECP) following its commercialisation. It is a web-based management platform for hypervisors which can be used to create "public" and "private" cloud computing environments.
OpenI is an open source Business Intelligence application for on-demand or SaaS deployments. Based on J2EE, OpenI is an out-of-box solution to easily visualize data from OLAP and relational databases.
OpenNebula is a Virtual Infrastructure Manager that orchestrates storage, network and virtualization technologies to enable the dynamic placement of multi-tier services (groups of interconnected virtual machines) on distributed infrastructures, combining both data center resources and remote cloud resources, according to allocation policies
The ubiquitous Open Source Cloud Computing platform that will meet the needs of public and private clouds regardless of size, by being simple to implement and massively scalable.
A redundant, self-curing and auto-scaling hosting environment built on Amazon EC2. It allows you to create server farms through a web-based interface using prebuilt Amazon machine images (AMI) for load balancers (pound or nginx), app servers (Apache, others), databases (MySql master-slave, others), and a generic AMI to build on.
Provides a way to rapidly clone virtual machines for users in cloud-computing environments. Based on Python scripts and a C library using an older version of Citrix's Xen hypervisor.
Language Breakdown in Open Source
We’ve taken the KnowledgeBase of open source code and broken it down by language using source lines of code. The first table shows the breakdown by language for the last release of all open source projects in the Black Duck KnowledgeBase. This code spans many years including projects that were written in the early days of the Internet. Most releases are older than 12 months. The second table shows a breakdown by language for only those projects that produced a release in the most recent twelve month period. A comparison of these two tables indicates the trend and provides insight into how the most recent choices of developers compares with longer term history. The difference between these two tables, i.e. the gain/loss in share, is also shown. See below for more information about the methodology used to produce this data.
All open source project releases
Rank
Language
%
1
C
40.31
2
C++
14.49
3
Java
10.58
4
Shell
8.02
5
Javascript
5.92
6
PHP
4.97
7
Perl
2.90
8
Python
2.82
9
SQL
1.54
10
Assembler
1.31
11
C#
1.22
12
Ruby
1.01
13
Pascal
0.76
14
Ada
0.54
15
XML Schema
0.50
Releases within the last 12 months
Rank
Language
%
1
C
36.36
2
C++
17.28
3
Java
9.65
4
Javascript
7.48
5
Shell
5.59
6
PHP
5.23
7
Python
2.83
8
Perl
2.29
9
SQL
1.70
10
Ruby
1.59
11
Assembler
1.56
12
Ada
1.20
13
XML Schema
1.15
14
C#
1.12
15
D
0.81
Methodology
To calculate the percent share represented by each language of open source, Black Duck analyzed the information on all the open source projects that we have collected in our KnowledgeBase. To avoid double counting, we only use information on the latest code release from each project. When our spidering process archives information from an open source project, we record the release date of that code. In this analysis, we used only the latest release from each open source project - to avoid double counting. We show two columns of data. One includes the latest releases of all projects. The second shows only those releases that have a date stamp within the last twelve months. About 16% of all projects have released code in the last 12 months. The release date of the code (not the date of collection) was used to determine whether a release took place within the last 12-months.
We run an analyzer that identifies the language of each project file and counts the number of lines in the file. White space and comments are removed from the line count. The lines-of-code counter does not attempt to compensate for the fact that some languages are more verbose than others. We did not include html, css, documentation and most other non-executing file types in the analysis. Binary files are also not included in this calculation. A previous analysis indicated that on average there are four binary files for every source file in open source projects. Projects with no source code, i.e. empty projects or binaries only, were not included in this analysis.
Encryption in Open Source
Our analysis of projects in the Black Duck KnowledgeBase indicates that over 14,000 open source projects contain encryption algorithms.
Top 10 Encryption Algorithms Used in Open Source Software
Algorithm
%
Type
Encryption Only
RSA
13%
Asymmetric
DSA
9%
Signature
*
DES
9%
Symmetric
MD5
8%
Hash
*
SHA
8%
Hash
*
Blowfish
6%
Symmetric
Diffie-Hellman
6%
Keyman
HMAC
5%
Mac
*
ElGamal
5%
Asymmetric
AES
5%
Symmetric
Sub Total
74%
Other
26%
Total
100%
Top Searches by Language from Black Duck Koders.com
Kodes.com is Black Duck's popular free code search website. We collect the top searches and language reports are generated each month from the previous month's search data. Three different categories of data are provided for each language.
Top Searches
Top Searches are the most popular search words and phrases for each programming language.
Top Solutions
The Solution Map is a heatmap that displays search terms directly linked to the most downloaded files associated with those terms. Darker boxes indicate a higher correlation of searches to particular files. In other words, when multiple developers, searching for the same thing, identify the same file as useful, that Solution Map entry's score goes up.
Top Projects
Projects are ranked by the number of code downloads developers have made from each project.
Choose a Language
Choose from the chart below to view the Black Duck Koders.com language report for a particular language.