Hadoop is an open-source software framework for storage and processing of datasets on clusters of commodity hardware