Download

This page lists the OpenRefine distributions and extensions available for download. Are we missing something? Want to fix a typo? You can submit changes (as a pull request) from here.

Official Distribution

Read the installation instructions.

You can also download all official releases and source code from our GitHub releases page.

OpenRefine 3.4.1

OpenRefine 3.4.1 is the latest stable release, published on September 24, 2020. Please back up your workspace directory before installing, and report any problems that you encounter. A change log is provided on the release page.
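
As an illustration of the backup step, here is a minimal Python sketch that zips a workspace directory into a timestamped archive. The function name backup_workspace, the archive naming scheme, and the example paths are assumptions made for this sketch, not part of OpenRefine; substitute the actual location of your workspace directory.

```python
import shutil
import time
from pathlib import Path


def backup_workspace(workspace: Path, dest_dir: Path) -> Path:
    """Zip the contents of `workspace` into `dest_dir` and return the archive path.

    Hypothetical helper for illustration; it is not shipped with OpenRefine.
    """
    if not workspace.is_dir():
        raise FileNotFoundError(f"workspace directory not found: {workspace}")
    dest_dir.mkdir(parents=True, exist_ok=True)
    # A timestamp in the name keeps earlier backups from being overwritten.
    stamp = time.strftime("%Y%m%d-%H%M%S")
    base_name = dest_dir / f"openrefine-workspace-{stamp}"
    # make_archive appends ".zip" and stores paths relative to root_dir.
    archive = shutil.make_archive(str(base_name), "zip", root_dir=str(workspace))
    return Path(archive)


# Example call (both paths are placeholders, adjust them to your system):
# backup_workspace(Path("/path/to/openrefine/workspace"), Path("/path/to/backups"))
```

The archive is written outside the workspace directory so that the backup does not end up inside the data it is copying.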

OpenRefine 3.5 beta1

This is the first beta release of OpenRefine 3.5, published on May 29, 2021. Please back up your workspace directory before installing, and report any problems that you encounter. A change log is provided on the release page.

Other Distributions

The following distributions have been customized for a specific use or for integration with other technologies.

Distribution | Description | Authors
LODRefine | LODRefine is OpenRefine with integrated extensions that make the transition from tabular data to Linked Data a bit easier. The integrated extensions are: RDF extension, DBpedia extension, CrowdFlower crowdsourcing extension, Stats extension. | Sparkica
OpenDataRise | Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine. | Open Data in Trentino
p3-batchrefine | BatchRefine adds batch processing capabilities to OpenRefine and supports multiple back ends, including Spark. | SpazioDati
RefineOnSpark | RefineOnSpark is a driver program to run OpenRefine jobs on a Spark cluster. | SpazioDati
Reconciliation-and-Matching-Framework | A framework for matching string entities using customised sets of transformations and matchers, plus a tool to produce the necessary configurations and another to expose them as OpenRefine reconciliation services. | RBGKew
OntoRefine | GraphDB OntoRefine is a data transformation tool, based on OpenRefine and integrated in the GraphDB Workbench. It can be used for converting tabular data into RDF and importing it into a GraphDB repository, using simple SPARQL queries and a virtual endpoint. | Ontotext

List of Extensions

You can install the following extensions to add functionality to OpenRefine. We do not vouch for the function of any of these extensions. Some only work with earlier versions of OpenRefine, or with third-party distributions. See how to install an extension.

Extension Name | Description | Authors | Latest compatible version
FAIR metadata | Supports FAIR metadata by integrating with FAIR Data Point to store your data and export to FAIR. It is a successor of the legacy FAIRifier OpenRefine 2.6 fork. Manual. | FAIRDataTeam | 3.4.1
VIB-Bits | Adds history tools, pivot tool, cross() graphical user interface (GUI), and scatterplot tool. Manual PDF. | VIB-Bits | 3.4.1
Named-Entity Recognition | Extracts identifiable or reconcilable entities from text fields. It supports AlchemyAPI, DBpedia Lookup, and Zemanta. Manual. | Ruben Verborgh and Stuart Kenny | 3.4.1
String-Transformers | Adds a collection of Java string transformers to Jython functions. Some are generic, some are aimed at botany and taxonomy. Download this and this and follow their installation instructions. | RBGKew | 3.4.1
RDF extension | RDF Refine, an improved fork for exporting RDF. Adds a graphical user interface (GUI) for exporting data in RDF format. | stkenny & DRI | 3.4.1
GOKb Utilities | Adds three new features: prepend rows (add new blank rows); trim all data (remove leading/trailing whitespace across all cells); extractHost() (GREL function that extracts a host name from a URL). | GOKb & Owen Stephens | 3.1
Stats extension for Google Refine 2.5+ | Computes elementary statistics on column data. | sparkica | 3.1
CKAN Storage Extension | Exports data to CKAN Storage 2.2+ using the CKAN storage API. | Ontodia | 3.1
GeoRefine | Adds geospatial processing GREL functions. | Ryan Baumann | 2.8
D2Refine | A clinical study metadata harmonization and validation workbench to 1) convert restrictions (specified in a spreadsheet-like interface) into a clinical model; and 2) provide reconciliation services leveraging the Common Terminology Services 2 (CTS2). Manual. | Deepak K. Sharma | 2.7

Legacy Extensions

These extensions were produced in the past and are good examples of desired uses of OpenRefine. They are maintained here for historical reference, and in case a contributor would like to renew them.

Extension Name | Description | Authors | Suggested compatible version
VIB-BITS Diff plugin | Adds a “diff” tool which compares two text values. Manual PDF. | VIB-Bits | 3.4.1. The VIB website states “We are investigating how to provide [the diff plugin] again with version 3.4 of OpenRefine.”
VIB-BITS Save facets plugin | Adds support for saving facet settings and re-applying the same facets at a later time. Manual PDF. | VIB-Bits | Unknown. The VIB website states “We are investigating how to provide [the diff plugin] again with version 3.4 of OpenRefine.”
geoXtension | geoXtension is based on the GDAL, Proj, and GEOS libraries. Installing this extension takes a long time; using the ready-made Docker image is recommended. | giTorto | (circa 2.6)
extraCTU-plugin | Extracts e-mails, telephone numbers, URLs, and identification numbers from text. | giTorto | (circa 2.7)
Opentree | Displays phylogenetic trees from the Open Tree of Life. Manual. | nickynicolson | 2.5
BioVeL | Adds functions for cleaning & clustering biodiversity data. The entire package can be installed and run locally as well as on a dedicated server. | Botanic Garden and Botanical Museum Berlin-Dahlem | 2.5
DBpedia extension | Adds columns from DBpedia to reconciled data. Data has to be reconciled with DBpedia or a SPARQL endpoint that returns DBpedia resource types. | Zemanta | 2.5
Stats extension for Google Refine 2.1 | Computes elementary statistics on column data. | The Chicago Tribune | 2.1

Reconciliation Services

OpenRefine can connect to several reconciliation services. Please read our wiki for a list of them.

Client Libraries

You can automate some OpenRefine operations using one of the existing libraries. These libraries use the OpenRefine API.
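
To give a rough idea of what such a library does under the hood, the following Python sketch lists the projects of a running OpenRefine instance over HTTP. It assumes that OpenRefine is listening on its default local address (http://127.0.0.1:3333/), that the get-all-project-metadata command is available at the path shown, and that the response is a JSON object with a "projects" mapping from project ID to metadata. The helper names are hypothetical, and the existing client libraries may expose this quite differently.

```python
import json
from urllib.parse import urljoin
from urllib.request import urlopen

# Assumption: OpenRefine runs locally on its default address and port.
BASE_URL = "http://127.0.0.1:3333/"


def metadata_url(base_url: str = BASE_URL) -> str:
    """Build the URL of the command that returns metadata for all projects."""
    return urljoin(base_url, "command/core/get-all-project-metadata")


def parse_project_names(payload: str) -> dict:
    """Map project ID -> project name from a JSON response body.

    Assumed response shape: {"projects": {"<id>": {"name": "...", ...}}}
    """
    projects = json.loads(payload).get("projects", {})
    return {project_id: meta.get("name") for project_id, meta in projects.items()}


def list_projects(base_url: str = BASE_URL) -> dict:
    """Fetch and parse the project list; requires a running OpenRefine instance."""
    with urlopen(metadata_url(base_url)) as response:
        return parse_project_names(response.read().decode("utf-8"))
```

Parsing is kept separate from the network call so that the response handling can be checked against a sample payload without a live server.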