You will find on this page a list of OpenRefine distributions and extensions available for download. Are we missing something? Want to fix a typo? You can submit changes (pull request) from here.
Official Distribution
Read the installation instructions
You can also download all official releases and source from our GitHub releases page
OpenRefine 3.4.1
The latest stable release of OpenRefine 3.4.1, released on September 24, 2020. Please backup your workspace directory before installing and report any problems that you encounter. A change log is provided on the release page.
- Windows kit, This requires Java to be installed on your computer. Download, unzip, and double-click on openrefine.exe or refine.bat if the latter does not work.
- Windows kit with embedded Java, Download, unzip, and double-click on openrefine.exe or refine.bat if the latter does not work.
- Mac kit, Download, open, drag icon into the Applications folder and double click on it. You do not need to install Java separately.
- Linux kit, Download, extract, then type ./refine to start. This requires Java to be installed on your computer.
OpenRefine 3.3
The stable release of OpenRefine 3.3, released on January 31, 2020. Please backup your workspace directory before installing and report any problems that you encounter. A change log is provided on the release page.
- Windows kit, Download, unzip, and double-click on openrefine.exe. If you’re having issues with the above, try double-clicking on refine.bat instead.
- Mac kit, Download, open, drag icon into the Applications folder and double click on it. If you encounter a security warning, see workaround.
- Linux kit, Download, extract, then type ./refine to start.
Other Distributions
The following distributions have been customized for a specific usage or integration with other technologies.
| Distribution | Description | Authors |
|---|---|---|
| LODRefine | LODRefine is actually OpenRefine with integrated extensions that make transition from tabular data to Linked Data a bit easier. Integrated extensions are: RDF extension, DBpedia extension, Crowdsourcing extension, Stats extension | Sparkica |
| OpenDataRise | Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine. | Open Data in Trentino |
| p3-batchrefine | BatchRefine adds batch processing capabilities to OpenRefine and support multiple back end including spark | SpazioDati |
| RefineOnSpark | RefineOnSpark is a driver program to run OpenRefine jobs on the Spark cluster | SpazioDati |
| Reconciliation-and-Matching-Framework | A framework to allow the matching of string entities using customised sets of transformations and matchers, plus a tool to produce the necessary configurations and another to expose them as OpenRefine reconciliation services. | RBGKew |
| OntoRefine | GraphDB OntoRefine is a data transformation tool, based on OpenRefine and integrated in the GraphDB Workbench. It can be used for converting tabular data into RDF and importing it into a GraphDB repository, using simple SPARQL queries and a virtual endpoint. | Ontotext |
List of Extensions
You can install the following extensions to add functionalities to OpenRefine. See how to install an extension.
| Extension Name | Description | Authors |
|---|---|---|
| (FAIR) metadata | Support FAIR metadata by integrating with FAIR Data Point (metadata storage) and various data storages. It is a successor of the legacy FAIRifier (OpenRefine 2.6 fork) | FAIRDataTeam |
| RDF extension Latest for OpenRefine 3.0+ | RDF Refine - an improved fork of OpenRefine extension for exporting RDF | stkenny http://www.dri.ie/ |
| RDF extension 0.9 for OpenRefine 2.6 and prior | RDF Refine - an older version of RDF extension for exporting RDF | DERI |
| Opentree | An extension for Google/Open Refine to display phylogenetic trees from the Open Tree of Life | nickynicolson |
| BioVel, BioVel@GitHub | The BioVeL Extension adds functionality specific to cleaning biodiversity data. The entire package can be installed and run locally as well as on a dedicated server. | Botanic Garden and Botanical Museum Berlin-Dahlem |
| GeoRefine | Add Geospatial processing GREL functions for OpenRefine | Ryan Baumann |
| String-Transformers | A collection of Java string transformers, suitable for use with OpenRefine. Some generic, some aimed at botany and taxonomy. | RBGKew |
| CKAN Extension Storage for OpenRefine 2.6-beta1 and CKANv2.2+ | Upload data directly from OpenRefine using CKAN storage API | Ontodia |
| CKAN Extension Storage v0.1 (alpha) | This extension allows data of Google Refine projects to be uploaded to CKAN Storage and connected to a package on a running CKAN instance. released 2014-03-06 | DERI |
| GOKB | The GOKb extension will allow you to use OpenRefine to check in a project, resolve errors and warnings, and ingest data into GOKb. | |
| Crowdsourcing | An extension for OpenRefine for creating jobs and uploading data to CrowdFlower crowdsourcing service | Zemanta |
| DBpedia extension | This project is an extension for Google Refine and provides easy way to add columns from DBpedia to reconciled data (columns). Data has to be reconciled with DBpedia or SPARQL endpoint, which returns DBpedia resource types. | Zemanta |
| History tools, pivot tool, cross function gui, and scatterplot tool using D3 | A manual is provided via the download page. The project is coded in Javascript solely so the code distributed is also the source code | VIB-BITS |
| Diff plugin | A manual is provided via the download page. The project is coded in Javascript solely so the code distributed is also the source code | VIB-BITS |
| LMF Extension | The Linked Media Framework is an easy-to-setup server application that bundles together some key open source projects to offer some advanced services for linked media management. The core development of this project has been moved to Apache Marmotta | Salzburg Research |
| Named-Entity Recognition | With this extension, you can enrich text fields right from your workspace. It supports AlchemyAPI, DBpedia Lookup and Zemanta. - see also their documentation | Ruben Verborgh (Free Your Metadata) and Stuart Kenny |
| Stats extension for Google Refine 2.1 | A extension for OpenRefine to compute elementary statistics on a column data. | the Chicago Tribune |
| Stats extension for Google Refine 2.5+ | A extension for OpenRefine to compute elementary statistics on a column data. | sparkica |
| extraCTU-plugin | to extract E-mails, Telephone numbers, Urls and the Identification Numbers from a messy text | giTorto |
| geoXtension | geoXtension is based on gdal, proj and geos libraries. Adding this extension to OpenRefine takes a lot of time, usage of the ready-made docker is recommended | giTorto |
| D2Refine | A clinical study metadata harmonization and validation workbench to 1) convert restrictions (specified in a spreadsheet like interface) into a clinical model; and 2) provide metadata reconciliation services leveraging the Common Terminology Services 2 (CTS2) | Deepak K. Sharma https://github.com/dksharma |
Reconciliation Services
OpenRefine can connect to several reconciliation services. Please read our wiki for a list of them.
Client Libraries
You can automate some OpenRefine operations using one of the existing libraries. Those libraries are using the OpenRefine API.