A free, open source, powerful tool for working with messy data
OpenRefine (formerly Google Refine) is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.
Here we have some videos that focus on joining two or more data sets and perhaps even from different data sources and even VLOOKUPs kind of operations to add more meaning to the current data.
Some other references I like :
- http://enipedia.tudelft.nl/wiki/OpenRefine_Tutorial
- http://schoolofdata.org/handbook/recipes/cleaning-data-with-refine/
- http://openrefine.org/
- https://docs.google.com/presentation/d/1YkArEiaws0dMcyFZEppg4eZ7CxvqCTckjY78ao93zIw/edit#slide=id.p
- https://github.com/OpenRefine/OpenRefine/wiki/Fetching-URLs-From-Web-Services
- https://www.youtube.com/watch?v=xtm6y8yB-Ho