Title: | Restaurant Data for Entity Resolution |
---|---|
Description: | Duplicated restaurant data (pre-processed and formatted) for entity resolution. This package contains formatted data from a data set that contains information about different restaurants, with the Zagats portion containing 331 records and the Fodors portion containing 533 records. The following variables are included in the data set: id, name, address, city, phone, type. The data set has a respective gold data set that provides information on which records match based on id. |
Authors: | Rebecca Steorts [aut, cre], Andee Kaplan [aut], Srini Sunil [aut] |
Maintainer: | Rebecca Steorts <[email protected]> |
License: | CC0 |
Version: | 0.1.0 |
Built: | 2025-01-29 04:52:53 UTC |
Source: | https://github.com/cleanzr/restaurant |
This provides a record linkage data set with information about different restaurants.
restaurant
restaurant
A data frame with 16 variables: id
, name
, address
, city
, phone
, type
, class
This data set includes 533 restaurants from the Fodors database and 331 records from the Zagat database. It is appropriate for performing various types of record linkage and can be assessed by standard record linkage methods.
head(restaurant) dim(restaurant)
head(restaurant) dim(restaurant)
This data set includes the matched record pairs based on ID.
restaurant_gold
restaurant_gold
A data frame with 3 variables: class
, id1
, id2
This data set includes the matched record pairs based on ID from the Zagats and Fodors data sets. This data set can be used to evaluate the performance of record linkage methods performed on the Zagats and Fodors data sets.
head(restaurant_gold) dim(restaurant_gold)
head(restaurant_gold) dim(restaurant_gold)