Package 'restaurant'

Title: Restaurant Data for Entity Resolution
Description: Duplicated restaurant data (pre-processed and formatted) for entity resolution. This package contains formatted data from a data set that contains information about different restaurants, with the Zagats portion containing 331 records and the Fodors portion containing 533 records. The following variables are included in the data set: id, name, address, city, phone, type. The data set has a respective gold data set that provides information on which records match based on id.
Authors: Rebecca Steorts [aut, cre], Andee Kaplan [aut], Srini Sunil [aut]
Maintainer: Rebecca Steorts <[email protected]>
License: CC0
Version: 0.1.0
Built: 2025-01-29 04:52:53 UTC
Source: https://github.com/cleanzr/restaurant

Help Index


Restaurants data set

Description

This provides a record linkage data set with information about different restaurants.

Usage

restaurant

Format

A data frame with 16 variables: id, name, address, city, phone, type, class

This data set includes 533 restaurants from the Fodors database and 331 records from the Zagat database. It is appropriate for performing various types of record linkage and can be assessed by standard record linkage methods.

Examples

head(restaurant)
dim(restaurant)

Restaurants Gold

Description

This data set includes the matched record pairs based on ID.

Usage

restaurant_gold

Format

A data frame with 3 variables: class, id1, id2

This data set includes the matched record pairs based on ID from the Zagats and Fodors data sets. This data set can be used to evaluate the performance of record linkage methods performed on the Zagats and Fodors data sets.

Examples

head(restaurant_gold)
dim(restaurant_gold)