Population Synthesis: Comparing the Major Techniques Using a Small, Complete Population of Firms (Journal Article)

abstract

  • Recently, disaggregate modeling efforts that rely on microdata have received wide attention from scholars and practitioners. Synthetic population techniques have been devised and are used as a viable alternative to the collection of microdata that normally are inaccessible because of confidentiality concerns or incomplete because of high acquisition costs. The two most widely discussed synthetic techniques are the synthetic reconstruction method (IPFSR), which makes use of iterative proportional fitting (IPF) techniques, and the combinatorial optimization (CO) method. Both methods are described in this article and then evaluated in terms of their ability to recreate a known population of firms, using limited data extracted from the parent population of the firms. Testing a synthetic population against a known population is seldom done, because obtaining an entire population usually is too difficult. The case presented here uses a small, complete population of firms for the City of Hamilton, Ontario, for the year 1990; firm attributes compiled are number of employees, 3‐digit standard industrial classification, and geographic location. Results are summarized for experiments based upon various combinations of sample size and tabulation detail designed to maximize the accuracy of resulting synthetic populations while holding input data costs to a minimum. The output from both methods indicates that increases in sample size and tabulation detail result in higher quality synthetic populations, although the quality of the generated population is more sensitive to increases in tabular detail. Finally, most tests conducted with the created synthetic populations suggest that the CO method is superior to the IPFSR method.
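
The iterative proportional fitting step that the IPFSR method relies on can be illustrated with a minimal two-dimensional sketch. This is not the article's implementation: the function name `ipf`, the parameter names, and the numbers in the seed table and target margins are all hypothetical, chosen only to show how a sample cross-tabulation is rescaled until its row and column totals match known marginal totals.

```python
def ipf(seed, row_targets, col_targets, tol=1e-9, max_iter=1000):
    """Rescale a 2-D seed table until its margins match the target totals.

    Generic textbook IPF, assuming a strictly positive seed and targets
    whose row and column totals sum to the same grand total.
    """
    table = [row[:] for row in seed]  # work on a copy of the seed
    for _ in range(max_iter):
        # Row step: scale each row so that it sums to its target.
        for i, target in enumerate(row_targets):
            total = sum(table[i])
            if total > 0:
                factor = target / total
                table[i] = [cell * factor for cell in table[i]]
        # Column step: scale each column so that it sums to its target.
        for j, target in enumerate(col_targets):
            total = sum(row[j] for row in table)
            if total > 0:
                factor = target / total
                for row in table:
                    row[j] *= factor
        # Columns now match exactly, so only the row error needs checking.
        row_error = max(
            abs(sum(table[i]) - target) for i, target in enumerate(row_targets)
        )
        if row_error < tol:
            break
    return table


# Hypothetical example: rows could stand for employment-size classes and
# columns for industry classes; all values are made up for illustration.
seed = [[1.0, 2.0], [3.0, 4.0]]   # cross-tabulation from a small sample
row_targets = [40.0, 60.0]        # known totals per row category
col_targets = [30.0, 70.0]        # known totals per column category
fitted = ipf(seed, row_targets, col_targets)
```

The fitted table keeps the association between attributes found in the sample (its cross-product ratio) while reproducing the known margins. In a synthetic reconstruction workflow, individual units would then be drawn according to the fitted cell values.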

publication date

  • April 2009