Title: Identification and estimation of network statistics with missing link data
Authors: Matthew Thirkettle - Rice University (United States) [presenting]
Abstract: Informative bounds are obtained on network statistics in a partially observed network whose formation is explicitly modeled. Partially observed networks are commonplace due to, for example, partial sampling or incomplete responses in surveys. Network statistics (e.g., centrality measures) are not point identified when the network is partially observed. Worst-case bounds on network statistics can be obtained by letting all missing links take values zero and one. We dramatically improve on the worst-case bounds by specifying a structural model for network formation. An important feature of the model is that we allow for positive externalities in the network-formation process. The network-formation model and network statistics are set identified due to multiplicity of equilibria. We provide a computationally tractable outer approximation of the joint identified region for preferences determining network-formation processes and network statistics. In a simulation study on Katz-Bonacich centrality, we find that worst-case bounds that do not use the network formation model are 44 times wider than the bounds we obtain from my procedure.