1. Census Digitization Project (with Raj Arunachalam, Alexander Persaud and Ajay Shenoy): The project digitizes the Census of India and creates a long-run district panel starting from 1871 until 2011. By obtaining archival material on Census volumes, and converting them to digital format we seek to create a rich and unique dataset of the longest Census panel outside the Western hemisphere.

 

2. Pathways to Adjustment Project (with John Bound, Breno Braga and Sarah Turner): The project studies the immigration of high-skilled workers and the impact on the US labor market, including the route via the US educational system, and the rising concentration of H1-B visas in the IT sector.

 

3. Income Dynamics Lab (with Raj Arunachalam, Alexander Persaud and Ajay Shenoy): The project creates long-run panels of households using nationally-representative household surveys in India, in order to study long-term consequences and income-dynamics at the household level.

 

4. Non-Secondary Data Collection for India:

a. Terrorism Data (with Laura Zimmerman): A hand-coded dataset of all Maoist related incidents from 2005 to 2009 in India, including number of fatalities, injuries, arrests, captures, and who initiates the attacks on what targets.

 

b. GIS data for India: Including satellite night-time data, roads, highways, rivers, elevation, rail-lines, and other geographic features. This data is at the state, district and sub-district level (and includes information on Census tehsils). Tehsil level data is available only through private companies at a high-cost.

 

c. Sub-district level Data: Consists of (a) schooling data for the entire country (not freely available), and (b) Primary Census Abstracts from 1991 to 2011.

 

d. District-level Data: Including District GDP (not freely available), and other features compiled from household surveys and Census tables.

 

e. Caste-level Affirmative Action dataset: Laws by state on which sub-castes are eligible for affirmative action policies