This document was produced using Claude, an AI assistant by Anthropic. Content should be reviewed for accuracy.
Last Updated: March 24, 2026
| # | Dataset | File(s) | Rows | Columns | Time Span | Grain |
|---|---|---|---|---|---|---|
| 1 | NYC 311 Service Requests | NYC311-2010-to-20260301.csv (25 GB) |
~42.96M | 44 | 2010 – Mar 2026 | Individual complaint |
| 2 | Housing Database (DCP) | HousingDB_post2010.csv (46 MB) |
~81.7K | 63 | Post-2010 jobs | DOB job application |
| 3 | PLUTO | Primary_Land_Use_Tax_Lot_Output_(PLUTO)_20260206.csv (411 MB) |
~858K | 101 | Snapshot (Feb 2026) | Tax lot (BBL) |
| 4 | Furman Center SHD — BBL Analysis | FC_SHD_bbl_analysis_2025-05-13.csv (5.6 MB) |
~13.9K | 126 | As of May 2025 | Subsidized tax lot |
| 5 | Furman Center SHD — Subsidy Analysis | FC_SHD_subsidy_analysis_2025-05-13.csv (6.1 MB) |
~14.1K | 124 | As of May 2025 | Individual subsidy record |
| 6 | StreetEasy Master Report | 44 zip archives in StreetEasy_Master_Report/ |
~198 rows each | ~195 month-columns | Jan 2010 – Feb 2026 | Area × month (wide format) |
| 7 | BLS Economic Indicators | BLS_NYC_2010_2026.csv, BLS_NYC_Annual_2010_2026.csv |
~2K monthly / 17 annual | 6–8 | 2010 – 2026 | Month or year × series |
| 8 | Census ACS 1-Year Estimates | Census_ACS_NYC_Boroughs_2010_2023.csv |
~70 | 8 | 2010 – 2023 | Borough × year |
| 9 | NYCHA Physical Needs Assessment | NYCHA_PNA/NYCHA 2025 PNA Development Results 5-16-2025.xlsx (10,671 rows), NYCHA_PNA/NYCHA 2023 PNA Development Results 7-1-2023.xlsx (11,783 rows) |
~22K combined | 6 | 2023 & 2025 assessments | Development × work type |
Data Dictionaries (xlsx): Housing_Database_Data_Dictionary.xlsx, FC_SHD_bbl_analysis_data_dictionary_2025-05-13.xlsx, FC_SHD_subsidy_analysis_data_dictionary_2025-05-13.xlsx
NYCHA PNA Reports (PDF): 2023-PNA-Report-Physical-Needs-Assessment-NYCHA.pdf (188 pages, STV/AECOM), PNA 2017.pdf, transparency-pna-2011.pdf, Physical Needs Assessment FAQ.pdf
The fundamental spatial unit across most datasets. A Borough-Block-Lot (BBL) is a 10-digit identifier (1-digit borough + 5-digit block + 4-digit lot) assigned by the NYC Department of Finance. Present in: PLUTO (BBL), HousingDB (BBL), Furman Center BBL (bbl), 311 (BBL).
A Building Identification Number uniquely identifies a structure. Present in: HousingDB (BIN). A single BBL may contain multiple BINs.
A permit application filed with the Department of Buildings for new construction, alteration, or demolition. The primary entity in HousingDB, keyed by Job_Number.
An individual complaint or service request filed through the NYC 311 system, keyed by Unique Key. Contains complaint type, location (address, BBL, lat/lon), timestamps, and resolution info.
A residential property receiving one or more government subsidies. The Furman Center BBL file is at the property level; the Subsidy file has one row per subsidy-property pair (a single BBL may have multiple active subsidies).
A named geographic area (neighborhood, submarket, borough, or citywide) for which StreetEasy reports monthly market metrics. Areas are identified by areaName, Borough, and areaType.
A time series from the Bureau of Labor Statistics identified by a series ID (e.g., CUURS12ASA0L2 for NYC CPI-Shelter). Queried via BLS Public Data API v2 at monthly or annual granularity.
An American Community Survey 1-Year Estimate for a specific geography (NYC borough = county FIPS) and variable (e.g., B25071_001E for median gross rent as % of income). Retrieved via Census Data API.
A NYCHA housing development as assessed in the Physical Needs Assessment. Identified by DEVELOPMENT name. Each development has multiple rows in the PNA Excel files — one per work type combination. Unit count (NUMBER OF UNITS) is constant per development and repeats across work type rows.
Sample agencies: HPD, DOB, NYPD, DEP, DSNY, DOT, DOHMH, DPR, DCA, DHS, DFTA, TLC, EDC Sample complaint types (housing-relevant): HEAT/HOT WATER, DOOR/WINDOW, ELECTRIC, ELEVATOR, FLOORING/STAIRS, GENERAL, Boilers, Building/Use, General Construction/Plumbing, Asbestos, Lead, Noise, Dirty Conditions
Same structure as BBL Analysis but keyed by fc_subsidy_id (one row per subsidy). Adds: agency_name, subsidy_name, sub_subsidy_name, project_name, start_date, end_date, tenure, preservation, program, reac_score, reac_date, tot_bbls, tot_buildings, tot_units
Sales Market Metrics (by property type: All, Condo, Coop, Sfr):
Rental Market Metrics (by bedroom count: All, Studio, OneBd, TwoBd, ThreePlusBd):
Structure: Wide-format time series. Columns: areaName, Borough, areaType, then one column per month (YYYY-MM). Index files (priceIndex, rentalIndex) have columns: month, Brooklyn, Manhattan, NYC, Queens.
Geographic levels (areaType): city, borough, submarket, neighborhood (~198 areas) Boroughs: Manhattan, Brooklyn, Queens, Bronx, Staten Island
┌──────────────┐
│ BBL (10d) │ ← Primary spatial join key
└──────┬───────┘
┌───────────────┼───────────────────┐
│ │ │
┌─────▼─────┐ ┌────▼─────┐ ┌────────▼────────┐
│ PLUTO │ │ HousingDB│ │ Furman Ctr (BBL) │
│ 858K lots │ │ 81K jobs │ │ 13.9K subsidized│
└─────┬─────┘ └────┬─────┘ └────────┬────────┘
│ │ │
│ BIN ─┘ fc_subsidy_id
│ │
│ ┌────────▼────────┐
│ │ Furman Ctr (Sub) │
│ │ 14.1K subsidies │
│ └─────────────────┘
│
┌─────▼──────────┐
│ NYC 311 │
│ 43M complaints │ ← joins on BBL (28% populated),
│ │ or Incident Zip / Borough /
└────────────────┘ Community Board / Council Dist
StreetEasy does not share BBL. Linkage to lot-level data requires geographic bridging:
| StreetEasy areaType | Bridge to lot-level data via |
|---|---|
| borough | PLUTO borocode, 311 Borough |
| neighborhood | Manual or fuzzy mapping to NTA / community district names |
| submarket | Manual mapping (StreetEasy-proprietary groupings) |
| city | N/A (citywide) |
BLS, Census, and NYCHA PNA link to other datasets at the borough or citywide level only (no BBL-level joins). NYCHA development names can be cross-referenced with Furman Center properties via the data_nycha and prog_nycha_ph flags in the BBL analysis file.
Cross-dataset join keys summary:
| Key | PLUTO | HousingDB | FC-BBL | FC-Sub | 311 | StreetEasy | BLS | Census | NYCHA PNA |
|---|---|---|---|---|---|---|---|---|---|
| BBL | ✅ PK | ✅ | ✅ PK | ✅ (ref_bbl) | ✅ (partial) | — | — | — | — |
| BIN | — | ✅ | — | — | — | — | — | — | — |
| Borough | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | — | ✅ (FIPS) | — |
| Community District | ✅ | ✅ | ✅ | ✅ | ✅ | — | — | — | — |
| Council District | ✅ | ✅ | ✅ | ✅ | ✅ | — | — | — | — |
| Census Tract | ✅ | ✅ | ✅ | ✅ | — | — | — | ✅ | — |
| NTA | — | ✅ | — | — | — | ~ (neighborhood) | — | — | — |
| Zip Code | ✅ | — | — | — | ✅ | — | — | — | — |
| Lat/Lon | ✅ | ✅ | ✅ | — | ✅ | — | — | — | — |
| Metro area | — | — | — | — | — | — | ✅ (NYC MSA) | — | — |
| Development name | — | — | ~ (via NYCHA flags) | — | — | — | — | — | ✅ PK |
| Dataset | Start | End | Granularity |
|---|---|---|---|
| NYC 311 | 2010 | Mar 2026 | Daily (Created/Closed dates) |
| HousingDB | Post-2010 | Current vintage | Event dates (Filed, Permit, Complete) |
| PLUTO | Snapshot | Feb 2026 | Point-in-time |
| Furman Center | Varies by subsidy | May 2025 vintage | Subsidy start/end dates |
| StreetEasy | Jan 2010 | Feb 2026 | Monthly |
| BLS | Jan 2010 | Feb 2026 | Monthly |
| Census ACS | 2010 | 2023 | Annual (1-Year Estimates) |
| NYCHA PNA (2023) | Assessment date | Jul 2023 | Development × work type |
| NYCHA PNA (2025) | Assessment date | May 2025 | Development × work type |
Primary source: HousingDB Key measures: ClassANet (net new units), Job_Type, Job_Status, CompltYear, PermitYear Supports: Tracking new construction, alterations, demolitions; net unit change by geography and year
Primary source: PLUTO Key measures: landuse, bldgclass, zonedist, builtfar vs residfar/commfar, lotarea, bldgarea, unitsres Supports: Zoning analysis, underbuilt lot identification, land use classification
Primary sources: Furman Center BBL + Subsidy Key measures: Subsidy programs (28 types), start/end dates, res_units, income targeting, REAC scores, tenure, preservation vs. new construction Supports: Subsidy expiration risk analysis, portfolio composition, geographic concentration, physical condition (REAC)
Primary source: NYC 311 Key measures: Complaint type (HEAT/HOT WATER, etc.), volume trends, resolution time, geographic clustering Supports: Identifying problem buildings/areas, correlating complaints with subsidized housing, seasonal patterns
Primary source: StreetEasy Key measures: Median prices/rents, inventory levels, days on market, price cuts, sale/list ratios, price/rental indices Supports: Price trend analysis by neighborhood/borough, market tightness indicators, affordability benchmarking Analysis completed: Borough-level annual averages for 12 metrics (2010–2026), bedroom-level rent analysis, rental index trends
Primary source: PLUTO Key measures: assessland, assesstot, exempttot, yearbuilt, ownername, ownertype Supports: Tax base analysis, age of housing stock, ownership patterns
Primary sources: PLUTO (firm07_flag, pfirm15_flag), HousingDB (PL_FIRM07, PL_PFIRM15) Supports: Identifying housing in flood zones, climate risk overlay with subsidized housing and public land
Primary sources: BLS (CPI, wages, employment, unemployment), Census ACS (income, rent burden, poverty, tenure) Key measures: CPI-Shelter vs general CPI, real wage growth, rent burden % by borough, median household income trends Supports: Validating rent burden claims against federal data, contextualizing housing costs within macroeconomic trends Analysis completed: Shelter inflation premium (54.0% since 2010 vs wages +43.9%), borough-level rent burden validation (Bronx 75.9%)
Primary source: PLUTO (ownertype=’C’) Key measures: Developable parcels (non-park, residentially zoned), unused FAR capacity, potential housing units (unused_FAR × lotarea / 800 sqft) Supports: Identifying city-owned land for “Public Land for Public Good” development initiative Analysis completed: 6,632 developable parcels, 453,910 potential units, top 50 opportunities ranked, flood zone cross-reference by borough
Primary sources: NYCHA PNA (2023, 2025), Furman Center (NYCHA flags) Key measures: 5-year and 20-year capital needs by building system, per-unit costs, development-level prioritization, 2023→2025 trend Supports: Capital budget planning, system prioritization (apartments/heating/exteriors = 76% of need), identifying accelerating deterioration Analysis completed: $61.6B 5-year / $78.6B 20-year needs across 14 work types, +8.0% per-unit cost growth, top 15 developments identified
NYC (city)
└── Borough (5)
└── Community District (~59)
└── Neighborhood / NTA (~262)
└── Census Tract (~2,168)
└── Census Block
└── Tax Lot (BBL, ~858K)
Additional administrative overlays: Council District, Police Precinct, School District, Fire Company, Sanitation District, Health Area, Zip Code.
StreetEasy uses its own geography: city → borough → submarket → neighborhood (~198 areas total).
By category:
Data sources: DOF, HCR-LIHTC, HPD, HUD Contracts, HUD Financing, HUD-LIHTC, Mitchell-Lama, NYCHA
Job_Status field (and the noted Job_Inactv flag) should be used to filter to active/completed projects for pipeline analysis.lotarea column contains comma-formatted strings (e.g., “6,660”) that cause Polars SQL CAST to fail. Workaround: Python extraction with manual s.replace(',','') cleanup, or qsv processing.end_421a field uses UInt32 format YYYYMMDD (e.g., 20260101). To extract expiration year: end_421a / 10000. 100% of 421-a properties had populated start/end fields.invalid utf-8 sequence errors). The BBL analysis file is the more reliable source for property-level analysis.ownername LIKE '%PARKS AND RECREATION%' and landuse='09', then require residfar > 0 for residential development capacity.This project produced two categories of output from the datasets above:
Narrative analysis: Mamdani_Policy_Analysis_EXPANDED.md (~2,600 lines) — comprehensive policy brief covering all six analytical domains with data tables, findings, policy recommendations, risk scenarios, and methodological notes.
Interactive dashboards (8 HTML files):
| Dashboard | Primary Data Sources | Key Visualizations |
|---|---|---|
Explorer_Master_Dashboard.html |
All sources | 7-tab hub with borough/time filters |
Explorer_A_Supply_Pipeline.html |
HousingDB | Permit trends, completions lag, borough production |
Explorer_B_Affordability.html |
StreetEasy, Census ACS, BLS | Rent trends, income gap, burden, bedroom analysis |
Explorer_C_Subsidized_Housing.html |
Furman Center | Subsidy portfolio, program breakdown, expiration risk |
Explorer_D_Conditions.html |
311 (HPD filtered) | Complaint volume, seasonal patterns, borough trends |
Explorer_421a_Timeline.html |
Furman Center BBL | Bimodal expiration 2026–2060, borough deep dive |
Explorer_E_Public_Land.html |
PLUTO | Developable parcels, potential units, flood zones |
Explorer_NYCHA_Capital.html |
NYCHA PNA (2023, 2025) | Work type breakdown, top developments, 2023→2025 trend |
This document was produced using Claude, an AI assistant by Anthropic. Content should be reviewed for accuracy.