MSD: A Benchmark Dataset for Floor Plan Generation of Building Complexes

1 Delft University of Technology
2 Sungkyunkwan University
3 Indepedent Researcher

European Conference on Computer Vision (ECCV), 2024

*Main author — c.c.j.vanengelenburg@tudelft.nl
examples-msd-grid

A realistic floor plan dataset — In this article, we propose and benchmark a new large-scale floor plan dataset, which we call [M]odified [S]wiss [D]wellings. MSD exceeds other floor plan dataset, such as RPLAN and LIFULL, in terms of complexity and better represents what could be found in the real-world.

Current models are not powerful enough — Most interestingly is that today's models that are designed for floor plan generation – mostly specialized neural networks – seem not robust and/or smart enough to tackle the complexity of MSD. Exciting new research in floor plan generation as well as understanding lies ahead.

Multimodality and coding guidelines — The floor plans come in various linked modalities: image, geometry, and graph. The main data container is the graph (`networkx.Graph()` or `torch_geometric.data.Data()`) on-top-of which the room shapes and types (as node-level attributes), the connectivity types (as edge-level attributes) and the full image (as graph-level attribute) are modelled. Additionally, we prepared clear coding guidelines on how to effectively load and visualize the floor plans and their corresponding access graphs.

MSD Dataset

MSD is an ML-ready dataset for floor plan generation and analysis at building-level scale. The MSD dataset is derived from the Swiss Dwellings database (v3.0.0). The MSD dataset contains 5372 highly-detailed floor plans of single- as well as multi-unit building complexes across Switzerland, hence extending the building scale w.r.t. of other well-known floor plan datasets. Some highlights:

  • Origin — MSD is the first European large-scale floor plan dataset originating from Switzerland;
  • Realism — MSD is the first large-scale floor plan dataset that contains information on the connectivity between the apartments; in addition, compass orientation is included;
  • Complexity — MSD exceeds other datasets in the the average number of corners per room, rooms per apartment, and apartments per floor; furthermore, MSD has the largest share of irregularly-shaped rooms;
  • Multimodality — MSD contains floor plan raster images, vectorized formats, and corresponding access graphs;
  • Diversity — MSD has the largest diversity of access graphs.

Models

We developed two baseline models to benchmark MSD: a diffusion- and segmentation-based. The former is built on-top-of HouseDiffusion (HD) — a state-of-the-art model for floor plan generation. The latter combines a U-Net and graph convolutional network. Details of the models can be found in the figures below.

Results

Overall, the floor plans often look infeasible. We could, however, train MHD on RPLAN successfully (see suppl. mat.); hence, we believe that the poor results do not come from improper training. Instead, we attribute the somewhat poor results to the more complex benchmark we set: more complex graphs; more irregularly shaped rooms; unit connectivity; no axis alignment; etc.

results_viz

Example generations of MHD and UN — Columns 1 and 2 show the inputs: the zone graph and building structure respectively. Columns 3 - 6 show the floor plans generated by the MHD variants. Columns 7 - 9 show the floor plans generated by the UN variants. Column 10 shows the ground truth.

Paper (incl. suppl. mat.)

BibTeX

@article{vanengelenburg2024msd,
        title={MSD: A Benchmark Dataset for Floor Plan Generation of Building Complexes},
        author={van Engelenburg, Casper and Mostafavi, Fatemeh and Kuhn,
                   Emanuel and Jeon, Yuntae and Franzen, Michael and Standfest,
                   Matthias and van Gemert, Jan and Khademi, Seyran},
        journal={arXiv preprint arXiv:2407.10121},
        year={2024}
    }