Generating and Evaluating Sustainable Procurement Criteria for the Swiss Public Sector using In-Context Prompting with Large Language Models

📅 2026-03-23
📈 Citations: 0
Influential: 0
🤖 AI Summary
This work proposes a configurable large language model (LLM)-assisted pipeline that translates high-level sustainability regulations into sector-specific, verifiable public procurement criteria, a process that traditionally depends on manual effort and expert knowledge. By combining in-context prompting with structured policy documents, the approach supports auditable criteria generation across procurement sectors in line with Swiss regulatory requirements. The framework includes automated output validation and an LLM-based quality assessment component. In the reported evaluation, which combines automated checks with expert comparison against a manually curated gold standard, the generated criteria catalogs are consistent with official guidelines, and the pipeline substantially reduces manual drafting effort.

📝 Abstract
Public procurement refers to the process by which public sector institutions, such as governments, municipalities, and publicly funded bodies, acquire goods and services. Swiss law requires the integration of ecological, social, and economic sustainability requirements into tender evaluations in the form of criteria that bidders must fulfill. However, translating high-level sustainability regulations into concrete, verifiable, and sector-specific procurement criteria (such as selection criteria, award criteria, and technical specifications) remains a labor-intensive and error-prone manual task that requires substantial domain expertise across several groups of goods and services. This paper presents a configurable, LLM-assisted pipeline, implemented as software, that supports the systematic generation and evaluation of sustainability-oriented procurement criteria catalogs for Switzerland. The system integrates in-context prompting, interchangeable LLM backends, and automated output validation to enable auditable criteria generation across different procurement sectors. As a proof of concept, we instantiate the pipeline using official sustainability guidelines published by the Swiss government and the European Commission, which are ingested as structured reference documents. We evaluate the system through a combination of automated quality checks, including an LLM-based evaluation component, and expert comparison against a manually curated gold standard. Our results demonstrate that the proposed pipeline can substantially reduce manual drafting effort while producing criteria catalogs that are consistent with official guidelines. We further discuss system limitations, failure modes, and design trade-offs observed during deployment, highlighting key considerations for integrating generative AI into public sector software workflows.
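The three components named in the abstract (in-context prompting, interchangeable LLM backends, automated output validation) could fit together roughly as in the sketch below. This is not the authors' implementation. Every name in it (LLMBackend, CannedBackend, build_prompt, validate, generate_criteria) is hypothetical, and the JSON output schema with the keys type, text, and verification is an assumption made for illustration. Only the three criterion types come from the abstract.

```python
import json
from dataclasses import dataclass
from typing import Protocol


class LLMBackend(Protocol):
    """Hypothetical interface: any backend that maps a prompt to text."""

    def complete(self, prompt: str) -> str: ...


@dataclass
class CannedBackend:
    """Offline stand-in for a real model, so the sketch runs without network access."""

    canned: str

    def complete(self, prompt: str) -> str:
        return self.canned


# Assumed output schema; the paper's actual schema is not given in the abstract.
REQUIRED_FIELDS = ("type", "text", "verification")
# Criterion types named in the abstract.
ALLOWED_TYPES = {"selection", "award", "technical_specification"}


def build_prompt(sector: str, guideline_excerpts: list[str]) -> str:
    """In-context prompting: place reference excerpts ahead of the task."""
    context = "\n\n".join(
        f"[{i + 1}] {excerpt}" for i, excerpt in enumerate(guideline_excerpts)
    )
    return (
        "Reference guidelines:\n" + context + "\n\n"
        f"Draft sustainability procurement criteria for the sector: {sector}.\n"
        "Return a JSON list of objects with keys: type, text, verification."
    )


def validate(raw: str) -> tuple[list[dict], list[str]]:
    """Automated output validation: keep well-formed criteria, report the rest."""
    try:
        items = json.loads(raw)
    except json.JSONDecodeError as exc:
        return [], [f"invalid JSON: {exc.msg}"]
    if not isinstance(items, list):
        return [], ["top-level value is not a list"]

    valid, errors = [], []
    for idx, item in enumerate(items):
        if not isinstance(item, dict):
            errors.append(f"item {idx}: not an object")
            continue
        missing = [
            field for field in REQUIRED_FIELDS
            if not isinstance(item.get(field), str) or not item[field].strip()
        ]
        if missing:
            errors.append(f"item {idx}: missing or empty {missing}")
        elif item["type"] not in ALLOWED_TYPES:
            errors.append(f"item {idx}: unknown type {item['type']!r}")
        else:
            valid.append(item)
    return valid, errors


def generate_criteria(
    backend: LLMBackend, sector: str, guideline_excerpts: list[str]
) -> tuple[list[dict], list[str]]:
    """Run one generation step with whichever backend is plugged in."""
    return validate(backend.complete(build_prompt(sector, guideline_excerpts)))
```

Because generate_criteria depends only on the complete method, swapping one model for another means passing a different backend object. The error list returned next to the valid criteria gives a simple audit trail of what was rejected and why.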
Problem

Research questions and friction points this paper is trying to address.

sustainable procurement
public sector
procurement criteria
large language models
in-context prompting
Innovation

Methods, ideas, or system contributions that make the work stand out.

in-context prompting
large language models
sustainable procurement
automated criteria generation
public sector AI
Yingqiang Gao
University of Zurich
Veton Matoshi
Bern University of Applied Sciences
Luca Rolshoven
Bern University of Applied Sciences, University of Bern
Tilia Ellendorff
University of Zurich
Judith Binder
Bern University of Applied Sciences
Jeremy Austin Jann
Bern University of Applied Sciences
Gerold Schneider
University of Zurich
Matthias Stürmer
Head of Institute for Public Sector Transformation at Bern University of Applied Sciences
Digital Sustainability, Open Source Software, Open Data, Open Government, Natural Language Processing