Abstract: Existing open-vocabulary object detectors typically require a predefined set of categories from users, signifi-cantly confining their application scenarios. In this pa-per, we introduce ...