Abstract: As digital archives of newspapers continue to grow, the need for automated methods to extract and organize information from PDF files becomes increasingly critical. This study addresses the ...