Scrape + score Vergabemarktplatz Brandenburg (portal connector)
Source: R/check_tenders.R
vmp_bb_tenders.Rd
The VMP-BB connector for screen_portals() / screen_all_portals(): a thin
wrapper around cosinex_tenders() pinned to Vergabemarktplatz Brandenburg. It
opens a chromote session, optionally logs in, scrapes tenders, scores them
(score_relevance()), enriches via the detail and (optional) notice layers,
applies the title exclusions (apply_title_excludes()) and tags
Plattform = "Vergabemarktplatz Brandenburg". Returns the scored tibble (it writes no
report); the detail/notice screening caches are read/written under cache_dir.
Usage
vmp_bb_tenders(
  keywords = tender_keywords(),
  login = FALSE,
  max_pages = Inf,
  since_days = NULL,
  publication_types = c("ExAnte", "Tender"),
  contracting_rules = "VOL",
  screen_details = TRUE,
  max_detail = Inf,
  screen_notice = FALSE,
  max_notice = Inf,
  username = Sys.getenv("VMP_BB_USERNAME"),
  password = Sys.getenv("VMP_BB_PASSWORD"),
  cache_dir = "reports",
  relevant_only = FALSE,
  headless = TRUE
)
Arguments
- keywords
Keyword list for relevance scoring (default tender_keywords()).
- login
Log in before scraping (default FALSE; the search is public).
- max_pages
Maximum number of result pages to scrape (default Inf).
- since_days
If set, stop scraping pages older than this many days (results are newest-first); NULL (default) scrapes up to max_pages.
- publication_types, contracting_rules
Search filter passed to vmp_bb_scrape_tenders().
- screen_details
Detail-page layer (default TRUE; see enrich_with_details()).
- max_detail
Maximum number of detail pages to screen (default Inf).
- screen_notice
Notice-PDF layer (default FALSE; forces login = TRUE; see enrich_with_notice()).
- max_notice
Maximum number of new notice PDFs to read (default Inf).
- username, password
Credentials when login = TRUE (default env vars VMP_BB_USERNAME / VMP_BB_PASSWORD).
- cache_dir
Directory for the detail/notice caches (default "reports").
- relevant_only
Return only relevant tenders (default FALSE; the combined multi-portal run in screen_all_portals() sets this TRUE).
- headless
Run chromote headless (default TRUE).
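The interaction between since_days and max_pages can be illustrated with a small stand-alone sketch. It is not the package's implementation: pages_to_visit() and its page_dates and today arguments are hypothetical names introduced here. It also assumes a page counts as "too old" once even its newest entry predates the cutoff, which the documentation above does not spell out.

```r
# Hypothetical helper, NOT part of the package: estimates how many result
# pages a newest-first scrape would visit.
# `page_dates` is a list holding one Date vector per result page.
pages_to_visit <- function(page_dates, since_days = NULL, max_pages = Inf,
                           today = Sys.Date()) {
  n_max <- min(length(page_dates), max_pages)
  if (n_max == 0) return(0L)

  # since_days = NULL: no age cutoff, only max_pages limits the scrape.
  if (is.null(since_days)) return(as.integer(n_max))

  cutoff <- today - since_days
  visited <- 0L
  for (i in seq_len(n_max)) {
    # Assumption: stop as soon as the newest entry on a page is older than
    # the cutoff; with newest-first ordering all later pages are older still.
    if (max(page_dates[[i]]) < cutoff) break
    visited <- visited + 1L
  }
  visited
}

# Simulated publication dates for three result pages (made-up data).
today <- as.Date("2024-06-30")
pages <- list(
  as.Date(c("2024-06-29", "2024-06-27")),
  as.Date(c("2024-06-20", "2024-06-12")),
  as.Date(c("2024-05-30", "2024-05-02"))
)

n_default <- pages_to_visit(pages, today = today)
n_recent  <- pages_to_visit(pages, since_days = 14, today = today)
n_capped  <- pages_to_visit(pages, max_pages = 1, today = today)
```

Under the same assumption, a call such as vmp_bb_tenders(since_days = 14, max_pages = 5) would stop at whichever limit is reached first: the age cutoff or the page cap.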