Publishers

Overview

The publishers dataset and the Ad Systems dataset represent the parent-level objects within the Sincera platform. A company is classified as either a publisher or an ad system; in rare cases, it is classified as both. The publishers dataset serves two purposes: it is a mapping table for any table that includes a publisher_id, and it is a top-level summary of publisher attributes, such as the consent management platform (CMP) used and the parent entity (which is itself a publisher_id).
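The mapping role described above can be sketched in a few lines of Python. This is a minimal illustration, not Sincera's own tooling: the sample rows, the `observations` table, and the `enrich` helper are all hypothetical, and only the `id`, `name`, `domain`, `parent_entity_id`, and `publisher_id` field names come from this page.

```python
# Hypothetical sample rows; the values are invented for illustration only.
publishers = [
    {"id": 1, "name": "Example Media Group", "domain": "examplemedia.com",
     "parent_entity_id": None},
    {"id": 2, "name": "Example News", "domain": "examplenews.com",
     "parent_entity_id": 1},
]

# Hypothetical rows from some other table that carries a publisher_id column.
observations = [
    {"publisher_id": 2, "value": 7},
    {"publisher_id": 1, "value": 4},
    {"publisher_id": 99, "value": 3},  # no matching publisher row
]


def enrich(observations, publishers):
    """Attach publisher and parent names to each row keyed by publisher_id."""
    by_id = {pub["id"]: pub for pub in publishers}
    enriched = []
    for obs in observations:
        pub = by_id.get(obs["publisher_id"])
        if pub is None:
            # Skip rows whose publisher_id has no match in the mapping table.
            continue
        # parent_entity_id is itself a publisher id, so the same index resolves it.
        parent = by_id.get(pub["parent_entity_id"])
        enriched.append({
            **obs,
            "publisher_name": pub["name"],
            "parent_name": parent["name"] if parent else None,
        })
    return enriched


enriched = enrich(observations, publishers)
for row in enriched:
    print(row)
```

Because the parent entity is stored as another publisher id, a single lookup table is enough to resolve both the publisher and its parent. Unmatched rows are skipped here; a real pipeline might prefer to keep them and flag the gap.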

Dataset

Field Type Description
id integer Primary identifier for the publisher object.
name string Name of the publisher.
domain string Primary domain of the publisher. Should be the root domain, with no subdomains applied.
created_at datetime Date when the publisher was added to the Sincera platform.
updated_at datetime Date when the publisher was last updated by the Sincera platform.
twitter_url integer Identifier used on Twitter for the publisher.
facebook_url integer Identifier used on Facebook for the publisher.
newsapi_source string DEPRECATED: identifier used by newsAPI for the publisher.
image string Filename of publisher logo image jpeg.
policies_count integer Sum of active ads.txt policies found on the publisher.
pub_description text Small blurb of text describing the publisher.
speed_median_load_time integer Median time it takes to load the publisher’s web page.
speed_percentile integer Percentile rank of the publisher’s page speed. Higher percentiles indicate faster pages. For example, the 99th percentile indicates that only 1% of sites are faster.
traffic_rank integer Global rank of the publisher, by traffic, as determined by a derived Tranco ranking set.
pv_per_user float Average number of page views consumed per user, per session.
comscore_rank integer Comscore rank of the publisher.
parent_entity_id integer publisher_id that is the parent company of the publisher. Null if no parent company is known.
ppid_found boolean Indicates whether PPIDs are present on the page.
invalid_asset_count integer Number of assets identified as invalid on page.
pages_count integer Count of the total unique pages the platform has crawled and logged within the system. The count excludes daily crawls of home pages.
accent_color string Derived style-based html color values of publisher image.
publisher_text_assets_count integer Count of text asset extractions found on the publisher.
publisher_image_assets_count integer Count of image asset extractions found on the publisher.
policy_apps_count integer Count of app-ads.txt values found on the publisher.
deleted boolean Flag indicating whether the publisher has been deleted from Sincera.
unique_gpid_count integer Count of unique global placement IDs (GPIDs) on the publisher’s site.
unique_aps_slot_count integer Count of unique APS slots on the publisher (under construction).
avg_page_weight integer The average size, in megabytes, of a publisher’s page / sheet.
accent_color_light string Derived style-based HTML color values of publisher image.
avg_ad_units integer Average number of declared ad units (i.e. slots) the publisher has per page / sheet. This is distinct from average rendered ad units.
slug string Abbreviation of publisher name.
ranked_position integer Publisher rank. Will be set to NULL if outside of top 10K.
categories array List of categories detected by Sincera platform, aligned to IAB framework.
avg_network_calls float Average number of network calls detected on page.
infinite_scroll_count integer Count of pages within the publisher’s domain space with infinite scroll enabled.
avg_cpu float Average CPU utilization detected for the publisher’s domain space.
url_swapping_count integer Number of times URL swapping is detected in the publisher’s domain space.
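
Several fields above can be null (`parent_entity_id`, and `ranked_position` outside the top 10K), so consumers need to handle missing values explicitly. The sketch below models a small subset of the fields as a typed record. The `Publisher` class, the helper functions, and the sample rows are hypothetical and not part of any Sincera client library. Only the field names and the percentile interpretation come from the table. Treating `speed_percentile` as nullable is an assumption made for defensive parsing.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Publisher:
    """Subset of the publishers dataset; nullable fields are Optional."""
    id: int
    name: str
    domain: str
    deleted: bool = False
    parent_entity_id: Optional[int] = None   # null if no parent is known
    ranked_position: Optional[int] = None    # null outside the top 10K
    speed_percentile: Optional[int] = None   # assumed nullable here


def from_row(row: dict) -> Publisher:
    """Build a Publisher from a dict-like row, tolerating missing keys."""
    return Publisher(
        id=int(row["id"]),
        name=row["name"],
        domain=row["domain"],
        deleted=bool(row.get("deleted", False)),
        parent_entity_id=row.get("parent_entity_id"),
        ranked_position=row.get("ranked_position"),
        speed_percentile=row.get("speed_percentile"),
    )


def active_ranked(publishers: List[Publisher]) -> List[Publisher]:
    """Non-deleted publishers that have a ranked_position, in ascending order."""
    ranked = [p for p in publishers
              if not p.deleted and p.ranked_position is not None]
    return sorted(ranked, key=lambda p: p.ranked_position)


def percent_of_sites_faster(publisher: Publisher) -> Optional[int]:
    """Per the speed_percentile description: percentile 99 means 1% are faster."""
    if publisher.speed_percentile is None:
        return None
    return 100 - publisher.speed_percentile


# Hypothetical rows; all values are invented for illustration.
rows = [
    {"id": 1, "name": "Example News", "domain": "examplenews.com",
     "ranked_position": 250, "speed_percentile": 99},
    {"id": 2, "name": "Example Blog", "domain": "exampleblog.com",
     "ranked_position": None, "speed_percentile": 40},
    {"id": 3, "name": "Old Site", "domain": "oldsite.com",
     "deleted": True, "ranked_position": 10},
    {"id": 4, "name": "Example Sports", "domain": "examplesports.com",
     "ranked_position": 75},
]

sample = [from_row(r) for r in rows]
for pub in active_ranked(sample):
    print(pub.name, pub.ranked_position, percent_of_sites_faster(pub))
```

Filtering on `deleted` before ranking keeps removed publishers out of downstream reports, and returning `None` rather than a default number avoids presenting a missing percentile as a real measurement.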
