Publishers
Overview
The publishers dataset, alongside the Ad Systems dataset represent parent-level objects within the Sincera platform. Companies are considered publishers or adsystems. In rare cases, they are considered both. The publishers dataset serves as both a mapping table for any table that includes a publisher_id as well as a top-level summary of attributes of the publisher, such as the CMP used, as well as the parent entity (which is also a publisher_id).
Dataset
| Field | Type | Description |
|---|---|---|
| id | integer | Primary identifier for the for publisher object. |
| name | string | Name of the publisher. |
| domain | string | Primary domain of the publisher. Should be a top-level domain, with no sub-domains applied. |
| created_at | datetime | Date when the publisher was added to the Sincera platform. |
| updated_at | datetime | Date when the publisher was last updated by the Sincera platform. |
| twitter_url | integer | Identifier used on twitter for the publisher. |
| facebook_url | integer | Identifier used on facebook for the publisher. |
| newsapi_source | string | DEPRECATED: identifier used by newsAPI for the publisher. |
| image | string | Filename of publisher logo image jpeg. |
| policies_count | integer | Sum of active ads.txt policies found on the publisher. |
| pub_description | text | Small blurb of text describing the publisher. |
| speed_median_load_time | integer | The median of time it takes to load the publisher’s web page. |
| speed_percentile | integer | The percentile the publisher’s page speed is in. Higher percentiles indicate faster pages. Ex. 99 percentile indicates that only 1% of sites are faster. |
| traffic_rank | integer | Global rank of the platform, by traffic, as determined by a derived Tranco ranking set. |
| pv_per_user | float | Average number of page views consumed per user, per session. |
| comscore_rank | integer | Comscore rank of the publisher. |
| parent_entity_id | integer | publisher_id that is the parent company of the publisher. Null if no parent company is known. |
| ppid_found | boolean | Indicates whether PPIDs are present on page. Null if no parent company is known. |
| invalid_asset_count | integer | Number of assets identified as invalid on page. |
| pages_count | integer | Count of the total unique pages the platform has crawled and logged within the system. The count excludes daily crawls of home pages. |
| accent_color | string | Derived style-based html color values of publisher image. |
| publisher_text_assets_count | integer | Count of text asset extractions found on the publisher. |
| publisher_image_assets_count | integer | count of image asset extractions found on the publisher. |
| policy_apps_count | integer | Count of app-ads.txt values found on the publisher. |
| deleted | boolean | flag to indicate whether or not the publisher has been deleted from Sincera. |
| unique_gpid_count | integer | count of the total number of unique global placement ids on the publisher’s site |
| unique_aps_slot_count | integer | count of the total number of unique APS-slots on the publisher (under construction) |
| avg_page_weight | integer | The average size, in megabytes, of a publisher’s page / sheet. |
| accent_color_light | string | derived style-based html color values of publisher image. |
| avg_ad_units | integer | Average no. of declared ad units (i.e. slots) the publisher has per page / sheet. This is distinct from average rendered ad units. |
| slug | string | Abbreviation of publisher name. |
| ranked_position | integer | Publisher rank. Will be set to NULL if outside of top 10K. |
| categories | array | List of categories detected by Sincera platform, aligned to IAB framework. |
| avg_network_calls | float | Average number of network calls detected on page. |
| infinite_scroll_count | integer | Indicates count of pages within publisher’s domain space with infinite scroll enabled. |
| avg_cpu | float | Indicates average CPU utilization detected for a publisher’s domain space. |
| url_swapping_count | integer | Count of times where URL swapping is detected in a publisher’s domain space. |
Table of contents
- Ads to Content Ratio (A2CR)
- Crawl Profile
- Pages
- Publisher Domains
- Publisher Trends
- Policies (ads.txt)
- Policy Apps (app-ads.txt)
- Publisher Affiliate Providers
- Publisher Direct Adservers
- Publisher Direct Analytics Providers
- Publisher Direct Auth Providers
- Publisher Direct Survey Providers
- Publisher Javascripts
- Publisher Video Placements
- Resold Subdomains
- Variable Ad Load (VAL)