You are using an unsupported browser. Please update your browser to the latest version on or before July 31, 2020.
close
You are viewing the article in preview mode. It is not live at the moment.
Home > Revelio Wage & Salary Data > Documentation > Data Dictionary - Revelio Wage and Salary Data
Data Dictionary - Revelio Wage and Salary Data
print icon

Revelio Wage and Salary Base Data

The Revelio Labs wage and salary estimate model has been applied to the RAW dataset to provide a baseline and backfill of salary and wage estimates up to 3/23/23. Use the backfilled_date_time files to base RAW with wage and salary data then append using daily wage and salary files joinable by hash. 

 

FTP file path: /PartnerProducts/SalaryData/backfilled_date_time/
S3/GCP/Azure file path: .../SalaryData/salary_information_acl/backfilled_date_time/
Description: Backfill of wage and salary estimates by hash up to March 23rd, 2023, split across 32 parquet files. Joinable to RAW job records by hash.
File Name Structure: data_#_#_#
Format: .parquet
 
NOTE: It is recommended to use the last version of each salary mapping using the date_time field, however, salary data can be mapped point-in-time if desired.

 

Field Name Data Type Description
hash STRING/VARCHAR The unique identifier for job records. This is used to join wage and salary data to job records.
mean_salary FLOAT The average salary.
lower_bound FLOAT The lower bound of the salary within a confidence range.
upper_bound FLOAT The upper bound of the salary within a confidence range. 
date_time VARCHAR The date and time this hash was mapped to this salary. This value will change if the salary datapoints are updated. | Used to track each time a salary mapping is updated and can be used to find the latest salary value for a hash. | {date}_{hour}_{minute} (GMT time zone)

 

Revelio Wage and Salary Daily Files

 

FTP file path: /PartnerProducts/RevelioLabs/salary_information_acl/YYYY-MM-DD_##_##/
S3/GCP/Azure file path: /salary_information_acl/YYYY-MM-DD_##_##/
Description: Daily wage and salary files, joinable to RAW job records by hash.
Folder Structure: YYYY-MM-DD_##_##/
File Name Structure: data_#_#_#
Format: .parquet
 
NOTE: It is recommended to use the last version of each salary mapping using the date_time field, however, salary data can be mapped point-in-time if desired. 

 

 

Field Name Data Type Description
hash STRING/VARCHAR The unique identifier for job records. This is used to join wage and salary data to job records.
mean_salary FLOAT The average salary.
lower_bound FLOAT The lower bound of the salary within a confidence range.
upper_bound FLOAT The upper bound of the salary within a confidence range.
date_time VARCHAR The date and time this hash was mapped to this salary. Used to track each time a salary mapping is updated and can be used to find the latest salary value for a hash.  | {date}_{hour}_{minute} (GMT time zone)

 

 

Related: Click here to access the Revelio Labs Wage and Salary Data Methodology Guide

Feedback
0 out of 0 found this helpful

scroll to top icon