I'm currently exporting my HubSpot data to a RedShift datalake using Segment.com. The result is that my HubSpot data is available across several tables, however some critical pieces of information -- noteably a contact's "origional source", Page View, and Session traffic -- is missing.
Is this data available using HubSpots APIs? From some other searches, it appears that HubSpot doesn't allow you to bulk export Page View or Session data for a HubSpot property. But is it still possible to get the "original source" for a contact?
My understanding of the origional source attribute for a contact is that when a contact is first created (e.g. automatically upon form submission, or programatically via the first time an email address is seen via Identify request), it is given a source.
For example, contacts created via API call are initially given "Offline Sources". But later, after the HubSpot tracking ID for that new contact is found to match an existing user, the contact's source may be updated to be something else like "Paid Social" or "Organic Search".
This process of updating the Origional Source for a contact is explained here:
Anyways, the schema of the HubSpot data exported via Segment doesn't seem to include that "Origional Source" property in the `Contacts` table. (Nor any custom properties.) It doesn't seem to be in the `Contact Identities`, `Contact Identity Profiles`, or `Contact Lists` tables either.
So I'd just like to confirm, is the "origional source" property of a content available using the HubSpot API? If so, which method(s) should I be using to gather it? (Since if it is, I assume then the issue is in how that Segment export is configured or how it operates.)