I tried to import a list of organisations using the domain name as the unique identifier so that it would ignore, and therefore not import, any records that already had that domain name in HS.
When I reviewed the import list I noticed a lot of duplicate records had been created. The only reason I can think of is that the domain name must be identical. For example https://pixl8.co.uk vs http://www.pixl8.co.uk
Is this expected behaviour or should it only look at the 'pixl8.co.uk' part?
HubSpot strips away the https:// or http://www. during import, that shouldn't be the issue. Could you find two duplicates and double check whether both have a Company domain name? If for some reason one of the does not have a value in there, that would explain the duplicate. (This could've been caused by incorrect mapping or simply omitting the Company domain name column in an import.)
Best regards!
Karsten Köhler HubSpot Freelancer | RevOps & CRM Consultant | Community Hall of Famer
Yes you have to make sure that the Company Domian Name field is included in the CSV, so that the companies can be identified as duplicates. Here is the relevant documentation on this.
Also wanted to mention that Insycle provides you with a lot of flexibility here. Full disclosure, I work for them as a marketer. But Insycle allows you to deduplicate companies using any field in HubSpot CRM as a matching field. So you could use fields like Company Name, for instance, to duplicate companies as well. Then, you can choose between similar matching (instead of exact) to catch companies with similar domains or names. You can also instruct Insycle what terms to ignore...so subdomains/https/extensions for domains. Or stuff like Inc/Co/LLC for company names. This let's you catch many more duplicates in the database by looking more broadly for matches.
HubSpot strips away the https:// or http://www. during import, that shouldn't be the issue. Could you find two duplicates and double check whether both have a Company domain name? If for some reason one of the does not have a value in there, that would explain the duplicate. (This could've been caused by incorrect mapping or simply omitting the Company domain name column in an import.)
Best regards!
Karsten Köhler HubSpot Freelancer | RevOps & CRM Consultant | Community Hall of Famer
I think @SCole7 is correct with their findings. I have a large about of duplicates due to the fact that HS seem to take into consideration the "www." factor. Is this a glitch in the system?