• Live group demo of Marketing Hub + Data Agent

    Standardize reporting, reduce manual work, and introduce AI without cleanup

    Join us on March 12
  • Marketing that's efficient and human? That's Loop Marketing.

    Explore HubSpot Academy's 39-video playlist and put it into practice.

    Start learning

Deduplication using ‘exact’ domain name?

SCole7
Member

Hi All,

I tried to import a list of organisations using the domain name as the unique identifier so that it would ignore, and therefore not import, any records that already had that domain name in HS.

When I reviewed the import list I noticed a lot of duplicate records had been created. The only reason I can think of is that the domain name must be identical. For example https://pixl8.co.uk vs http://www.pixl8.co.uk

 

Is this expected behaviour or should it only look at the 'pixl8.co.uk' part?


Thanks

Scott

 

 

0 Upvotes
1 Accepted solution
karstenkoehler
Solution
Hall of Famer | Partner
Hall of Famer | Partner

Hi @SCole7,

 

HubSpot strips away the https:// or http://www. during import, that shouldn't be the issue. Could you find two duplicates and double check whether both have a Company domain name? If for some reason one of the does not have a value in there, that would explain the duplicate. (This could've been caused by incorrect mapping or simply omitting the Company domain name column in an import.)

 

Best regards!

Karsten Köhler
HubSpot Freelancer | RevOps & CRM Consultant | Community Hall of Famer

Beratungstermin mit Karsten vereinbaren

 

Did my post help answer your query? Help the community by marking it as a solution.

View solution in original post

4 Replies 4
RBozeman
Participant

Hi @SCole7,

 

Yes you have to make sure that the Company Domian Name field is included in the CSV, so that the companies can be identified as duplicates. Here is the relevant documentation on this. 

 

Also wanted to mention that Insycle provides you with a lot of flexibility here. Full disclosure, I work for them as a marketer. But Insycle allows you to deduplicate companies using any field in HubSpot CRM as a matching field. So you could use fields like Company Name, for instance, to duplicate companies as well. Then, you can choose between similar matching (instead of exact) to catch companies with similar domains or names. You can also instruct Insycle what terms to ignore...so subdomains/https/extensions for domains. Or stuff like Inc/Co/LLC for company names. This let's you catch many more duplicates in the database by looking more broadly for matches. 

0 Upvotes
DianaGomez
Community Manager
Community Manager

Hi @RBozeman hope you are doing well!

 

Thanks for sharing 🙂

 

Best,

Diana


loop Loop Marketing is a new four-stage approach that combines AI efficiency and human authenticity to drive growth.
Learn More

karstenkoehler
Solution
Hall of Famer | Partner
Hall of Famer | Partner

Hi @SCole7,

 

HubSpot strips away the https:// or http://www. during import, that shouldn't be the issue. Could you find two duplicates and double check whether both have a Company domain name? If for some reason one of the does not have a value in there, that would explain the duplicate. (This could've been caused by incorrect mapping or simply omitting the Company domain name column in an import.)

 

Best regards!

Karsten Köhler
HubSpot Freelancer | RevOps & CRM Consultant | Community Hall of Famer

Beratungstermin mit Karsten vereinbaren

 

Did my post help answer your query? Help the community by marking it as a solution.

MLy1
Contributor

Hi @karstenkoehler,

 

I think @SCole7 is correct with their findings. I have a large about of duplicates due to the fact that HS seem to take into consideration the "www." factor. Is this a glitch in the system?

 

Thank you,

Mel

 

MLy1_0-1657127958539.png

MLy1_1-1657127985516.png