Загрузка...

Informatica CDQ Deduplication Explained | Practical Training Session

Informatica CDQ Deduplication Explained | Practical Training Session

Yeah, hello, everyone. My name is Sujeet Patel (MDM Expert having 14.5+ Experience), again I am back with the practical part of deduplication in Informatica Cloud Data Quality. So let's try to understand deduplication in detail practically. Let's do practicals and all for deduplication. Okay? So all the things are there in the same de duplicate asset. So let's start with these 2 duplication types of understanding.

So here. objective, if you see. So here, if you select the objective. You can see different kinds of business entities, basically like the product, a business entity personal name, an organisation, name, individual household. So this objective option is saying what kind of data set you have? Please let us know. You have address data. You have a kind of family data, and you have household data. You have organisation data, right? So, what kind of data sets do you have? Please let us know so that we can help you. And it can suggest to us, better kind of matching algorithm. So each of these objective have a different meaning. If you select one, you will see that here the options will change. Okay, let's try to select if I'm selecting a supported address. So if I'm selecting that, I want to make the address master.

Then see what options we are getting here we are getting. It is part one, cluster ID, and cluster size. That's it. We are getting only 3 options. So, the cluster ID and cluster size will be in the output. Okay, so just, we need to pass only the address part one. And based on it is part one, we can do the match and much. And we can get the output data.

So let's start with this. Suppose here, if you are passing. If you are taking this address as a kind of objective. And here, let's take by default whatever it's coming. Let's take the default part in the end address key, so this will help us to find the more better candidate, a more precise, better kind of match candidate. If you are selecting the appropriate one, otherwise keep the default. Whatever you are getting here is local data. So it is asking us, what kind of data set do you have, from which country, from which kind of population? Okay, so from which population set, you have the data set so that it is asking to select.

So by default, we have English character data sets we can keep for the United States. Now, the optional fields mean here. Now you can see that only the mandatory field is the address, part one. If you are selecting an address here, that means you have to just select, address part one, right? So, address part one, you have to select. But if you click on this optional field, you will get more field options. Let's try to select it. See, once you are selecting the optional field. Here, then, you can see here you are getting the address, part one, as a mandatory. But you are getting the other part, or other columns, also of the address part 2, postal telephone
ID date attribute. And all right. So this is an optional field, if you have the data. If you have the data for these optional fields, you should select that optional field. If you don't have data, then you should not select that; just select the mandatory fields. Okay? So, optional field. If you are passing the data for this optional field, it will give us, better kind of matching candidate and a more trusted record. But if you don't have. If you have only a mandatory field, just disable it.

If you want to buy our online course that will build your career and help you get a high-paying job in the IT industry by becoming Informatica MDM certified, this is the right opportunity for you!

📞 Call Now: +91-9821931210
📧 E-Mail: support@inventmodel.com
🌐 Visit Website: https://inventmodel.com/course/informatica-cloud-data-quality-cdq-training

#InformaticaCDQ #DataQuality #Deduplication #InformaticaTraining #CloudDataQuality #CDQTraining #DataCleansing #MasterDataManagement #InformaticaTutorial #DataIntegration #InformaticaExperts #ETLTraining

Видео Informatica CDQ Deduplication Explained | Practical Training Session канала Data360 By InventModel
Яндекс.Метрика
Все заметки Новая заметка Страницу в заметки
Страницу в закладки Мои закладки
На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.
О CookiesНапомнить позжеПринять