Exchange 2010 Deduplication Question

  • Question

  • When we run a discovery search with multiple sets of keywords and deduplication enabled for each set, the Keyword Statistics entry says "(Duplicates not excluded)". Is this because, although each set of keywords has deduplication checked, duplicates are not excluded between the sets?

    For example, 

    # Keyword(s)

    1 "shoes" AND "socks" (deduplication checked)

    2 "coat" AND "hat" (deduplication checked)

    Although e-mails will be deduplicated within set #1 and separately within set #2, there may be duplicates in the combined output. Is that correct?

    If this is true, is there any way to exclude duplicates between each set in a single search?

    Thursday, March 31, 2016 6:30 PM

Answers

  • Hi Barry,

    Yes, I think so.

    Best regards,

    Jim Xu

    Please remember to mark the replies as answers if they help, and unmark the answers if they provide no help. If you have feedback for TechNet Support, contact tnmff@microsoft.com.

    Jim Xu
    TechNet Community Support

    • Marked as answer by BarryWallis Tuesday, April 19, 2016 4:52 PM
    Tuesday, April 19, 2016 8:43 AM
    Moderator

All replies

  • Hi BarryWallis,

    Welcome to the TechNet forum.

    For this issue, we suggest using the following query to exclude the duplicates:

    ("shoes" AND "socks") NOT ("coat" AND "hat")

    More details can be found at the following link:

    https://technet.microsoft.com/en-us/library/dd335072(v=exchg.141).aspx 

    Best regards,

    Jim Xu

    Friday, April 1, 2016 2:00 AM
    Moderator
  • Thank you very much, but I think you may have misunderstood my question. 

    Why does the statistics box say "(Duplicates not excluded)" when I have deduplication checked?

    Friday, April 1, 2016 5:46 PM
  • I think I found the answer. The "(Duplicates not excluded)" phrase refers to the statistics that follow it rather than to the delivered e-mail. I believe this because the number of items in the statistics is always greater than the number of items that were delivered.

    Can you confirm?

    Tuesday, April 5, 2016 7:38 PM
  • Hi BarryWallis,

    As far as I know, it should work well; you could give it a try.

    Best regards,

    Jim Xu

    Thursday, April 7, 2016 1:36 AM
    Moderator
  • Does this mean that my answer above was correct?

    Monday, April 11, 2016 3:17 PM
  • Hi Barry,

    Yes, I think so.

    Best regards,

    Jim Xu

    • Marked as answer by BarryWallis Tuesday, April 19, 2016 4:52 PM
    Tuesday, April 19, 2016 8:43 AM
    Moderator
  • Thank you very much.
    Tuesday, April 19, 2016 4:52 PM
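
The explanation accepted in this thread is that "(Duplicates not excluded)" describes the keyword statistics, not the delivered results. The sketch below is a toy model of that idea, not Exchange's actual implementation. The message IDs, the `hits_by_keyword_set` data, and both function names are made up for illustration. It also assumes that delivered results are deduplicated across the whole search, which the thread does not confirm.

```python
# Hypothetical data: each hit is a made-up message ID. Copies of the same
# message found in different mailboxes share one ID.
hits_by_keyword_set = {
    '"shoes" AND "socks"': ["msg-1", "msg-1", "msg-2"],
    '"coat" AND "hat"': ["msg-2", "msg-3", "msg-3"],
}


def keyword_statistics(hits):
    """Count every hit per keyword set, duplicates included."""
    return {query: len(ids) for query, ids in hits.items()}


def delivered_items(hits):
    """Collect unique message IDs across all keyword sets (assumed behavior)."""
    unique = set()
    for ids in hits.values():
        unique.update(ids)
    return sorted(unique)


stats = keyword_statistics(hits_by_keyword_set)
delivered = delivered_items(hits_by_keyword_set)

print(stats)           # per-set counts, duplicates not excluded
print(len(delivered))  # number of unique items after deduplication
```

In this model the per-set counts add up to more than the number of delivered items, which matches the observation that the statistics are always larger than what is delivered.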