locked
Disk freezes (Access) for a few seconds during daytime on a 2012 R2 fileserver RRS feed

  • Question

  • One of the disks based on Storage Spaces on 2012 R2 is not behaving well, storing home folders. Several times per hour during the working day the disk freezes up and doesn't respond to requests. The duration of the freezes are random and have been observed to last up to 70 seconds. It seems like this happens when users are at work and accessing their home folders, not during the weekend.
    The freeze is observed in Resource Monitor with zero disk activity on the disk. No apparent events in event logs. A Powershell script running on the server is logging file access time that exceeds one second. See the log below for the first half of 25th October. Delay in milliseconds

    25/10-08:11:54;28292
    25/10-08:12:31;12353
    25/10-08:12:44;3859
    25/10-08:15:37;28980
    25/10-08:16:17;5576
    25/10-08:31:27;1173
    25/10-08:46:30;2931
    25/10-08:49:52;47122
    25/10-08:50:49;17503
    25/10-08:51:07;1003
    25/10-08:55:13;1918
    25/10-08:55:22;2396
    25/10-09:12:20;1829
    25/10-09:33:42;1971
    25/10-09:39:30;1213
    25/10-09:52:46;34630
    25/10-09:54:50;12199
    25/10-09:55:05;1017
    25/10-10:01:50;27485
    25/10-10:02:25;2390
    25/10-10:11:47;25251
    25/10-10:12:18;2559
    25/10-10:19:57;27027
    25/10-10:20:29;2772
    25/10-10:20:33;5138
    25/10-10:29:09;26653
    25/10-10:29:43;2681
    25/10-10:32:13;6486
    25/10-10:46:03;2164
    25/10-10:46:31;20590
    25/10-10:46:56;7333
    25/10-10:47:04;1049
    25/10-10:51:26;2194
    25/10-10:51:34;2638
    25/10-10:57:27;1758
    25/10-11:00:38;2967
    25/10-11:18:28;23101
    25/10-11:18:58;2613
    25/10-11:29:28;16156

    We can see that the server occasionally have drops of network traffic to almost 0, while activity drops on the disk to no activity at all (for up to 20-80 sec). See the image below for details. Since the users cannot access their data, the activity drops for a while because of that, naturally.

    We do not know immediately why this is happening. We get lots of complains from users working in applications that use files on the user's home directory. When this lock happens, the whole application freezes and this creates a lot of frustration.
    We have initially reconfigured  backup to run at 7 pm and fragmentation of the disk is turned off. 
    We have run the script that is attached here from different admin desktops to the file server. The script is trying to access a file on the file server each second and prints any delays in the log.

    Does anyone have a clue to where we could troubleshoot this scenario?



    Freddy

    Tuesday, November 1, 2016 9:29 AM

All replies

  • Hi Freddy,

    Thanks for your post.

    Since your error is random and even no error message, I'm afraid that may hard to do the troubleshooting.

    You mainly problem is slow performance when accessing home folder in file server sometimes, right?

    And according to your post, server occasionally have drops of network traffic to almost 0.  Please disable firewall and all third party applications like Anti-Virus application to test if it could reduce the waiting time when accessing a file.

    By the way, what's the client when the users access to the server ?

    Best Regards,

    Mary


    Please remember to mark the replies as answers if they help and unmark them if they provide no help.
    If you have feedback for TechNet Subscriber Support, contact tnmff@microsoft.com.

    Tuesday, November 1, 2016 12:25 PM
  • Slow performance? I would rather say no performance at all when the users try to access home folder in file server. Its like the disk locks so that it is impossible to access any files or folders on the disk during the seconds the incident occurs.

    The firewall service has been set to disable (I know, its better to disable the firewall, not the service, but that's how we do it apparently). SCEP antivirus real-time protection has also been disabled, so has scheduled scans

    I think the users only access this through 2008 R2 terminal-servers, not even sure they use W8 or W10 to access their home folders in our case... From what I heard, it's 2008 R2 mainly

    I have a log now, of the access time/response time above 15 milliseconds from another server, during normal conditions

    Logging response time above 15 miliseconds
    01/11/2016 16:50:46 Time: 61 miliseconds
    01/11/2016 16:50:56 Time: 111 miliseconds
    01/11/2016 18:36:58 Time: 23 miliseconds
    01/11/2016 19:13:28 Time: 15 miliseconds
    01/11/2016 19:40:08 Time: 31 miliseconds
    01/11/2016 19:42:48 Time: 80 miliseconds
    01/11/2016 21:05:32 Time: 15 miliseconds
    01/11/2016 21:16:19 Time: 17 miliseconds
    01/11/2016 21:40:19 Time: 15 miliseconds
    01/11/2016 21:43:19 Time: 22 miliseconds
    01/11/2016 21:48:49 Time: 16 miliseconds
    01/11/2016 22:37:09 Time: 17 miliseconds
    02/11/2016 00:00:00 Time: 135 miliseconds
    02/11/2016 00:12:52 Time: 21500 miliseconds
    02/11/2016 00:13:44 Time: 2249 miliseconds
    02/11/2016 00:14:18 Time: 943 miliseconds
    02/11/2016 00:21:59 Time: 12845 miliseconds
    02/11/2016 00:22:13 Time: 76 miliseconds
    02/11/2016 00:37:33 Time: 149 miliseconds
    02/11/2016 02:13:01 Time: 15 miliseconds
    02/11/2016 02:56:33 Time: 17 miliseconds
    02/11/2016 05:37:33 Time: 334 miliseconds
    02/11/2016 06:31:44 Time: 179 miliseconds
    02/11/2016 07:11:44 Time: 270 miliseconds
    02/11/2016 07:43:45 Time: 864 miliseconds
    02/11/2016 07:44:55 Time: 110 miliseconds
    02/11/2016 07:53:46 Time: 267 miliseconds
    02/11/2016 07:56:06 Time: 29 miliseconds
    02/11/2016 08:01:56 Time: 320 miliseconds
    02/11/2016 08:03:56 Time: 1598 miliseconds
    02/11/2016 08:04:38 Time: 404 miliseconds
    02/11/2016 08:07:08 Time: 2797 miliseconds
    02/11/2016 08:10:00 Time: 222 miliseconds
    02/11/2016 08:11:40 Time: 188 miliseconds
    02/11/2016 08:12:21 Time: 208 miliseconds
    02/11/2016 08:12:41 Time: 426 miliseconds
    02/11/2016 08:13:31 Time: 135 miliseconds
    02/11/2016 08:14:01 Time: 187 miliseconds
    02/11/2016 08:18:42 Time: 191 miliseconds
    02/11/2016 08:19:32 Time: 156 miliseconds
    02/11/2016 08:22:02 Time: 1146 miliseconds
    02/11/2016 08:22:13 Time: 861 miliseconds
    02/11/2016 08:22:34 Time: 182 miliseconds
    02/11/2016 08:23:04 Time: 282 miliseconds
    02/11/2016 08:24:35 Time: 972 miliseconds
    02/11/2016 08:25:06 Time: 130 miliseconds
    02/11/2016 08:25:36 Time: 366 miliseconds
    02/11/2016 08:30:06 Time: 199 miliseconds
    02/11/2016 08:33:46 Time: 19 miliseconds
    02/11/2016 08:36:56 Time: 710 miliseconds
    02/11/2016 08:37:27 Time: 240 miliseconds
    02/11/2016 08:45:47 Time: 520 miliseconds
    02/11/2016 08:46:08 Time: 55 miliseconds

    Freddy


    • Edited by CesarTabares Wednesday, November 2, 2016 7:51 AM
    Wednesday, November 2, 2016 7:50 AM
  • Hi,

    Do it work if you use PerfMon to find any clue?

    https://technet.microsoft.com/en-us/library/2008.08.pulse.aspx#id0120011

    Best Regards,

    Mary


    Please remember to mark the replies as answers if they help and unmark them if they provide no help.
    If you have feedback for TechNet Subscriber Support, contact tnmff@microsoft.com.

    Thursday, November 3, 2016 6:49 AM
  • I'll try the Hard Disk Bottleneck and update this thread


    Freddy

    Friday, November 4, 2016 12:16 PM
  • Hi,

    Any updates for this issue? If there's any information, please feel free to contact us.

    Best Regards,

    Mary


    Please remember to mark the replies as answers if they help.
    If you have feedback for TechNet Subscriber Support, contact tnmff@microsoft.com.

    Monday, November 21, 2016 8:29 AM
  • Yes I have a question about the screenshot I uploaded in my first post

    What does it mean when Disk Queue length drops down to almost 0, while network activity also goes down?


    Freddy

    Friday, December 23, 2016 1:30 PM