Performance optimization for size info in NTFS by zivillian · Pull Request #33 · DiscUtils/DiscUtils

zivillian · 2017-07-07T12:31:50Z

I've found a major performance penalty in the space usage calculation of NTFS. Currenty the ClusterBitmap is read byte by byte from the underlying disk. This result in runtimes of up to one minute for large filesystems (>1TiB fs with a bitmap >50MiB).

I've changed the implementation to read the bitmap in blocks of 4K, which increased the performance by > 99% (in my test from ~1200ms down to ~45ms).

As a second optimization I've added a BitCounter which uses a precalculated lookup table to further speed up the size calculation from ~45ms down to ~15ms.

LordMike

Great contribution. We can probably improve even more, but it's a great start. 👍

Please see the comments I've made.

LordMike · 2017-07-10T19:41:56Z

            return 0;
        }

+        public int GetBytes(byte[] buffer, long index, int count)


We CANNOT have a public method formed like this. I would assume you mean the Index of the buffer, and not the index of the underlying structure.

Either:

make it private/internal

change it so that it takes an offset for the buffer as well (like Array.Copy())

some other way, so that we keep within the observed standards in .NET API's .. :)

Also. Name the offset in buffer just that: offset. So we're consistent with others. :)

This signature was actually a bad copy of GetByte. I've added offset, made both internal and reordered the parameters to be more clear.

LordMike · 2017-07-10T19:49:26Z

+
+namespace DiscUtils.Streams
+{
+    public static class BitCounter


Could we add a little documentation on the purpose of this class? .. Much of the API does not have docmentation, but it'd be nice to know that the purpose of this class is (set) bit counting. Maybe on the public methods too.

LordMike · 2017-07-10T19:51:57Z

+        {
+            var end = start + count;
+            if (end > values.Length)
+                return 0;


Please throw exceptions for exceptional states.

done - but I'm actually unsure which exception and message fits best - I first thought about ArgumentOutOfRangeException, but there's no single argument, which is out of range.

LordMike · 2017-07-10T19:52:15Z

+            return _lookupTable[value];
+        }
+
+        public static long Count(byte[] values, int start, int count)


Can we name them offset and count?

Includes PRs: #17, #18, #21, #22, #23, #27, #30, #31, #33, #34, #35, #36, #38, #39, #40, #41, #48, #51, #52, #55, #60

performance optimization

3e7d815

LordMike requested changes Jul 10, 2017

View reviewed changes

refactoring

870c82f

LordMike approved these changes Jul 11, 2017

View reviewed changes

LordMike merged commit 9af6581 into DiscUtils:master Jul 11, 2017

zivillian deleted the ntfs_performance branch July 11, 2017 18:41

LordMike added a commit that referenced this pull request Aug 16, 2017

Release 0.13.0-alpha

f3efdf9

Includes PRs: #17, #18, #21, #22, #23, #27, #30, #31, #33, #34, #35, #36, #38, #39, #40, #41, #48, #51, #52, #55, #60

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance optimization for size info in NTFS#33

Performance optimization for size info in NTFS#33
LordMike merged 2 commits into
DiscUtils:masterfrom
ahdde:ntfs_performance

zivillian commented Jul 7, 2017

Uh oh!

LordMike left a comment

Uh oh!

LordMike Jul 10, 2017

Uh oh!

LordMike Jul 10, 2017

Uh oh!

LordMike Jul 10, 2017

Uh oh!

zivillian Jul 11, 2017 •

edited

Loading

Uh oh!

LordMike Jul 10, 2017

Uh oh!

zivillian Jul 11, 2017

Uh oh!

LordMike Jul 10, 2017

Uh oh!

zivillian Jul 11, 2017 •

edited

Loading

Uh oh!

LordMike Jul 10, 2017

Uh oh!

zivillian Jul 11, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

zivillian commented Jul 7, 2017

Uh oh!

LordMike left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zivillian Jul 11, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zivillian Jul 11, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

zivillian Jul 11, 2017 •

edited

Loading

zivillian Jul 11, 2017 •

edited

Loading