SharePoint File Types Distribution Test

SharePoint users typically upload and/or download files of different types in a SharePoint environment. Administrators may choose to block some of these file types owing to various reasons. One of the most common reasons is to reduce space usage of files! Certain file types may not be critical to the business but may occupy significant space. If files of such types accumulate, they may not only deny space for business-critical information, but may even crash the sites they are uploaded to. To avoid this, the 'heavy' file types may be blocked.

But, how does an administrator know which file types are consuming space excessively? This is where the SharePoint File Types Distribution test helps! This test groups files uploaded to the SharePoint environment on the basis of their file types, and reports the number and total size of each file type. The percentage size distribution of every file type is also reported, so that administrators can instantly and accurately identify the file type that is hogging space! Using the detailed diagnosis of the test, you can easily pinpoint the exact file extensions that are occupying the maximum space.

By default, eG Enterprise groups a pre-defined set of file extensions under a file type. For instance, files with extensions .doc, .docx, .log, .msg, .odt, .pages, .rtf, .tex, .txt, .wpd, .and wps are by default grouped under the file type 'Text files'.Space usage statistics related to all the aforesaid extensions will hence be aggregated and reported for the measure group, "Text files". If required, you can add more extensions to a file type or remove existing extensions from a file type. For example, to make sure that files with the extensions, say .xls and .xlsx, are also classifed as "Text files", do the following:

  1. Edit the eg_tests.ini file in the <EG_INSTALL_DIR>\manager\config directory.
  1. In the [Sharepoint2010_server] section of the file, you will find entries for each file type. In this section, look for the following entry:

    Text_Files=.doc|Microsoft Word Document,.docx|Microsoft Word Open XML Document,.log|Log File,.msg|Outlook Mail Message,.odt|OpenDocument Text Document,.pages|Pages Document,.rtf|Rich Text Format File,.tex|LaTeX Source Document,.txt|Plain Text File,.wpd|WordPerfect Document,.wps|Microsoft, Works Word Processor Document

  2. As you can see, the "Text_Files" parameter is set to a pipe-separated list of <DocumentType>,<FileExtension> pairs. For instance, in the pair Microsoft Word Document,.doc, .doc is the file extension that will be grouped under the measure group, "Text files". Microsoft Word Document will be displayed as the Document Type of all files with extension .doc, in the detailed diagnosis . Now, to make sure that files with extensions .xls and.xlsx are also grouped under the "Text files" measure group, append the following entry to the "Text_Files" parameter:

    |Microsoft Excel,.xls|Microsoft Excel,.xlsx

    The full Text_Files entry will now read as follows:

    Text_Files=.doc|Microsoft Word Document,.docx|Microsoft Word Open XML Document,.log|Log File,.msg|Outlook Mail Message,.odt|OpenDocument Text Document,.pages|Pages Document,.rtf|Rich Text Format File,.tex|LaTeX Source Document,.txt|Plain Text File,.wpd|WordPerfect Document,.wps|Microsoft, Works Word Processor Document|Microsoft Excel,.xls|Microsoft Excel,.xlsx

  3. Finally, save the file.

Target of the test : A Microsoft SharePoint Server

Agent deploying the test : An internal/remote agent

Outputs of the test : One set of results for the SharePoint server being monitored.

Configurable parameters for the test
Parameters Description

Test period

This indicates how often should the test be executed.

Host

The host for which the test is to be configured.

Port

The port at which the host server listens.

Fetch Farm Measures

Typically, farm-level metrics - eg., metrics on farm status, site collections, usage analytics - will not vary from one SharePoint server in the farm to another. If these metrics are collected and stored in the eG database for each monitored server in the SharePoint farm, it is bound to unnecessarily consume space in the database and increase processing overheads. To avoid this, farm-level metrics collection is by default switched off for the member servers in the SharePoint farm, and enabled only if the server being monitored is provisioned as the Central Administration site. Accordingly, this parameter is set to If Central Administration by default. This default setting ensures that farm-level metrics are collected from and stored in the database for only a single SharePoint server in the farm.

If you want to completely switch-off farm-level metrics collection for a SharePoint farm, then set this parameter to No.

Some high-security environments may not allow an eG agent to be deployed on the Central Administration site. Administrators of such environments may however require farm-level insights into status and performance. To provide these insights for such environments, you can optionally enable farm-level metrics collection from any monitored member server in the farm, even if that server is not provisioned as the Central Administration site. For this, set this parameter to Yes when configuring this test for that member server.

Domain, Domain User, Password, and Confirm Password

If the Fetch Farm Measures flag of these tests is set to No or to If Central Administration Site, then this test should be configured with the credentials of a user with the following privileges:

On the other hand, if the Fetch Farm Measures flag of these tests is set to Yes, then the user configured for the tests not only requires the four privileges discussed above, but should also be part of the following groups on the eG agent host:

  • Administrators

  • WSS_ADMIN_WPG

  • IIS_USRS

  • Performance Monitor Users

  • WSS_WPG

  • Users

It is recommended that you create a special user for this purpose and assign the aforesaid privileges to him/her. Once such a user is created, specify the domain to which that user belongs in the Domain text box, and then, enter the credentials of the user in the Domain User and Password text boxes. To confirm the password, retype it in the Confirm Password text box.

DD Frequency

Refers to the frequency with which detailed diagnosis measures are to be generated for this test. The default is 1:1. This indicates that, by default, detailed measures will be generated every time this test runs, and also every time the test detects a problem. You can modify this frequency, if you so desire. Also, if you intend to disable the detailed diagnosis capability for this test, you can do so by specifying none against DD frequency.

Detailed Diagnosis

To make diagnosis more efficient and accurate, the eG Enterprise embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, click on the Off option.

The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled:

  • The eG manager license should allow the detailed diagnosis capability
  • Both the normal and abnormal frequencies configured for the detailed diagnosis measures should not be 0.
Measurements made by the test
Measurement Description Measurement Unit Interpretation

Text files

Indicates the number of text files presently in SharePoint.

Number

 

Text files size

Indicates the total size of all the text files.

MB

 

Text files distribution

Indicates the percentage of total space in the SharePoint environment that is occupied by the text files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that text files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Data files

Indicates the number of files of type data in the SharePoint environment.

Number

 

Data files size

Indicates the total size of all data files.

MB

 

Data files distribution

Indicates the percentage of total space in the environment that is occupied by the data files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that data files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Audio files

Indicates the number of files of type audio in the SharePoint environment.

Number

 

Audio files size

Indicates the total size of all audio files.

MB

 

Audio files distribution

Indicates the percentage of total space in the environment that is occupied by the audio files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that audio files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Video files

Indicates the number of files of type video in the SharePoint environment.

Number

 

Video files size

Indicates the total size of all video files.

MB

 

Video files distribution

Indicates the percentage of total space in the environment that is occupied by the video files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that video files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

3D image files

Indicates the number of files of type 3D images in the SharePoint environment.

Number

 

3D images files size

Indicates the total size of all 3D image files.

MB

 

3D image files distribution

Indicates the percentage of total space in the environment that is occupied by the 3D image files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that 3D image files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Raster image files

Indicates the number of files of type raster image in the SharePoint environment.

Number

 

Raster image files size

Indicates the total size of all raster image files.

MB

 

Raster image files distribution

Indicates the percentage of total space in the environment that is occupied by the raster image files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that raster image files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Vector image files

Indicates the number of files of type vector image in the SharePoint environment.

Number

 

Vector image files size

Indicates the total size of all vector image files.

MB

 

Vector image files distribution

Indicates the percentage of total space in the environment that is occupied by the vector image files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that vector image files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Page layout files

Indicates the number of files of type page layout in the SharePoint environment.

Number

 

Page layout files size

Indicates the total size of all page layout files.

MB

 

Page layout files distribution

Indicates the percentage of total space in the environment that is occupied by the page layout files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that page layout files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Spreadsheet files

Indicates the number of files of type spreadsheet in the SharePoint environment.

Number

 

Spreadsheet files size

Indicates the total size of all spreadsheet files.

MB

 

Spreadsheet files distribution

Indicates the percentage of total space in the environment that is occupied by the spreadsheet files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that spreadsheet files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Database files

Indicates the number of files of type database in the SharePoint environment.

Number

 

Database files size

Indicates the total size of all database files.

MB

 

Database files distribution

Indicates the percentage of total space in the environment that is occupied by the database files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that database files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Executable files

Indicates the number of files of type executable in the SharePoint environment.

Number

 

Executable files size

Indicates the total size of all executable files.

MB

 

Executables files distribution

Indicates the percentage of total space in the environment that is occupied by the executable files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that executable files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Game files

Indicates the number of files of type game in the SharePoint environment.

Number

Game files size

Indicates the total size of all game files.

MB

 

Games files distribution

Indicates the percentage of total space in the environment that is occupied by the game files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that game files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

CAD files

Indicates the number of files of type CAD in the SharePoint environment.

Number

CAD files size

Indicates the total size of all CAD files.

MB

 

CAD files distribution

Indicates the percentage of total space in the environment that is occupied by the CAD files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that CAD files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

GIS files

Indicates the number of files of type GIS in the SharePoint environment.

Number

 

GIS files size

Indicates the total size of all GIS files.

MB

 

GIS files size distribution

Indicates the percentage of total space in the environment that is occupied by the GIS files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that GIS files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Web files

Indicates the number of files of type web in the SharePoint environment.

Number

 

Web files size

Indicates the total size of all web files.

MB

 

Web files distribution

Indicates the percentage of total space in the environment that is occupied by the web files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that web files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Plugin files

Indicates the number of files of type plugin in the SharePoint environment.

Number

 

Plugin files size

Indicates the total size of all plugin files.

MB

 

Plugin files distribution

Indicates the percentage of total space in the environment that is occupied by the plugin files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that plugin files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Font files

Indicates the number of files of type font in the SharePoint environment.

Number

 

Font files size

Indicates the total size of all font files.

MB

 

Font files distribution

Indicates the percentage of total space in the environment that is occupied by the font files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that font files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

System files

Indicates the number of files of type system in the SharePoint environment.

Number

 

System files size

Indicates the total size of all system files.

MB

 

System files distribution

Indicates the percentage of total space in the environment that is occupied by the system files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that system files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Settings files

Indicates the number of files of type settings in the SharePoint environment.

Number

 

Settings files size

Indicates the total size of all settings files.

MB

 

Settings files distribution

Indicates the percentage of total space in the environment that is occupied by the settings files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that settings files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Encoded files

Indicates the number of files of type encoded in the SharePoint environment.

Number

 

Encoded files size

Indicates the total size of all encoded files.

MB

 

Encoded files distribution

Indicates the percentage of total space in the environment that is occupied by the encoded files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that encoded files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Compressed files

Indicates the number of files of type compressed in the SharePoint environment.

Number

 

Compressed files size

Indicates the total size of all compressed files.

MB

 

Compressed files distribution

Indicates the percentage of total space in the environment that is occupied by the compressed files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that compressed files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Disk image files

Indicates the number of files of type disk image in the SharePoint environment.

Number

 

Disk images files size

Indicates the total size of all disk image files.

MB

 

Disk image files distribution

Indicates the percentage of total space in the environment that is occupied by the disk image files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that disk image files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Developer files

Indicates the number of files of type developer in the SharePoint environment.

Number

 

Developer files size

Indicates the total size of all developer files.

MB

 

Developer files distribution

Indicates the percentage of total space in the environment that is occupied by the developer files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that developer files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Backup files

Indicates the number of files of type backup in the SharePoint environment.

Number

 

Backup files size

Indicates the total size of all backup files.

MB

 

Backup files distribution

Indicates the percentage of total space in the environment that is occupied by the backup files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that backup files are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Miscellaneous files

Indicates the number of files that are not of any of types listed above, in the SharePoint environment.

Number

 

Miscellaneous files size

Indicates the total size of all the other files.

MB

 

Miscellaneous files distribution

Indicates the percentage of total space in the environment that is occupied by all the other files.

Percent

You can compare the value of this measure with that of the other "distribution" measures to know which file type is hogging the space.

A value close to 100% is a cause for concern, as it indicates that files of some random type are occupying almost all the space in the environment. In such a case, use the detailed diagnosis of the measure to know which file extensions specifically are eroding the space.

Use the detailed diagnosis of the Text files measure to view the file extensions grouped under Text files and the space usage of each extension.

Figure 1 : The detailed diagnosis of the Text files measure