Pengsdata File Categorizer & Filter
Function: This is a creative file classification tool that categorizes or filters files by keyword or by filename extension. By keyword: first, define a relative directory name in a text file (the template), e.g. 'relativeDir', then define a keyword under 'relativeDir', e.g. 'excel'. Next, run the tool and select a source directory and a target directory, e.g. 'd:/sourceDir' and 'd:/targetDir'. The program uses the text file as a template to categorize the source files: if a file's name includes the keyword 'excel', the program places the file in the directory 'd:/targetDir/relativeDir/'. By filename extension: the program copies or moves files of the given types.
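The extension mode can be pictured with a short Python sketch. This is not the tool's own code: the function name transfer_by_extension and its parameters are hypothetical, and the sketch assumes that matching files are collected directly into the target directory, that the target lies outside the source directory, and that file name collisions do not need special handling.

```python
import os
import shutil


def transfer_by_extension(source_dir, target_dir, extensions, move=False):
    """Copy (or move) files whose extension is listed into target_dir.

    Returns the list of destination paths. Hypothetical helper, for
    illustration only.
    """
    # Accept "txt" as well as ".txt"; compare extensions case-insensitively.
    wanted = {e.lower() if e.startswith(".") else "." + e.lower() for e in extensions}
    os.makedirs(target_dir, exist_ok=True)
    action = shutil.move if move else shutil.copy2
    done = []
    for folder, _subdirs, files in os.walk(source_dir):
        for name in files:
            if os.path.splitext(name)[1].lower() in wanted:
                destination = os.path.join(target_dir, name)
                action(os.path.join(folder, name), destination)
                done.append(destination)
    return done
```

For example, transfer_by_extension("d:/sourceDir", "d:/targetDir", ["txt", "html"]) would copy every .txt and .html file found under the source directory.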
Characteristics: No installation required; easy to use; most functions are free; handles HTML files that are saved with an attached companion folder (e.g. '.files', '_files'); secure and trustworthy; runs on multiple operating systems (Windows, Linux, Unix, Mac, etc.).
<?xml version="1.0" encoding="utf-8"?>
<pengsdata>
  <dir name="windows">
    <dir name="windows98">
      <keyword name="win|98" priority="5"/>
      <keyword name="windows 98" priority="10"/>
    </dir>
    <dir name="windows2000">
      <keyword name="win|2000" priority="5"/>
      <keyword name="windows 2000" priority="10"/>
    </dir>
    <keyword name="windows" priority="-1"/>
    <keyword name="win" priority="-10" description="windows"/>
  </dir>
  <dir name="office">
    <dir name="office2000">
      <dir name="word2000">
        <keyword name="word|2000" priority="80"/>
        <keyword name="word 2000" priority="100"/>
      </dir>
      <keyword name="office|2000" priority="50"/>
      <keyword name="office-2000" priority="60"/>
      <keyword name="office 2000" priority="65"/>
    </dir>
    <keyword name="office" priority="-1"/>
  </dir>
</pengsdata>
Comments for template:
<?xml version="1.0" encoding="utf-8"?>: XML declaration, required.
The encoding should match your system encoding. To find your system encoding, run the program, click "Tools" in the menu bar, then click "My System Encoding".
<pengsdata>: pengsdata element, required.
<dir name="">: specifies a directory name, required. The name attribute is required and must be non-empty.
Its sub-elements may be "keyword" or "dir" elements. Each dir element corresponds to a directory named by the element's name attribute.
<keyword name="" priority="" description=""/>: specifies a keyword, required.
It has no sub-elements.
The name and priority attributes are required and must be non-empty. The description attribute is optional.
The value of the name attribute must be one or more keywords, matched case-insensitively. Multiple keywords are separated by "|" and combined with "AND", meaning that the filename must include all of them.
The value of the priority attribute sets the keyword element's priority and must be an integer. Please keep it in the range -30 to 100 so that the template can be shared with other users.
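The attribute rules above can be checked mechanically. The following Python sketch is an illustration only, not the program's own code: the function name load_keywords, the tuple layout and the message texts are hypothetical. It reads a template, collects one rule per keyword element and reports entries that break the rules. It treats a priority outside -30 to 100 as a warning rather than an error, since the range is stated as a recommendation.

```python
import xml.etree.ElementTree as ET


def load_keywords(xml_text):
    """Parse a template string into (rules, problems).

    rules    -- list of (priority, terms, relative_dir) tuples
    problems -- messages for entries that break the documented rules
    """
    rules, problems = [], []

    def walk(element, path):
        for child in element:
            name = child.get("name", "").strip()
            if child.tag == "dir":
                if not name:
                    problems.append("dir with empty name under '/%s'" % "/".join(path))
                    continue
                walk(child, path + [name])
            elif child.tag == "keyword":
                raw = child.get("priority", "").strip()
                try:
                    priority = int(raw)
                except ValueError:
                    problems.append("keyword '%s': priority '%s' is not an integer" % (name, raw))
                    continue
                # "|" separates terms that must all occur in the file name.
                terms = [t.strip().lower() for t in name.split("|") if t.strip()]
                if not terms:
                    problems.append("keyword with empty name under '/%s'" % "/".join(path))
                    continue
                if not -30 <= priority <= 100:
                    problems.append("keyword '%s': priority %d is outside -30..100" % (name, priority))
                rules.append((priority, terms, "/".join(path)))

    walk(ET.fromstring(xml_text), [])
    return rules, problems
```

Each rule records the directory path of the dir elements that enclose the keyword, so a keyword inside office/office2000 yields the relative directory "office/office2000".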
The program takes a file, then takes the keyword with the highest priority and checks whether the file name includes it.
If it does, the program stops checking and places the file in the directory corresponding to the dir element that contains the keyword element.
If it does not, the program takes the keyword with the next highest priority among the remaining keywords and repeats the steps above.
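The matching steps above can be sketched in Python. This is an illustration under stated assumptions, not the program's actual implementation: the function name choose_directory and the (priority, terms, relative_dir) tuple layout are hypothetical, the whole file name including its extension is searched, and keywords with equal priority are tried in template order. The optional threshold parameter mirrors the "Keyword priority must big than" option used in the example further down.

```python
def choose_directory(filename, rules, threshold=None):
    """Return the relative directory for filename, or None if nothing matches.

    rules -- iterable of (priority, terms, relative_dir); every term must
             occur in the file name (the "|" AND relationship).
    """
    name = filename.lower()  # keywords are matched ignoring case
    # Highest priority first; sorted() is stable, so ties keep their order.
    for priority, terms, relative_dir in sorted(rules, key=lambda rule: rule[0], reverse=True):
        if threshold is not None and priority <= threshold:
            break  # every remaining rule is at or below the threshold
        if all(term in name for term in terms):
            return relative_dir
    return None


# A subset of the keywords from the template above, written out by hand.
RULES = [
    (100, ["word 2000"], "office/office2000/word2000"),
    (80, ["word", "2000"], "office/office2000/word2000"),
    (65, ["office 2000"], "office/office2000"),
    (60, ["office-2000"], "office/office2000"),
    (50, ["office", "2000"], "office/office2000"),
    (10, ["windows 2000"], "windows/windows2000"),
    (5, ["win", "2000"], "windows/windows2000"),
]

print(choose_directory("how install office 2000 on windows 2000.txt", RULES))
# office/office2000
```

The file name also contains "windows 2000", but "office 2000" has the higher priority (65 against 10), so the search stops there.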
For example (the template is test.xml above; suppose it is saved in d:\):
Source directory: c:\abc\def
Destination directory: d:\xyz
Check whether include keyword in Text file content: deselected
Check keywords' priority is right or not: deselected
Keyword priority must big than: -1
Template file path: d:\test.xml
Suppose the source directory contains the file: c:\abc\def\gh\how install office 2000 on windows 2000.txt
Result: d:\xyz\office\office2000\how install office 2000 on windows 2000.txt
Supplement:
If "Check whether include keyword in Text file content" is selected, the program also checks whether the first 20 lines (at most) of "c:\abc\def\gh\how install office 2000 on windows 2000.txt" include the keyword "office 2000". If they do not, the file is only listed in the program window and is not copied or moved.
"Text file" means that only the content of txt/htm/html files is checked.
If "Check keywords' priority is right or not" is selected, the program checks the priority of every keyword in the template file and lists all keywords whose priority is incorrect in the program window. You should select this option the first time you use a template file.
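The content check described in the supplement can be sketched as follows. This is a rough illustration rather than the program's own logic: the function name content_has_keyword and its parameters are hypothetical, and it assumes UTF-8 decoding, case-insensitive matching, and that files other than txt/htm/html skip the check and pass.

```python
import os
from itertools import islice

TEXT_EXTENSIONS = {".txt", ".htm", ".html"}


def content_has_keyword(path, terms, max_lines=20, encoding="utf-8"):
    """Return True if every term occurs in the first max_lines lines of path.

    Files that are not txt/htm/html are not opened and return True
    (assumption: the check simply does not apply to them).
    """
    if os.path.splitext(path)[1].lower() not in TEXT_EXTENSIONS:
        return True
    with open(path, encoding=encoding, errors="replace") as handle:
        # islice stops after max_lines lines, so large files are not read fully.
        head = "".join(islice(handle, max_lines)).lower()
    return all(term in head for term in terms)
```

A file for which this returns False would only be listed, not copied or moved.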
"keyword Priority must big than" is set -1 here, three keyword elements below
will be ignored:
<keyword name="windows" priority="-1"/>
<keyword name="win" priority="-10"/>
<keyword name="office" priority="-1"/>
"Template file path" may contain one or more template file paths.
Support and feedback: If you have any questions, comments, or suggestions, do not hesitate to get in touch with me:
wenmingpeng@gmail.com;
pengwenming@hotmail.com;
wenming_peng@yahoo.com