+7 (495) 987 43 74 ext. 3304
Join us -              
Рус   |   Eng

Authors

Pavlov O.

Degree
Student, MAI, Department no. 316, Moscow Aviation Institute (National Research University)
E-mail
pavlov@smiap.ru
Location
Moscow
Articles

On the issue of indexing file storage based on FTP

This article discusses the server indexing system based on the FTP protocol. The article describes a set of functions for indexing files, search engine and database. The authors have shown the possibility of FTP server indexing, possible problems and the ways of solving them. Despite the evolution of file storages through the 34 years passed after the creation of FTP protocol this issue remains relevant and useful because of big amount of data, stored on FTP servers and wide functionality of developed search system. The search system developed by the project consists of three main parts: indexing system, database and Web-service. Indexing system is the most important and difficult part of the project, it indexes various types of servers, which are fundamentally different from each other in the format of data on files, and avoids link traps and upload indexed data to database. To ensure multi-threaded operation and efficiency, the indexing system is implemented with Python. Web-service accepts requests from the end user, makes a request to the database and returns the search result. The user can create a query specifying the full or partial file name, file type and size. The unique difference of this system is the storage of the change history of files and servers that can be tracked for each file, directory or server.
Read more...