Volume 46 Issue 3
Mar.  2020
Turn off MathJax
Article Contents
XIA Qianchen, LYU Jianghua, MENG Xiangxi, et al. Distributed user trace collection and storage system[J]. Journal of Beijing University of Aeronautics and Astronautics, 2020, 46(3): 548-562. doi: 10.13700/j.bh.1001-5965.2019.0003(in Chinese)
Citation: XIA Qianchen, LYU Jianghua, MENG Xiangxi, et al. Distributed user trace collection and storage system[J]. Journal of Beijing University of Aeronautics and Astronautics, 2020, 46(3): 548-562. doi: 10.13700/j.bh.1001-5965.2019.0003(in Chinese)

Distributed user trace collection and storage system

doi: 10.13700/j.bh.1001-5965.2019.0003
Funds:

National Natural Science Foundation of China 61300007

National Natural Science Foundation of China 61305054

Foundation of the Key Lab of Software Development Environment SKLSDE-2012ZX-28

Foundation of the Key Lab of Software Development Environment SKLSDE-2014ZX-06

More Information
  • Corresponding author: LYU Jianghua, E-mail: jhlv@buaa.edu.cn
  • Received Date: 09 Jan 2019
  • Publish Date: 20 Mar 2020
  • In the distributed complex network environment, collecting the large number of users' behavioral data along with the website data during browsing accurately and comprehensively, efficiently storing them are the basis of user behavior analysis. In order to solve the problems of diversity of data types and storage differences, improve the efficiency of data retrieval, and provide support for the analysis of user behavior for the individual needs of enterprises, a white box mode of user trace collection and storage system is designed in this paper. The users visit the Web server and processes the data of interaction/transaction and user operations, such as pictures, video, description of goods and other types of files. These interfaces and data are called user browsing traces, and operation sequences are the actual user behaviors in order. User data and operation sequence analysis can accurately reflect user characteristics. The collection system is modeled by the interface window tree, providing a unified access interface for data, which is stored in different locations according to the data types. The applications input parameters to specify the storage location to create the database. Through the access interface, the user data can be accessed according to the different file types and requirements. The model solves the problem of capturing, storing, and retrieving traces of Internet-oriented user interaction, and has good accuracy and integrity.

     

  • loading
  • [1]
    SRIVASTAVA J, COOLEY R, DESHPANDE M, et al.Web usage mining:Discovery and applications of usage patterns from Web data[J].ACM SIGKDD Explorations Newsletter, 2000, 1(2):12-23. doi: 10.1145/846183.846188
    [2]
    张玉芳, 张艳华, 熊忠阳.一种高效的用户浏览行为采集方法[J].计算机工程与应用, 2013, 49(3):126-129. doi: 10.3778/j.issn.1002-8331.1108-0269

    ZHANG Y F, ZHANG Y H, XIONG Z Y.Efficient method for collecting user browsing behaviors[J].Computer Engineering and Applications, 2013, 49(3):126-129(in Chinese). doi: 10.3778/j.issn.1002-8331.1108-0269
    [3]
    CATLEDGE L D, PITKOW J E.Characterizing browsing strategies in the world-wide web[J].International World Wide Web Conference, 1995, 27(95):1065-1073.
    [4]
    董志安, 吕学强.基于百度搜索日志的用户行为分析[J].计算机应用与软件, 2013, 30(7):17-20. doi: 10.3969/j.issn.1000-386x.2013.07.006

    DONG Z A, LYU X Q.User behavior analyses based on baidu search logs[J].Computer Applications and Software, 2013, 30(7):17-20(in Chinese). doi: 10.3969/j.issn.1000-386x.2013.07.006
    [5]
    THORAT S S, MORE P.User oriented approach to website navigation concept using mathematical model[C]//International Conference on Computational Intelligence and Communication Networks.Piscataway, NJ: IEEE Press, 2016: 1431-1435.
    [6]
    李睿, 连航, 马世龙, 等.基于形式化方法的航空电子系统检测[J].软件学报, 2015, 26(2):181-201. http://d.old.wanfangdata.com.cn/Periodical/rjxb201502002

    LI R, LIAN H, MA S L, et al.Avionics system testing based on formal methods[J].Journal of Software, 2015, 26(2):181-201(in Chinese). http://d.old.wanfangdata.com.cn/Periodical/rjxb201502002
    [7]
    余慧佳, 刘奕群, 张敏, 等.基于大规模日志分析的搜索引擎用户行为分析[J].中文信息学报, 2007, 21(1):109-114. doi: 10.3969/j.issn.1003-0077.2007.01.018

    YU H J, LIU Y Q, ZHANG M, et al.Research in search engine user behavior based on log analysis[J].Journal of Chinese Information Processing, 2007, 21(1):109-114(in Chinese). doi: 10.3969/j.issn.1003-0077.2007.01.018
    [8]
    FU Y, LUO S, SHU J.Survey of secure cloud storage system and key technologies[J].Journal of Computer Research & Development, 2013, 50(1):136-145. http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=jsjyjyfz201301013
    [9]
    LIU J, HUANG K, RONG H, et al.Privacy-preserving public auditing for regenerating-code-based cloud storage[J].IEEE Transactions on Information Forensics & Security, 2015, 10(7):1513-1528.
    [10]
    WU Y, JIANG Z L, WANG X, et al.Dynamic data operations with deduplication in privacy-preserving public auditing for secure cloud storage[C]//IEEE International Conference on Computational Science and Engineering.Piscataway, NJ: IEEE Press, 2017: 562-567.
    [11]
    BELLET A, HABRARD A, SEBBAN M.A survey on metric learning for feature vectors and structured data[EB/OL].(2014-02-12)[2018-12-29].https: //arxiv.org/abs/1306.6709.
    [12]
    杨晶, 周双娥.一种基于XML的非结构化数据转换方法[J].计算机科学, 2017, 44(11):414-417. http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=jsjkx2017z2088

    YANG J, ZHOU S E.Method for unstructured data transformation based on XML technology[J].Computer Science, 2017, 44(11):414-417(in Chinese). http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=jsjkx2017z2088
    [13]
    BOUCHER T D, AUSLANDER D M, BASH C E, et al.Viability of dynamic cooling control in a data center environment[C]//The Ninth Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems, 2004(ITHERM'04).Piscataway, NJ: IEEE Press, 2006: 593-600.
    [14]
    HOU B, CHEN F, OU Z, et al.Understanding I/O performance behaviors of cloud storage from a client's perspective[J].ACM Transactions on Storage, 2017, 13(2):16. http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=98bf6283ba03ec991c10fccbde40cad9
    [15]
    汪帅, 吕江花, 汪溁鹤, 等.一种支持数据去冗和扩容的多媒体文件云存储系统实现[J].计算机研究与发展, 2018, 55(5):1034-1048. http://d.old.wanfangdata.com.cn/Periodical/jsjyjyfz201805013

    WANG S, LYU J H, WANG R H, et al.A multimedia file cloud storage system to support data deduplication and logical expansion[J].Journal of Computer Research and Development, 2018, 55(5):1034-1048(in Chinese). http://d.old.wanfangdata.com.cn/Periodical/jsjyjyfz201805013
    [16]
    李慧莹.基于HDFS的小文件存储方法的研究与优化[D].西安: 西安电子科技大学, 2014. http://cdmd.cnki.com.cn/Article/CDMD-10701-1014331548.htm

    LI H Y.Research and optimization of small file storage method based on HDFS[D].Xi'an: Xidian University, 2014(in Chinese). http://cdmd.cnki.com.cn/Article/CDMD-10701-1014331548.htm
    [17]
    焦晨宇.可伸缩分布式文件系统及其应用[D].北京: 北京理工大学, 2015.

    JIAO C Y.The design and application of a scalable distributed file system[D].Beijing: Beijing Institute of Technology, 2015(in Chinese).
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(9)  / Tables(4)

    Article Metrics

    Article views(712) PDF downloads(178) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return