Citation: | TONG Lingling, LI Pengxiao, DUAN Dongsheng, et al. Data masking model for heterogeneous big data environment[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(2): 249-257. doi: 10.13700/j.bh.1001-5965.2020.0403(in Chinese) |
Due to the variety of data types and desensitization demand in different scenarios, traditional data masking methods cannot meet the user privacy protection requirements in the environment of big data. How to realize the accurate pointing and efficient desensitization of heterogeneous big data for data security, trust and availability, has become the key in this area. In this paper, we propose a data masking model for heterogeneous big data applications, such as texts, images, voices and databases, and four key modules are presented in our model. First, the sensitive data automatic identification and classification in different applications are realized in different application scenarios by desensitization data preprocessing. Second, with data pre-masking method, the data masking evaluation is implemented in five dimensions, including data availability, data relevance, degree of privacy protection, and time and space complexity, to construct the customized desensitization strategy. Finally, after task scheduling, the allocation and execution of the data masking tasks are performed, and the masking data recovery can also be partially supported. Two typical data masking applications are verified and analyzed based on the proposed heterogeneous big data masking model, indicating that effective desensitization can be achieved in different application scenarios.
[1] |
SWEENEY L. k-anonymity: A model for protecting privacy[J]. Fuzziness and Knowledge-based Systems, 2002, 10(5): 557-570. doi: 10.1142/S0218488502001648
|
[2] |
RADHAKRISHNAN R, KHARRAZI M, MEMON N. Data masking: A new approach for steganography[J]. Journal of VLSI Signal Processing Systems for Signal, Image and Video Technology, 2005, 41(3): 293-303. doi: 10.1007/s11265-005-4153-1
|
[3] |
RAVIKUMAR G K, MANJUNATH T N, RAVINDRA S, et al. A survey on recent trends, process and development in data masking for testing[J]. International Journal of Computer Science, 2011, 8(2): 535-544.
|
[4] |
VICTOR N, LOPEZ D, ABAWAJY J H. Privacy models for big data: A survey[J]. International Journal of Big Data Intelligence, 2016, 3(1): 61-75. doi: 10.1504/IJBDI.2016.073904
|
[5] |
VADREVU P K, ADUSUMALLI S K, MANGALAMPLLI V K. Survey: Privacy preserving data publication in the age of big data in IoT era[J]. International Journal of Engineering, Science and Mathematics, 2017, 6(8): 938-944.
|
[6] |
陈天莹, 陈剑锋. 大数据环境下的智能数据脱敏系统[J]. 通信技术, 2016, 49(7): 915-922. doi: 10.3969/j.issn.1002-0802.2016.07.023
CHEN T Y, CHEN J F. Intelligent data masking system for big data productive environment[J]. Communications Technology, 2016, 49(7): 915-922(in Chinese). doi: 10.3969/j.issn.1002-0802.2016.07.023
|
[7] |
MACHANAVAJJHALA A, GEHRKE J, KIFER D, et al. l-diversity: Privacy beyond k-anonymity[C]//IEEE 22nd International Conference on Data Engineering. Piscataway: IEEE Press, 2006: 24.
|
[8] |
LI N, LI T, VENKATASUBRAMANIAN S. t-closeness: Privacy beyond k-anonymity and l-diversity[C]//IEEE 23rd International Conference on Data Engineering. Piscataway: IEEE Press, 2007: 106-115.
|
[9] |
SARADA G, ABITHA N, MANIKANDAN G, et al. A few new approaches for data masking[C]//International Conference on Circuits, Power and Computing Technologies. Piscataway: IEEE Press, 2015: 15295632.
|
[10] |
GUJJARY V A, SAXENA A. A neural network approach for data masking[J]. Neurocomputing, 2011, 74(9): 1497-1501. doi: 10.1016/j.neucom.2011.01.002
|
[11] |
ZHOU Y, LOUIS T A. A smoothing approach for masking spatial data[J]. Annals of Applied Statistics, 2010, 4(3): 1451-1475. doi: 10.1214/09-aoas325
|
[12] |
吴克河, 朱海, 李为, 等. 基于敏感信息度量的t-保密脱敏技术改良[J]. 信息技术, 2019(11): 5-9.
WU K H, ZHU H, LI W, et al. An improvement of t-closeness technology based on sensitive information metric[J]. Information Technology, 2019(11): 5-9(in Chinese).
|
[13] |
SANTOS R J, BERNARDINO J, VIEIRA M. A data masking technique for data warehouses[C]//Proceedings of the 15th Symposium on International Database Engineering & Applications, 2011: 61-69.
|
[14] |
张琦颖. 大数据脱敏系统的设计与实现[D]. 北京: 北京邮电大学, 2018: 19-33.
ZHANG Q Y. The design and implementation of big data anonymity system[D]. Beijing: Beijing University of Posts and Telecommunications, 2018: 19-33(in Chinese).
|
[15] |
邵华西. 基于T-Closeness的大数据脱敏系统的设计与实现[D]. 北京: 北京邮电大学, 2019: 44-52.
SHAO H X. Design and implementation of T-Closeness based big data anonymization system[D]. Beijing: Beijing University of Posts and Telecommunications, 2019: 44-52(in Chinese).
|
[16] |
王鑫, 王电钢, 母继元, 等. 基于机器学习的数据脱敏系统研究与设计[J]. 电力信息与通信技术, 2018, 16(1): 33-38.
WANG X, WANG D G, MU J Y, et al. Research and implementation of data masking system based on machine learning[J]. Electric Power ICT, 2018, 16(1): 33-38(in Chinese).
|
[17] |
邓雪, 李家铭, 曾浩健, 等. 层次分析方法权重计算方法分析及其应用研究[J]. 数学的实践与认知, 2012, 42(7): 93-100.
DENG X, LI J M, ZENG H J, et al. Research on computation methods of AHP weight vector and its applications[J]. Mathematics in Practice and Theory, 2012, 42(7): 93-100(in Chinese).
|
[1] | LI L Y,YANG R N,WANG Y,et al. CAP planning method based on elliptic fitting of optimal detection routes[J]. Journal of Beijing University of Aeronautics and Astronautics,2025,51(1):293-302 (in Chinese). doi: 10.13700/j.bh.1001-5965.2022.0978. |
[2] | SUN K,HU Q S,ZHENG X F,et al. Multi-Bernoulli extended target tracking based on orientation and half axes lengths of an ellipse[J]. Journal of Beijing University of Aeronautics and Astronautics,2024,50(11):3367-3376 (in Chinese). doi: 10.13700/j.bh.1001-5965.2022.0869. |
[3] | CHEN Hui, LIU Meng-bo, LIAN Feng, HAN Chong-zhao. Star convex irregular shape multi-extended target PMBM filter[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0766 |
[4] | PANG C,LIU D J,TIAN G,et al. Experimental and simulation study on fatigue multi crack fusion of 2195-T8 Al-Li alloy[J]. Journal of Beijing University of Aeronautics and Astronautics,2024,50(1):350-358 (in Chinese). doi: 10.13700/j.bh.1001-5965.2022.0249. |
[5] | ZHANG Xu, ZHAO Rui, LI Yu, YANG Guang, WANG Li-yan. Component of gas-injection effects on wall heat flux and skin-friction of vehicles[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2024.0009 |
[6] | ZHOU K,CHEN W J,CHEN W H,et al. Extended subtraction speech enhancement based on cubic spline interpolation[J]. Journal of Beijing University of Aeronautics and Astronautics,2023,49(10):2826-2834 (in Chinese). doi: 10.13700/j.bh.1001-5965.2021.0744. |
[7] | WANG J M,GUO Y Q,YU H F. Extension method of engine low speed characteristics based on backbone features[J]. Journal of Beijing University of Aeronautics and Astronautics,2023,49(9):2351-2360 (in Chinese). doi: 10.13700/j.bh.1001-5965.2021.0634. |
[8] | LIU Qiang, YIN Yu, LI Kai. Research on image preprocessing acceleration method based on RISC-V vector extension[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0208 |
[9] | ZHANG Yun-jie, ZHOU Jie-xin, ZHANG Feng-zhe, ZHOU Rui, ZOU Ting. Reachability Evaluation Method for Ballistic Missile Based on Extended Boundary Method[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0630 |
[10] | HAN J K,YUAN T,LIU Z K,et al. Expanding hexagon search method based on honeycomb structure[J]. Journal of Beijing University of Aeronautics and Astronautics,2023,49(10):2731-2740 (in Chinese). doi: 10.13700/j.bh.1001-5965.2021.0718. |
[11] | XIN T D,CUI C Y,LIU Y,et al. Non-probabilistic reliability analysis method for propellent tank with crack defect[J]. Journal of Beijing University of Aeronautics and Astronautics,2023,49(9):2330-2336 (in Chinese). doi: 10.13700/j.bh.1001-5965.2021.0651. |
[12] | FAN X H,GOU B Y,CHEN T,et al. Hole edge crack monitoring technology of flexible eddy current array sensor[J]. Journal of Beijing University of Aeronautics and Astronautics,2023,49(3):726-734 (in Chinese). doi: 10.13700/j.bh.1001-5965.2021.0306. |
[13] | LI Y,ZONG H H,CAI J,et al. Hydroplaning behavior of aircraft wheel group and additional resistance due to accumulated water on pavement[J]. Journal of Beijing University of Aeronautics and Astronautics,2023,49(5):1099-1107 (in Chinese). doi: 10.13700/j.bh.1001-5965.2021.0402. |
[14] | WEN C,DONG W H,XIE W J,et al. Multi-UAVs 3D cooperative curve path planning method based on CEA-GA[J]. Journal of Beijing University of Aeronautics and Astronautics,2023,49(11):3086-3099 (in Chinese). doi: 10.13700/j.bh.1001-5965.2021.0787. |
[15] | ZHOU Quan-zhi, YANG You-xu, SUN Lu-bin, ZHANG Xing-cui, WU Yi-fei, HUO Meng-wen. Aeroelastic Optimization Design of SpaRibs Wing Structure[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0343 |
[16] | WANG Z X,WAN Z Q,WANG X Z,et al. Fast stability analysis method for composite panel with variable angle tow fiber[J]. Journal of Beijing University of Aeronautics and Astronautics,2023,49(2):353-366 (in Chinese). doi: 10.13700/j.bh.1001-5965.2021.0259. |
[17] | WU Sunyong, ZHOU Yusong, XIE Yun, CAI Ruhua, FAN Xiangting. Extended target tracking algorithm based on MM-GGIW-PMBM filter[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(12): 2356-2364. doi: 10.13700/j.bh.1001-5965.2021.0162 |
[18] | PENG Chaoyong, XU Songbai, DU Chuangzhou, ZHANG Jie. Ultrasonic phased array imaging on aviation aluminum block fatigue crack[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(12): 2398-2404. doi: 10.13700/j.bh.1001-5965.2021.0161 |
[19] | LI Yongchang, DAI Yuting, YANG Chao. Fluid and structure coupling analysis of split drag rudder[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(12): 2494-2501. doi: 10.13700/j.bh.1001-5965.2021.0151 |
[20] | XIA Fei, XUE Jianghong, HE Zanhang, JIN Fusong. Interfacial crack growth of delaminated composite laminates under hygrothermal environment[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(12): 2460-2472. doi: 10.13700/j.bh.1001-5965.2021.0137 |