Citation: | GUAN Shuaipeng, YU Haiyang, YANG Zhen, et al. Automatic poster synthesis system based on keywords[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(2): 356-368. doi: 10.13700/j.bh.1001-5965.2020.0552(in Chinese) |
The popularization of intelligence puts forward new requirements for image editing. As a way of transmitting information in the form of images, posters play an important role in daily life and work management. But the production of posters requires multi-element image synthesis. However, there is a lack of an interactive and one click image synthesis system. Combined with the current popular image processing technology, a poster automatic synthesis system is designed and implemented. We propose keyword-based image retrieval scheme, constructs a dual filtering scheme based on text and content, and provides users with accurate and fast image retrieval methods. By counting the composition rules of a large number of carefully designed poster pictures and introducing the composition rules of aesthetic common sense, we propose a portrait layout recommendation scheme based on two-way rules, which assists users in portrait layout design under the combined effect of two-way rules. The experimental results prove that the scheme designed in this paper can run stably and efficiently, users can realize image synthesis through simple interactive operations, and the final image synthesis effect is real and effective.
[1] |
JOHNSON M, BROSTOW G J, SHOTTON J, et al. Semantic photo synthesis[J]. Computer Graphics Forum, 2006, 25(3): 407-413. doi: 10.1111/j.1467-8659.2006.00960.x
|
[2] |
CHEN T, CHENG M M, TAN P, et al. Sketch2Photo[J]. ACM Transactions on Graphics, 2009, 28(5): 1-10.
|
[3] |
NIBLACK C W, BARBER R, EQUITZ W, et al. QBIC project: Querying images by content, using color, texture, and shape[C]//Proceedings SPIE 1908, Storage and Retrieval for Image and Video Databases, 1993, 1908: 173-187.
|
[4] |
PENTLAND A, PICARD R W, SCLAROFF S. Photobook: Content-based manipulation of image databases[J]. International Journal of Computer Vision, 1996, 18(3): 233-254. doi: 10.1007/BF00123143
|
[5] |
AGARWALA A, DONTCHEVA M, AGRAWALA M, et al. Interactive digital photomontage[C]//ACM SIGGRAPH 2004. New York: ACM, 2004: 294-302.
|
[6] |
BEZDEK J C. Modified objective function algorithms[M]. Berlin: Springer, 1981: 155-201.
|
[7] |
BOYKOV Y Y, JOLLY M P. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images[C]//Proceedings 8th IEEE International Conference on Computer Vision. Piscataway: IEEE Press, 2001: 105-112.
|
[8] |
ROTHER C, KOLMOGOROV V, BLAKE A. GrabCut: Interactive foreground extraction using iterated graph cnts[J]. ACM Transactions on Graphics, 2004, 23(3): 309-314. doi: 10.1145/1015706.1015720
|
[9] |
FELZENSZWALB P F, HUTTENLOCHER D P. Efficient graph-based image segmentation[J]. International Journal of Computer Vision, 2004, 59(2): 167-181. doi: 10.1023/B:VISI.0000022288.19776.77
|
[10] |
WANG J, COHEN M F. Simultaneous matting and compositing[C]//2007 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2007: 1-8.
|
[11] |
FATTAL R, LISCHINSKI D, WERMAN M. Gradient domain high dynamic range compression[C]//ACM SIGGRAPH 2002. New York: ACM, 2002: 249-256.
|
[12] |
PÉREZ P, GANGNET M, BLAKE A. Poisson image editing[C]//ACM SIGGRAPH 2003. New York: ACM, 2003: 313-318.
|
[13] |
INABA S, KANEZAKI A, HARADA T. Automatic image synthesis from keywords using scene context[C]//Proceedings of the 22nd ACM International Conference on Multimedia. New York: ACM, 2014: 1149-1152.
|
[14] |
BHATTACHARYA S, SUKTHANKAR R, SHAH M. A framework for photo-quality assessment and enhancement based on visual aesthetics[C]//Proceedings of the 18th ACM International Conference on Multimedia, New York: ACM, 2010: 271-280.
|
[15] |
ZHANG Y H, SUN X S, YAO H X, et al. Aesthetic composition represetation for portrait photographing recommendation[C]//201219th IEEE International Conference on Image Processing. Piscataway: IEEE Press, 2012: 2753-2756.
|
[16] |
WANG Y T, SONG M L, TAO D C, et al. Where2Stand[J]. ACM Transactions on Intelligent Systems and Technology, 2015, 7(1): 1-22.
|
[17] |
RAWAT Y S, SONG M L, KANKANHALLI M S. A spring-electric graph model for socialized group photography[J]. IEEE Transactions on Multimedia, 2018, 20(3): 754-766. doi: 10.1109/TMM.2017.2750420
|
[18] |
YANG Z, LEI J J, FAN K F, et al. Keyword extraction by entropy difference between the intrinsic and extrinsic mode[J]. Physica A: Statistical Mechanics and Its Applications, 2013, 392(19): 4523-4531. doi: 10.1016/j.physa.2013.05.052
|
[19] |
HOU Q B, CHENG M M, HU X W, et al. Deeply supervised salient object detection with short connections[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2017: 5300-5309.
|
[20] |
CHENG M M, MITRA N J, HUANG X, et al. Global contrast based salient region dection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 37(3): 569-582.
|
[21] |
RAO A, SRIHARI R K, ZHANG Z. Spatial color histograms for content-based image retrieval[C]//Proceedings 11th International Conference on Tools with Artificial Intelligence. Piscataway: IEEE Press, 1999: 183-186.
|
[22] |
童振兴. 基于内容的图像检索技术综述与展望[J]. 计算机光盘软件与应用, 2010, 13(6): 88.
TONG Z X. Content-based image retrieval technologies summary and prospects[J]. Computer CD Software and Applications, 2010, 13(6): 88(in Chinese).
|
[23] |
DALAL N, TRIGGS B. Histograms of oriented gradients for human detection[C]//2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2005: 886-893.
|
[24] |
XIE Y, LIU L F, LI C H, et al. Unifying visual saliency with HOG feature learning for traffic sign detection[C]//2009 IEEE Intelligent Vehicles Symposium. Piscataway: IEEE Press, 2009: 24-29.
|
[25] |
TERASAWA K, TANAKA Y. Slit style HOG feature for document image word spotting[C]//200910th International Conference on Document Analysis and Recognition. Piscataway: IEEE Press, 2009: 116-120.
|
[26] |
WEI S H, RAMAKRISHNA V, KANADE T, et al. Convolutional pose machines[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2016: 4724-4732.
|
[27] |
SIMON T, JOO H, MATTHEWS I, et al. Hand keypoint detection in single images using multiview bootstrapping[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2017: 4645-4653.
|
[28] |
CAO Z, SIMON T, WEI S H, et al. Realtime multi-person 2D pose estimation using part affinity fields[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2017: 1302-1310.
|
[29] |
CHENG Y Z. Mean shift, mode seeking, and clustering[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1995, 17(8): 790-799. doi: 10.1109/34.400568
|
[30] |
KANUNGO T, MOUNT D M, NETANYAHU N S, et al. An efficient K-means clustering algorithm: Analysis and implementation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(7): 881-892. doi: 10.1109/TPAMI.2002.1017616
|
[31] |
SCHWARZ G. Estimating the dimension of a model[J]. The Annals of Statistics, 1978, 6(2): 461-464.
|
[32] |
DEMPSTER A P, LAIRD N M, RUBIN D B. Maximum likelihood from incomplete data via the EM algorithm[J]. Journal of the Royal Statistical Society: Series B (Methodological), 1977, 39(1): 1-22. doi: 10.1111/j.2517-6161.1977.tb01600.x
|
[33] |
ZUCKER M. Monte Zucker's portrait photography handbook[M]. [S. l. ]: Amherst Media, 2007.
|
[34] |
BALLARD D H. Generalizing the Hough transform to detect arbitrary shapes[J]. Pattern Recognition, 1981, 13(2): 111-122. doi: 10.1016/0031-3203(81)90009-1
|
[35] |
TOLDO R, FUSIELLO A. Robust multiple structures estimation with J-linkage[C]//European Conference on Computer Vision. Berlin: Springer, 2008: 537-547.
|