[1] 廖彬,于炯,张陶,等. 基于分布式文件系统HDFS的节能算法[J]. 计算机学报, 2013, 36(5):1047-1064. [2] HVACHKO K, KUANG H, RADIA S, et al.The Hadoop distributed file system[C]// 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies. 2010:1-10. [3] 徐鹏. Hadoop 2.X HDFS源码剖析[M]. 北京:电子工业出版社, 2016. [4] 郑通,郭卫斌,范贵生. HDFS中海量小文件合并与预取优化方法的研究[J]. 计算机科学, 2017,44(S2):516-519. [5] 李文武,张建锋,王景林. 基于EHDFS的海量小文件存储与检索方法[J]. 计算机工程与设计, 2022,43(2):376-383. [6] 顾玉宛,王文闻,孙玉强. 一种面向HDFS中海量小文件的存取优化方法[J].计算机应用研究, 2017,34(8):2319-2323. [7] 李洪奇,朱丽萍,孙国玉,等. 面向海量小文件的分布式存储系统设计与实现[J]. 计算机工程与设计, 2016,37(1):86-92. [8] ASF Infrabot. SequenceFile[EB/OL].(2019-09-07)[2022-04-12]. https://cwiki.apache.org/confluence/display/HADOOP2/SequenceFile. [9] The Apache Software Foundation. MapFile[EB/OL].[2022-04-12]. https://hadoop.Apache.org/docs/r2.6.2/api/org/apache/hadoop/io/MapFile.html. [10] The Apache Software Foundation. Hadoop Archives Guide [EB/OL].[2022-04-12]. https://hadoop.apache.org/docs/stable1/hadoop_archives.html. [11] SHEORAN S, SETHIA D, SARAN H.Optimized MapFile based storage of small files in Hadoop[C] // 2017 IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID), 2017: 906-912. [12] MENG B, GUO W, FAN G, et al.A novel approach for efficient accessing of small files in HDFS: TLB-MapFile[C]// 2016 17th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD). 2016:681-686. [13] VORAPONGKITIPUN C, NUPAIROJ N.Improving performance of small-file accessing in Hadoop[C]// 2014 11th International Joint Conference on Computer Science and Software Engineering (JCSSE). 2014:200-205. [14] TAO W J, ZHAI Y L, TCHAYE-KONDI J.LHF: A new archive based approach to accelerate massive small files access performance in HDFS[C]// 2019 IEEE 5th International Conference on Big Data Computing Service and Applications (BigDataService). 2019:40-48. [15] 郑翠芳. 几种常用无损数据压缩算法研究[J]. 计算机技术与发展, 2011,21(9):73-76. [16] ZIV J, LEMPEL A.A universal algorithm for sequential data compression[J]. IEEE Transactions on Information Theory, 1977,23(5):337-342. [17] LHUILLIER M, QUAN L.Match propagation for image-based modeling and rendering[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002,24(8):1140-1146. [18] RAUSCHERT P, KLIMETS Y, VELTEN J, et al.Very fast GZIP compression by means of content addressable memories[C]// 2004 IEEE Region 10 Conference TENCON 2004. 2004:391-394. [19] GitHub Inc.. Snappy[EB/OL].[2022-04-12].http://google.github.io/snappy/. [20] Seward J. Bzip2[EB/OL].[2022-04-12]. http://www.bzip.org/. [21] LI S H, LUO J R, WU Y CH, et al.Continuous and realtime data acquisition embedded system for EAST[J]. IEEE Transaction on Nuclear Science, 2010,57(2):696-699. [22] 宋秉玺. 高效无损压缩算法的研究与实现[D]. 西安:西安电子科技大学, 2014. [23] 向丽辉,缪力,张大方. 压缩对Hadoop性能影响研究[J]. 计算机工程与科学, 2015, 37(2): 207-212. [24] 王松,房利国, 韩炼冰,刘鸿博. 一种快速解压的无损压缩算法[J]. 通信技术, 2020,53(5):1121-1126. [25] 夏靖波,韦泽鲲,付凯,等. 云计算中Hadoop技术研究与应用综述[J]. 计算机科学, 2016,43(11):6-11. [26] 董新华,李瑞轩,周湾湾,等. Hadoop系统性能优化与功能增强综述[J]. 计算机研究与发展, 2013,50(S2):1-15. [27] ZHAI Y L, TCHAYE-KONDI J, LIN K J, et al.Hadoop perfect file: A fast and memory-efficient metadata access archive file to face small files problem in HDFS[J]. Journal of Parallel and Distributed Computing, 2021,156:119-130. [28] DONG B, ZHENG Q H, TIAN F, et al.An optimized approach for storing and accessing small files on cloud storage[J]. Journal of Network and Computer Applications, 2012,35(6):1847-1862. [29] 王刚. 云平台下HDFS HA的研究与实现[D]. 西安:西北大学, 2013. |