当前位置: 首页 > news >正文

人事代理网站建设专业的建设网站服务

人事代理网站建设,专业的建设网站服务,西瓜网站建设,海外网站营销一、技术背景与目标 Wiki系统导出的文档通常以HTML格式存在,且内容分散在多个文件中,每个页面对应一个HTML文件。然而,Microsoft Word(Word)在处理HTML文件时,仅支持单个HTML文件的导入。因此,为了将Wiki导出的内容转换为Word可识别的格式,必须将分散的HTML文件整合为一…

一、技术背景与目标

Wiki系统导出的文档通常以HTML格式存在,且内容分散在多个文件中,每个页面对应一个HTML文件。然而,Microsoft Word(Word)在处理HTML文件时,仅支持单个HTML文件的导入。因此,为了将Wiki导出的内容转换为Word可识别的格式,必须将分散的HTML文件整合为一个单一的HTML文件。这一过程涉及HTML文件的解析、内容提取、结构重组以及样式调整等多个技术要点。

二、代码逻辑与技术要点解析

(一)WikiToHtml:生成单个HTML文件

WikiToHtml类的核心功能是将分散的Wiki HTML文件整合为一个单一的HTML文件,同时生成导航结构以便在Word中浏览。
1. 清理与初始化
在处理HTML文件之前,首先需要清理目标目录,移除旧文件以避免冲突。clearn方法通过检查文件是否存在并删除它们来实现这一功能:

<span style="color:#060607"><span style="background-color:#ffffff"><span style="background-color:#fafafa"><span style="color:#383a42"><code class="language-java"><span style="color:#a626a4">public</span> <span style="color:#a626a4">static</span> <span style="color:#a626a4">void</span> <span style="color:#4078f2">clearn</span><span style="color:#383a42">(</span><span style="color:#383a42">)</span> <span style="color:#383a42">{</span><span style="color:#b76b01">File</span> filehhc <span style="color:#4078f2">=</span> <span style="color:#a626a4">new</span> <span style="color:#b76b01">File</span><span style="color:#383a42">(</span>hhcurl<span style="color:#383a42">)</span><span style="color:#383a42">;</span><span style="color:#a626a4">if</span> <span style="color:#383a42">(</span>filehhc<span style="color:#383a42">.</span><span style="color:#4078f2">exists</span><span style="color:#383a42">(</span><span style="color:#383a42">)</span><span style="color:#383a42">)</span> <span style="color:#383a42">{</span>filehhc<span style="color:#383a42">.</span><span style="color:#4078f2">delete</span><span style="color:#383a42">(</span><span style="color:#383a42">)</span><span style="color:#383a42">;</span><span style="color:#383a42">}</span><em><span style="color:#a0a1a7">// 其他文件的清理逻辑...</span></em>
<span style="color:#383a42">}</span></code></span></span></span></span>
此逻辑确保每次运行程序时,目标目录都是干净的,避免旧文件干扰。
2. 文件遍历与路径收集
getFile方法递归遍历指定目录,收集所有HTML文件的路径,并将这些路径存储到一个StringBuffer中:

<span style="color:#060607"><span style="background-color:#ffffff"><span style="background-color:#fafafa"><span style="color:#383a42"><code class="language-java"><span style="color:#a626a4">public</span> <span style="color:#a626a4">static</span> <span style="color:#a626a4">void</span> <span style="color:#4078f2">getFile</span><span style="color:#383a42">(</span><span style="color:#b76b01">File</span> file<span style="color:#383a42">,</span> <span style="color:#b76b01">StringBuffer</span> str<span style="color:#383a42">)</span> <span style="color:#383a42">{</span><span style="color:#b76b01">File</span><span style="color:#383a42">[</span><span style="color:#383a42">]</span> fileitem <span style="color:#4078f2">=</span> file<span style="color:#383a42">.</span><span style="color:#4078f2">listFiles</span><span style="color:#383a42">(</span><span style="color:#383a42">)</span><span style="color:#383a42">;</span><span style="color:#a626a4">for</span> <span style="color:#383a42">(</span><span style="color:#a626a4">int</span> i <span style="color:#4078f2">=</span> <span style="color:#b76b01">0</span><span style="color:#383a42">;</span> i <span style="color:#4078f2"><</span> fileitem<span style="color:#383a42">.</span>length<span style="color:#383a42">;</span> i<span style="color:#4078f2">++</span><span style="color:#383a42">)</span> <span style="color:#383a42">{</span><span style="color:#a626a4">if</span> <span style="color:#383a42">(</span>fileitem<span style="color:#383a42">[</span>i<span style="color:#383a42">]</span><span style="color:#383a42">.</span><span style="color:#4078f2">isDirectory</span><span style="color:#383a42">(</span><span style="color:#383a42">)</span><span style="color:#383a42">)</span> <span style="color:#383a42">{</span><span style="color:#4078f2">getFile</span><span style="color:#383a42">(</span>fileitem<span style="color:#383a42">[</span>i<span style="color:#383a42">]</span><span style="color:#383a42">,</span> str<span style="color:#383a42">)</span><span style="color:#383a42">;</span><span style="color:#383a42">}</span> <span style="color:#a626a4">else</span> <span style="color:#383a42">{</span><span style="color:#a626a4">try</span> <span style="color:#383a42">{</span>str<span style="color:#383a42">.</span><span style="color:#4078f2">append</span><span style="color:#383a42">(</span>fileitem<span style="color:#383a42">[</span>i<span style="color:#383a42">]</span><span style="color:#383a42">.</span><span style="color:#4078f2">getCanonicalPath</span><span style="color:#383a42">(</span><span style="color:#383a42">)</span> <span style="color:#4078f2">+</span> <span style="color:#50a14f">"\r\n"</span><span style="color:#383a42">)</span><span style="color:#383a42">;</span><span style="color:#383a42">}</span> <span style="color:#a626a4">catch</span> <span style="color:#383a42">(</span><span style="color:#b76b01">IOException</span> e<span style="color:#383a42">)</span> <span style="color:#383a42">{</span>e<span style="color:#383a42">.</span><span style="color:#4078f2">printStackTrace</span><span style="color:#383a42">(</span><span style="color:#383a42">)</span><span style="color:#383a42">;</span><span style="color:#a626a4">if</span> <span style="color:#383a42">(</span>log <span style="color:#4078f2">!=</span> <span style="color:#a626a4">null</span><span style="color:#383a42">)</span> <span style="color:#383a42">{</span>log<span style="color:#383a42">.</span><span style="color:#4078f2">addLog</span><span style="color:#383a42">(</span>e<span style="color:#383a42">.</span><span style="color:#4078f2">getMessage</span><span style="color:#383a42">(</span><span style="color:#383a42">)</span><span style="color:#383a42">)</span><span style="color:#383a42">;</span><span style="color:#383a42">}</span><span style="color:#383a42">}</span><span style="color:#383a42">}</span><span style="color:#383a42">}</span>
<span style="color:#383a42">}</span></code></span></span></span></span>
此方法通过递归遍历目录,确保所有HTML文件的路径都被收集,为后续的文件处理提供基础。
3. HTML内容整合
modifyHtml方法负责处理每个HTML文件,移除不需要的元素(如页眉、页脚、附件等),并调整样式以适应Word的显示需求:

<span style="color:#060607"><span style="background-color:#ffffff"><span style="background-color:#fafafa"><span style="color:#383a42"><code class="language-java"><span style="color:#a626a4">public</span> <span style="color:#a626a4">static</span> <span style="color:#a626a4">void</span> <span style="color:#4078f2">modifyHtml</span><span style="color:#383a42">(</span><span style="color:#383a42">)</span> <span style="color:#383a42">{</span><span style="color:#b76b01">File</span> homeFile <span style="color:#4078f2">=</span> <span style="color:#a626a4">new</span> <span style="color:#b76b01">File</span><span style="color:#383a42">(</span>home<span style="color:#383a42">)</span><span style="color:#383a42">;</span><span style="color:#b76b01">File</span><span style="color:#383a42">[</span><span style="color:#383a42">]</span> htmlFile <span style="color:#4078f2">=</span> homeFile<span style="color:#383a42">.</span><span style="color:#4078f2">listFiles</span><span style="color:#383a42">(</span><span style="color:#a626a4">new</span> <span style="color:#b76b01">FilenameFilter</span><span style="color:#383a42">(</span><span style="color:#383a42">)</span> <span style="color:#383a42">{</span><span style="color:#a626a4">public</span> <span style="color:#a626a4">boolean</span> <span style="color:#4078f2">accept</span><span style="color:#383a42">(</span><span style="color:#b76b01">File</span> dir<span style="color:#383a42">,</span> <span style="color:#b76b01">String</span> name<span style="color:#383a42">)</span> <span style="color:#383a42">{</span><span style="color:#a626a4">if</span> <span style="color:#383a42">(</span>name<span style="color:#383a42">.</span><span style="color:#4078f2">lastIndexOf</span><span style="color:#383a42">(</span><span style="color:#50a14f">".html"</span><span style="color:#383a42">)</span> <span style="color:#4078f2">!=</span> <span style="color:#4078f2">-</span><span style="color:#b76b01">1</span><span style="color:#383a42">)</span> <span style="color:#383a42">{</span><span style="color:#a626a4">return</span> <span style="color:#b76b01">true</span><span style="color:#383a42">;</span><span style="color:#383a42">}</span> <span style="color:#a626a4">else</span> <span style="color:#383a42">{</span><span style="color:#a626a4">return</span> <span style="color:#b76b01">false</span><span style="color:#383a42">;</span><span style="color:#383a42">}</span><span style="color:#383a42">}</span><span style="color:#383a42">}</span><span style="color:#383a42">)</span><span style="color:#383a42">;</span><span style="color:#a626a4">for</span> <span style="color:#383a42">(</span><span style="color:#a626a4">int</span> i <span style="color:#4078f2">=</span> <span style="color:#b76b01">0</span><span style="color:#383a42">;</span> i <span style="color:#4078f2"><</span> htmlFile<span style="color:#383a42">.</span>length<span style="color:#383a42">;</span> i<span style="color:#4078f2">++</span><span style="color:#383a42">)</span> <span style="color:#383a42">{</span><span style="color:#b76b01">File</span> html <span style="color:#4078f2">=</span> htmlFile<span style="color:#383a42">[</span>i<span style="color:#383a42">]</span><span style="color:#383a42">;</span><span style="color:#b76b01">Document</span> doc <span style="color:#4078f2">=</span> <span style="color:#b76b01">Jsoup</span><span style="color:#383a42">.</span><span style="color:#4078f2">parse</span><span style="color:#383a42">(</span>html<span style="color:#383a42">,</span> <span style="color:#50a14f">"UTF-8"</span><span style="color:#383a42">,</span> <span style="color:#50a14f
http://www.yayakq.cn/news/685541/

相关文章:

  • 怎样创网站分类目录网站有哪些
  • 山东网站建设运行工资网站开发+自动生成缩略图
  • 微信群投票网站怎么做的优惠的网站建设
  • 青岛网站建设和优化用软件做网站
  • 网站建设的入门书籍招聘网站做两份简历
  • 做网站可以用python么全国企业信用公示系统查询
  • 重庆网站推广专家网站底部备案号悬挂
  • 网站例子大全网络营销建议
  • 网站对于企业的好处网站被k多久恢复
  • 网站做聚合是啥意思dw8网页设计教程
  • 制作什么网站做毕业设计it培训机构学费
  • 嘉瑞建设有限公司网站什么企业时候做网站
  • 网站制作价格怎么算wordpress繁简体
  • 网上做设计网站刚发布的手机
  • 中国建设银行公司网站官网初中做网站软件
  • 山东省建设厅官方网站怎么样手机端下载
  • wordpress快站楼盘设计师工资一般多少
  • 网站后台登录系统是怎么做的网站服务器排名前十
  • 互动网站设计与制作网站商城具有哪些功能模块
  • 农业网站电子商务平台建设方案厦门专业网站制作
  • 网站建设问题清单网站建设工作情况
  • 安顺建设工程造价管理网站游戏app软件开发公司
  • 网站开发遇到过哪些技术难点做互助盘网站找哪家好
  • 蒲公英网站建设宁波网站设计服务收费价格
  • 深圳企业建网站公司seo每天一贴博客
  • 深鑫辉网站建设提交收录网站
  • 襄州区住房和城乡建设局网站网站后台管理增加功能
  • 自动化系统网站建设微商城和小程序区别
  • 网站下拉菜单重叠wordpress美容养生
  • 食品包装设计说明seo论坛