Python - Python SimpleHTTPServerWithUpload Reference Notes

References

Design reference:

Source of the reference code:

Adding an upload feature to SimpleHTTPServer

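Background

As the title says, I wrote these reference notes mainly because I had been practicing with the Python Django web framework and started wondering whether a simple local web file server could be built with plain Python alone. After searching Google for related material, I really did find a module that does this, so once I had tested it I wrote the steps down XD.

This post is a record so that I can look the steps up again quickly if I ever need them. I have not had much occasion to use it since, but it was still a worthwhile experience.

Introduction

Use Python's SimpleHTTPServer module (http.server in Python 3, which the code below uses) to set up a simple, temporary web server with file upload support. Uploaded files appear in the listing on the page.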
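For comparison, the standard library alone already gives a read-only file server; the script in this post adds the upload part on top of that idea. The following is only a minimal sketch of the read-only baseline. The helper name `make_file_server` and its defaults are made up for illustration, and it assumes Python 3.7 or later (for `ThreadingHTTPServer` and the `directory` argument).

```python
import functools
import http.server


def make_file_server(directory=".", host="127.0.0.1", port=8000):
    """Build (but do not start) a read-only file server for `directory`.

    Hypothetical helper: binding to 127.0.0.1 keeps the server reachable
    from this machine only; port 0 lets the OS pick a free port.
    """
    handler = functools.partial(
        http.server.SimpleHTTPRequestHandler, directory=directory
    )
    return http.server.ThreadingHTTPServer((host, port), handler)
```

To run it, call `make_file_server().serve_forever()` and stop it with Ctrl+C. There is no upload form here; that is what the full script adds.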

Usage

Run the program with the following command:

py fileuploadtest.py

As shown in the figure below:

Open http://localhost:8000/ or http://127.0.0.1:8000/ to view the page, as shown in the figure below:
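If no browser is at hand, the same check can be done from Python. This is a small sketch using only the standard library; `fetch_page` is a hypothetical helper name, and the host and port defaults simply mirror the URLs above.

```python
import http.client


def fetch_page(host="127.0.0.1", port=8000, path="/"):
    """GET `path` from the server and return (status, body_text)."""
    conn = http.client.HTTPConnection(host, port, timeout=5)
    try:
        conn.request("GET", path)
        response = conn.getresponse()
        body = response.read().decode("utf-8", errors="replace")
        return response.status, body
    finally:
        conn.close()
```

With the server running, `fetch_page()` should return status 200 together with the HTML of the directory listing.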

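You can then see the files in the folder and upload other files. Uploads are saved to the directory being browsed; from the root page, that is the folder the server was started in (here, the one containing fileuploadtest.py).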
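Uploads can also be sent without the web form. The sketch below builds a multipart/form-data body by hand and posts it with `http.client`. The names `encode_multipart` and `upload` are hypothetical. The field name `file` and the single `Content-Type` line per part are chosen to match what the script's form and parser expect, and the boundary must not contain `=`.

```python
import http.client
import uuid


def encode_multipart(files, boundary=None):
    """Encode [(filename, content_bytes), ...] as multipart/form-data.

    Returns (body_bytes, content_type_header_value).
    """
    boundary = boundary or uuid.uuid4().hex
    delimiter = b"--" + boundary.encode("ascii")
    chunks = []
    for filename, content in files:
        disposition = 'Content-Disposition: form-data; name="file"; filename="%s"\r\n' % filename
        chunks.append(delimiter + b"\r\n")
        chunks.append(disposition.encode("utf-8"))
        chunks.append(b"Content-Type: application/octet-stream\r\n")
        chunks.append(b"\r\n")            # blank line ends the part headers
        chunks.append(content + b"\r\n")  # this CRLF belongs to the delimiter
    chunks.append(delimiter + b"--\r\n")  # closing boundary
    return b"".join(chunks), "multipart/form-data; boundary=" + boundary


def upload(host, port, files, path="/"):
    """POST the files to the server and return (status, response_text)."""
    body, content_type = encode_multipart(files)
    conn = http.client.HTTPConnection(host, port, timeout=10)
    try:
        conn.request("POST", path, body=body, headers={"Content-Type": content_type})
        response = conn.getresponse()
        return response.status, response.read().decode("utf-8", errors="replace")
    finally:
        conn.close()
```

For example, `upload("127.0.0.1", 8000, [("notes.txt", b"hello")])` would send one small file to the root directory of a running server.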

Full source code

A suggested file name is fileuploadtest.py; any other name works too, e.g. XXX.py.

The full source code is as follows:

#!/usr/bin/env python3
# -*- coding: utf-8 -*-
"""Simple HTTP Server With Upload based on python3.
Python2-Version by [bones7456] and [BUPTGuo]:
	This module builds on BaseHTTPServer by implementing the standard GET and HEAD requests in a fairly straightforward manner.
Python3-Version by [FrozenMap]:
	Based on the new features in python3, this module is built on [http.server] by rewriting some implementations of do_GET, do_HEAD and do_POST and other functions in Python2-Version.
20181029 Update:
With the help of [a.7], we can now upload multiple files together, rather than one file at a time
More details can be found on the blog with the link below:
https://jjayyyyyyy.github.io/2016/10/07/reWrite_SimpleHTTPServerWithUpload_with_python3.html
"""

__version__ = "0.4"
__all__ = ["SimpleHTTPRequestHandler"]
__author__ = "bones7456, BUPTGuo, FrozenMap, a.7"
__home_page__ = "http://luy.li/, http://buptguo.com, https://jjayyyyyyy.github.io"


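import os
import posixpath
import http.server
import urllib.parse
import html
import shutil
import mimetypes
import re

# cgi.escape() was removed in Python 3.8 and the cgi module in Python 3.13;
# html.escape() replaces it, so alias the old name used further down.
cgi = html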

class SimpleHTTPRequestHandler(http.server.BaseHTTPRequestHandler):

	"""Simple HTTP request handler with GET/HEAD/POST commands.
	This serves files from the current directory and any of its
	subdirectories.  The MIME type for files is determined by
	calling the .guess_type() method. It can also receive files
	uploaded by the client.
	The GET and HEAD requests are identical except that the HEAD
	request omits the actual contents of the file; POST stores
	uploaded files.
	"""

	server_version = "SimpleHTTPWithUpload/" + __version__

	def do_GET(self):
		"""Serve a GET request."""
		f = self.send_head()
		if f:
			self.wfile.write(f)

	def do_HEAD(self):
		"""Serve a HEAD request."""
		f = self.send_head()

	def do_POST(self):
		"""Serve a POST request."""
		r, info = self.deal_post_data()

		print(info)
		print("uploaded by:", self.client_address)
		info = info.replace('\n', '<br>')
		f = ('<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">') +\
			('<html><head>') +\
			('<meta http-equiv="Content-Type" content="text/html; charset=utf-8">') +\
			('<title>Upload Result Page</title>') +\
			('</head><body>') +\
			('<h1>Upload Result Page</h1>') +\
			('<hr>')
		if r:
			f = f + ('<strong>Success:</strong><br/>') + info
		else:
			f = f + ('<strong>Failed:</strong>') + info
		# the Referer header may be absent; fall back to the current path
		f = f + '<br><a href="%s">back</a>' % (self.headers['referer'] or self.path) +\
			'</body></html>'

		f = f.encode('utf-8')
		length = len(f)
		self.send_response(200)
		self.send_header("Content-type", "text/html")
		self.send_header("Content-Length", str(length))
		self.end_headers()
		self.wfile.write(f)


	'''
	POST data
	------WebKitFormBoundaryLVlRNkjiiJLtNYQE
	Content-Disposition: form-data; name="file"; filename="file1.txt"
	Content-Type: text/plain
	content in file1
	hello file1
	------WebKitFormBoundaryLVlRNkjiiJLtNYQE
	Content-Disposition: form-data; name="file"; filename="file2.txt"
	Content-Type: text/plain
	content in file2
	hello file2
	------WebKitFormBoundaryLVlRNkjiiJLtNYQE--
	'''
	def deal_post_data(self):
		boundary = self.headers["Content-Type"].split("=")[1]

		boundary_begin = ('--' + boundary + '\r\n').encode('utf-8')
		boundary_end = ('--' + boundary + '--\r\n').encode('utf-8')

		return_status = True
		return_info = '\n'
		outer = 1
		inner = 2
		leave = 3
		loop_info = outer # 1: outer loop, 2: inner_loop, 3: leave and return

		# first line
		# b'------WebKitFormBoundaryLVlRNkjiiJLtNYQE'
		line = self.rfile.readline()

		while loop_info == outer:
			# print(line)
			if line != boundary_begin:
				return_status = False
				return_info += "Content NOT begin with boundary\n"
				break

			# get filename
			# b'Content-Disposition: form-data; name="file"; filename="file1.txt"'
			line = self.rfile.readline().decode('utf-8').rstrip('\r\n')
			# print(line)
			match = re.search(r'filename="(.*)"', line)
			# keep only the base name so a client cannot write outside the target directory
			filename = os.path.basename(match.group(1)) if match else ''
			# print(filename)
			if not filename:
				return_status = False
				return_info += "Can't find out file name...\n"
				loop_info = leave
				break
			path = self.translate_path(self.path)
			filename = os.path.join(path, filename)
			# if filename already exists
			if os.path.exists(filename):
				filename += "_copy"

			# second line
			# b'Content-Type: text/plain'
			line = self.rfile.readline()
			# print(line)

			# blank line
			line = self.rfile.readline()
			# print(line)

			loop_info = inner
			# POST data
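			try:
				with open(filename, 'wb') as f:
					# Hold each line back by one iteration: the CRLF right before a
					# boundary belongs to the delimiter, not to the uploaded file.
					preline = b''
					while loop_info == inner:
						line = self.rfile.readline()
						# print(line)
						if line == boundary_begin or line == boundary_end:
							if preline.endswith(b'\r\n'):
								preline = preline[:-2]
							f.write(preline)
							# another part follows, or this was the last one
							loop_info = outer if line == boundary_begin else leave
						elif not line:
							# connection closed before the closing boundary arrived
							raise EOFError('unexpected end of POST data')
						else:
							# line is still bytes; readline() keeps the trailing '\n'
							f.write(preline)
							preline = line
			except Exception as e:
				loop_info = leave
				return_status = False
				return_info += 'Exception! %s\n' % e
			return_info += filename + '\n'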
		return (return_status, return_info)

	def send_head(self):
		"""Common code for GET and HEAD commands.
		This sends the response code and MIME headers.
		Return value is either the file's contents as bytes (which the
		caller writes to the output unless the command was HEAD), or
		None, in which case the caller has nothing further to do.
		"""
		path = self.translate_path(self.path)
		f = None
		if os.path.isdir(path):
			if not self.path.endswith('/'):
				# redirect browser - doing basically what apache does
				self.send_response(301)
				self.send_header("Location", self.path + "/")
				self.end_headers()
				return None
			for index in "index.html", "index.htm":
				index = os.path.join(path, index)
				if os.path.exists(index):
					path = index
					break
			else:
				return self.list_directory(path)
		ctype = self.guess_type(path)
		try:
			# Always read in binary mode. Opening files in text mode may cause
			# newline translations, making the actual size of the content
			# transmitted *less* than the content-length!
			f = open(path, 'rb')
		except IOError:
			self.send_error(404, "File not found")
			return None
		self.send_response(200)
		self.send_header("Content-type", ctype)
		fs = os.fstat(f.fileno())
		self.send_header("Content-Length", str(fs[6]))
		self.send_header("Last-Modified", self.date_time_string(fs.st_mtime))
		self.end_headers()
		data = f.read()
		f.close()

		return data

	def list_directory(self, path):
		"""Helper to produce a directory listing (absent index.html).
		Return value is either the encoded HTML page (bytes), or None
		(indicating an error).  In either case, the headers are sent,
		making the interface the same as for send_head().
		"""
		try:
			list = os.listdir(path)
		except os.error:
			self.send_error(404, "No permission to list directory")
			return None
		list.sort(key=lambda a: a.lower())
		displaypath = cgi.escape(urllib.parse.unquote(self.path))

		f = ('<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">') +\
			('<html><head>') +\
			('<meta http-equiv="Content-Type" content="text/html; charset=utf-8">') +\
			('<title>Directory listing for %s</title>' % displaypath) +\
			('</head><body>') +\
			('<h1>Directory listing for %s</h1>' % displaypath) +\
			('<form ENCTYPE="multipart/form-data" method="post">') +\
			('<input name="file" type="file" multiple="multiple"/>') +\
			('<input type="submit" value="upload"/></form>') +\
			('<hr><ul>')

		for name in list:
			fullname = os.path.join(path, name)
			displayname = linkname = name
			# Append / for directories or @ for symbolic links
			if os.path.isdir(fullname):
				displayname = name + "/"
				linkname = name + "/"
			if os.path.islink(fullname):
				displayname = name + "@"
				# Note: a link to a directory displays with @ and links with /
			f = f + ('<li><a href="%s">%s</a>' % (urllib.parse.quote(linkname), cgi.escape(displayname)))
		f = f + ("</ul><hr></body></html>")

		f = f.encode('utf-8')
		length = len(f)
		self.send_response(200)
		self.send_header("Content-type", "text/html")
		self.send_header("Content-Length", str(length))
		self.end_headers()
		return f

	def translate_path(self, path):
		"""Translate a /-separated PATH to the local filename syntax.
		Components that mean special things to the local file system
		(e.g. drive or directory names) are ignored.  (XXX They should
		probably be diagnosed.)
		"""
		# abandon query parameters
		path = path.split('?',1)[0]
		path = path.split('#',1)[0]
		path = posixpath.normpath(urllib.parse.unquote(path))
		words = path.split('/')
		words = filter(None, words)
		path = os.getcwd()
		for word in words:
			drive, word = os.path.splitdrive(word)
			head, word = os.path.split(word)
			if word in (os.curdir, os.pardir): continue
			path = os.path.join(path, word)
		return path

	def guess_type(self, path):
		"""Guess the type of a file.
		Argument is a PATH (a filename).
		Return value is a string of the form type/subtype,
		usable for a MIME Content-type header.
		The default implementation looks the file's extension
		up in the table self.extensions_map, using application/octet-stream
		as a default; however it would be permissible (if
		slow) to look inside the data to make a better guess.
		"""

		base, ext = posixpath.splitext(path)
		if ext in self.extensions_map:
			return self.extensions_map[ext]
		ext = ext.lower()
		if ext in self.extensions_map:
			return self.extensions_map[ext]
		else:
			return self.extensions_map['']

	if not mimetypes.inited:
		mimetypes.init() # try to read system mime.types
	extensions_map = mimetypes.types_map.copy()
	extensions_map.update({
		'': 'application/octet-stream', # Default
		'.py': 'text/plain',
		'.c': 'text/plain',
		'.h': 'text/plain',
		})


def test(HandlerClass = SimpleHTTPRequestHandler,
		 ServerClass = http.server.HTTPServer):
	http.server.test(HandlerClass, ServerClass)

if __name__ == '__main__':
	test()
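
The line-by-line parsing in deal_post_data can be tried out without a running server. What follows is a minimal sketch of the same idea on an in-memory body, not the script's own code. `split_multipart` is a hypothetical helper. It assumes CRLF line endings and a filename on every part, and it drops the CRLF that precedes each boundary so the extracted content matches the original file byte for byte.

```python
import io
import re


def split_multipart(body, boundary):
    """Return [(filename, content_bytes), ...] from a multipart/form-data body."""
    begin = b"--" + boundary.encode("ascii") + b"\r\n"
    end = b"--" + boundary.encode("ascii") + b"--\r\n"
    stream = io.BytesIO(body)
    parts = []
    line = stream.readline()
    while line == begin:
        # First header line of the part carries the filename.
        header = stream.readline().decode("utf-8")
        match = re.search(r'filename="(.*)"', header)
        filename = match.group(1) if match else ""
        # Skip any remaining part headers up to the blank line.
        while stream.readline() not in (b"\r\n", b""):
            pass
        # Collect data lines until the next boundary (or end of input).
        chunks = []
        line = stream.readline()
        while line and line not in (begin, end):
            chunks.append(line)
            line = stream.readline()
        content = b"".join(chunks)
        if content.endswith(b"\r\n"):
            content = content[:-2]  # this CRLF is part of the delimiter
        parts.append((filename, content))
    return parts
```

Like the script, this treats any line equal to the boundary as a delimiter, so it is meant for experimenting with the format rather than as a complete multipart parser.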
