文章 - 30天尝试新事情

win10 vbs无法隐藏执行批处理问题

windows

比如语句：

Set ws = CreateObject("Wscript.Shell") 

ws.run "cmd /c D:\新建\diff.bat",vbhide

WScript.sleep 1000

始终无法执行，莫名其妙。
通过换路径执行，发现是路径不能有中文。。。。
居然windows也有这种中文路径问题。

win10 vbs中不能有中文路径名

继续阅读 »

比如语句：

Set ws = CreateObject("Wscript.Shell") 

ws.run "cmd /c D:\新建\diff.bat",vbhide

WScript.sleep 1000

始终无法执行，莫名其妙。
通过换路径执行，发现是路径不能有中文。。。。
居然windows也有这种中文路径问题。

win10 vbs中不能有中文路径名收起阅读 »

redis desktop manager windows客户端

数据库

原来用的QT写的。
怪不得这么垃圾。。。。。

pyqt5 QRect在哪个类

pyqt

最新的版本是在 QtCore里面的

from PyQt5.QtCore import Qt,QRect

最新的版本是在 QtCore里面的

from PyQt5.QtCore import Qt,QRect

分享一个电影网站 insiji.com

闲聊

https://insiji.com/
貌似电影比较新，也比较清晰。目前良心的站点不多了。

在线看挺流畅的，不过不登录没得下载。只好抓包，把ts文件下载后再合并。

2020-03-27 新事情坚持

30天尝试新事情

持续一周的出差，学到的一些东西。

接下来的30天，要养成的习惯：

1. 早睡早起
2. 持续比例的输出。（有输入就要有输出）
3. 锻炼身体
4. 先专注一件事情，让他变为习惯后再攻克另外一件事情。

忘记的QQ密码

闲聊

曾经用了十多年的QQ忘记了密码，只有选择找回，找回密码问题是我的梦想是什么，过了这么多年，完全忘了当时是怎么回答的，就开始挨个慢慢试，房子，钱，车，工作，都不对，直到那时才意识到，弄丢的并QQ密码，而是梦想

windows下logstash6.5安装配置网上的教程是linux，有坑

Logstash

https://www.cnblogs.com/jstarseven/p/7704893.html
百度里搜到的logstash安装配置教程千篇一律都是这样子。

启动服务测试一下是否安装成功：

cd bin

 ./logstash -e 'input { stdin { } } output { stdout {} }'

上面的是linux运行的。
windows下把logstash改为logstash.bat 然后运行。

报错：

ERROR: Unknown command '{'



See: 'bin/logstash --help'

[ERROR] 2020-02-28 10:04:29.307 [main] Logstash - java.lang.IllegalStateException: Logstash stopped processing because of an error: (SystemExit) exit

后来搜了下国外的网站。原来问题出在那个单引号上，把单引号改为双引号就可以了。

继续阅读 »

https://www.cnblogs.com/jstarseven/p/7704893.html
百度里搜到的logstash安装配置教程千篇一律都是这样子。

启动服务测试一下是否安装成功：

cd bin

 ./logstash -e 'input { stdin { } } output { stdout {} }'

上面的是linux运行的。
windows下把logstash改为logstash.bat 然后运行。

报错：

ERROR: Unknown command '{'



See: 'bin/logstash --help'

[ERROR] 2020-02-28 10:04:29.307 [main] Logstash - java.lang.IllegalStateException: Logstash stopped processing because of an error: (SystemExit) exit

后来搜了下国外的网站。原来问题出在那个单引号上，把单引号改为双引号就可以了。
收起阅读 »

薅“疫情公益”羊毛，黑产恶意爬取各大出版社电子书上万册

爬虫 python 网络安全

疫情以来，所有企业都上班延期选择在线复工，在我们居家自我隔离期间，极验观察爬虫却没有消停，反而爬虫行为更加活跃且更胜往常。本周五，我们和无糖信息一起聊聊线上爬虫的“疫情”。

个人的知识星球

数据分析套利

最近开通了知识星球，尝试在里面分享一些量化投资套利的学习教程。在里面你可以对不懂的问题进行提问，球主会一一的耐心解答。

微信扫一扫加入我的知识星球

星球的第一篇文章
python获取全市场LOF基金折溢价数据并进行套利

市场是总共的LOF基金有301只（上图右下角的圈圈是所有基金的条数），而集思录上只有120只左右，所以有些溢价厉害（大于10%）的LOF基金并没有在集思录的网站上显示，这对于专注于套利的投资者来说，会损失很多潜在的套利机会。

点击查看大图

我回复了该贴后，有大量的人私信我，问我能否提供一份这个数据，或者教对方如何获取这些数据。因为人数众多，也没有那么多精力来一一回答。毕竟不同人的水平背景不一样，逐个回答起来也很累，所以就回答了几个朋友的问题后就一一婉拒了。

然后在几个投资群里，居然也有人提到这个数据，在咨询如何才能获取到这个完整的数据，并且可以实时更新显示。因为我的微信群昵称和集思录是一样的，所以不少人@我，我也都简单的回复了下，是使用python抓取的数据，数据保存到Mysql和MongoDB。代码行数不多，100行都不到。

具体实现在星球会有完整代码。

继续阅读 »

最近开通了知识星球，尝试在里面分享一些量化投资套利的学习教程。在里面你可以对不懂的问题进行提问，球主会一一的耐心解答。

微信扫一扫加入我的知识星球

星球的第一篇文章
python获取全市场LOF基金折溢价数据并进行套利

市场是总共的LOF基金有301只（上图右下角的圈圈是所有基金的条数），而集思录上只有120只左右，所以有些溢价厉害（大于10%）的LOF基金并没有在集思录的网站上显示，这对于专注于套利的投资者来说，会损失很多潜在的套利机会。

点击查看大图

我回复了该贴后，有大量的人私信我，问我能否提供一份这个数据，或者教对方如何获取这些数据。因为人数众多，也没有那么多精力来一一回答。毕竟不同人的水平背景不一样，逐个回答起来也很累，所以就回答了几个朋友的问题后就一一婉拒了。

然后在几个投资群里，居然也有人提到这个数据，在咨询如何才能获取到这个完整的数据，并且可以实时更新显示。因为我的微信群昵称和集思录是一样的，所以不少人@我，我也都简单的回复了下，是使用python抓取的数据，数据保存到Mysql和MongoDB。代码行数不多，100行都不到。

具体实现在星球会有完整代码。收起阅读 »

希望能够找到一个可以一起对彪的人

闲聊

看了老男孩的某个视频教程，觉得这个挺有道理的。
目前貌似身边缺少一个这样的人，希望在这个网站上能够遇到一个这样的朋友。
有意的可以私聊吧。非诚勿扰。

证券etf和券商etf的区别

量化交易股票

证券etf是：国泰中证全指证券公司ETF（512880）LOF/ETF

而券商etf是：
华宝中证全指证券公司ETF （512000）LOF/ETF

二者都是指数/lof基金，而且持仓差不多都一样的。

证券etf的持仓：

券商etf的持仓：

不同的是规模，证券ETF的规模要比券商etf的规模要大得多。目前是2倍左右的差距【2019年的数据】。所以如果你关心的是流动性，那么可以买证券ETF。

最新的规模【2020-06】其实二者在缩小，具体数据见文末。

证券ETF的规模是190亿

而券商ETF规模为160亿。(最近增长比较多，笔者在1年前记录的数据为30亿，可以到http://fundf10.eastmoney.com/gmbd_512000.html 查看历史规模。现在券商ETF的规模翻了一番。

更多量化分析，关注公众号：可转债量化分析
券商开户股票万一免5，基金申购一折，拖拉机6+1，关注公众号留言。
可转债手续费百万分之二，一万块手续费才2分钱，没有最低限制（没有最低收1元，1毛这种）

（教你使用python进行量化分析股票，可转债数据）

继续阅读 »

证券etf是：国泰中证全指证券公司ETF（512880）LOF/ETF

而券商etf是：
华宝中证全指证券公司ETF （512000）LOF/ETF

二者都是指数/lof基金，而且持仓差不多都一样的。

证券etf的持仓：

券商etf的持仓：

不同的是规模，证券ETF的规模要比券商etf的规模要大得多。目前是2倍左右的差距【2019年的数据】。所以如果你关心的是流动性，那么可以买证券ETF。

最新的规模【2020-06】其实二者在缩小，具体数据见文末。

证券ETF的规模是190亿

而券商ETF规模为160亿。(最近增长比较多，笔者在1年前记录的数据为30亿，可以到http://fundf10.eastmoney.com/gmbd_512000.html 查看历史规模。现在券商ETF的规模翻了一番。

更多量化分析，关注公众号：可转债量化分析
券商开户股票万一免5，基金申购一折，拖拉机6+1，关注公众号留言。
可转债手续费百万分之二，一万块手续费才2分钱，没有最低限制（没有最低收1元，1毛这种）

（教你使用python进行量化分析股票，可转债数据）

收起阅读 »

socketio中client的sio wait用法

socketio

用于阻塞当前的线程，后面的操作不会进行，直到服务端断开。

import time

import socketio



sio = socketio.Client()

start_timer = None





def send_ping():

    global start_timer

    start_timer = time.time()

    sio.emit('ping_from_client')





@sio.event

def connect():

    print('connected to server')

    send_ping()





@sio.event

def pong_from_server(data):

    global start_timer

    latency = time.time() - start_timer

    print('latency is {0:.2f} ms'.format(latency * 1000))

    sio.sleep(1)

    send_ping()





if __name__ == '__main__':

    sio.connect('http://localhost:5000')

    sio.wait()

    print('next')

比如上述代码中，如果调用了sio.wait() , 那么next是不会被打印的。

如果注释掉后，那么next就可以正常被打印。

继续阅读 »

用于阻塞当前的线程，后面的操作不会进行，直到服务端断开。

import time

import socketio



sio = socketio.Client()

start_timer = None





def send_ping():

    global start_timer

    start_timer = time.time()

    sio.emit('ping_from_client')





@sio.event

def connect():

    print('connected to server')

    send_ping()





@sio.event

def pong_from_server(data):

    global start_timer

    latency = time.time() - start_timer

    print('latency is {0:.2f} ms'.format(latency * 1000))

    sio.sleep(1)

    send_ping()





if __name__ == '__main__':

    sio.connect('http://localhost:5000')

    sio.wait()

    print('next')

比如上述代码中，如果调用了sio.wait() , 那么next是不会被打印的。

如果注释掉后，那么next就可以正常被打印。收起阅读 »

dataframe 根据日期重采样计算个数

数据分析

按照日期重新采样，计算每天的个数new_df = df.resample('D').count()

失落

闲聊

记住今天，铭记。

bandcamp移动开发更简单

量化分析

继续阅读 »

mongodb日期条件查找

有时候用new Date() 不管用

多线程调用yolo模型会出错（包括使用cv2载入yolo模型）

深度学习 yolo

占坑。

多线程调用yolo模型会出错（包括使用cv2载入yolo模型）

keras yolo物体检测入门教程

yolo 深度学习

占坑

RuntimeError: `get_session` is not available when using TensorFlow 2.0.

深度学习

这个问题是TensorFlow版本问题，在2.0以上get_session是被移除了。需要做一些修改，或者把tf降级。可以安装1.15版本
pip install tensorflow==1.15 --upgrade

Here, we will see how we can upgrade our code to work with tensorflow 2.0.



This error is usually faced when we are loading pre-trained model with tensorflow session/graph or we are building flask api over a pre-trained model and loading model in tensorflow graph to avoid collision of sessions while application is getting multiple requests at once or say in case of multi-threadinng



After tensorflow 2.0 upgrade, i also started facing above error in one of my project when i had built api of pre-trained model with flask. So i looked around in tensorflow 2.0 documents to find a workaround, to avoid this runtime error and upgrade my code to work with tensorflow 2.0 as well rather than downgrading it to tensorflow 1.x .



I had a project on which i had written tutorial as well on how to build Flask api on trained keras model of text classification and then use it in production



But this project was not working after tensorflow upgrade and was facing runtime error.



Stacktrace of error was something like below:



 File "/Users/Upasana/Documents/playground/deploy-keras-model-in-production/src/main.py", line 37, in model_predict

    with backend.get_session().graph.as_default() as g:

  File "/Users/Upasana/Documents/playground/deploy-keras-model-in-production/venv-tf2/lib/python3.6/site-packages/keras/backend/tensorflow_backend.py", line 379, in get_session

    '`get_session` is not available '

RuntimeError: `get_session` is not available when using TensorFlow 2.0.

Related code to get model

with backend.get_session().graph.as_default() as g:

        model = SentimentService.get_model1()

Related code to load model

def load_deep_model(self, model):

        json_file = open('./src/mood-saved-models/' + model + '.json', 'r')

        loaded_model_json = json_file.read()

        loaded_model = model_from_json(loaded_model_json)



        loaded_model.load_weights("./src/mood-saved-models/" + model + ".h5")



        loaded_model._make_predict_function()

        return loaded_model

get_session is removed in tensorflow 2.0 and hence not available.



so, in order to load saved model we switched methods. Rather than using keras’s load_model, we used tensorflow to load model so that we can load model using distribution strategy.



Note

The tf.distribute.Strategy API provides an abstraction for distributing your training across multiple processing units.



New code to get model

another_strategy = tf.distribute.MirroredStrategy()

with another_strategy.scope():

    model = SentimentService.get_model1()

New code to load model

def load_deep_model(self, model):

        loaded_model = tf.keras.models.load_model("./src/mood-saved-models/"model + ".h5")

        return loaded_model

This worked and solved the problem with runtime error of get_session not available in tensorflow 2.0 . You can refer to Tensorflow 2.0 upgraded article too



Hope, this will solve your problem too. Thanks for following this article.

继续阅读 »

这个问题是TensorFlow版本问题，在2.0以上get_session是被移除了。需要做一些修改，或者把tf降级。可以安装1.15版本
pip install tensorflow==1.15 --upgrade

Here, we will see how we can upgrade our code to work with tensorflow 2.0.



This error is usually faced when we are loading pre-trained model with tensorflow session/graph or we are building flask api over a pre-trained model and loading model in tensorflow graph to avoid collision of sessions while application is getting multiple requests at once or say in case of multi-threadinng



After tensorflow 2.0 upgrade, i also started facing above error in one of my project when i had built api of pre-trained model with flask. So i looked around in tensorflow 2.0 documents to find a workaround, to avoid this runtime error and upgrade my code to work with tensorflow 2.0 as well rather than downgrading it to tensorflow 1.x .



I had a project on which i had written tutorial as well on how to build Flask api on trained keras model of text classification and then use it in production



But this project was not working after tensorflow upgrade and was facing runtime error.



Stacktrace of error was something like below:



 File "/Users/Upasana/Documents/playground/deploy-keras-model-in-production/src/main.py", line 37, in model_predict

    with backend.get_session().graph.as_default() as g:

  File "/Users/Upasana/Documents/playground/deploy-keras-model-in-production/venv-tf2/lib/python3.6/site-packages/keras/backend/tensorflow_backend.py", line 379, in get_session

    '`get_session` is not available '

RuntimeError: `get_session` is not available when using TensorFlow 2.0.

Related code to get model

with backend.get_session().graph.as_default() as g:

        model = SentimentService.get_model1()

Related code to load model

def load_deep_model(self, model):

        json_file = open('./src/mood-saved-models/' + model + '.json', 'r')

        loaded_model_json = json_file.read()

        loaded_model = model_from_json(loaded_model_json)



        loaded_model.load_weights("./src/mood-saved-models/" + model + ".h5")



        loaded_model._make_predict_function()

        return loaded_model

get_session is removed in tensorflow 2.0 and hence not available.



so, in order to load saved model we switched methods. Rather than using keras’s load_model, we used tensorflow to load model so that we can load model using distribution strategy.



Note

The tf.distribute.Strategy API provides an abstraction for distributing your training across multiple processing units.



New code to get model

another_strategy = tf.distribute.MirroredStrategy()

with another_strategy.scope():

    model = SentimentService.get_model1()

New code to load model

def load_deep_model(self, model):

        loaded_model = tf.keras.models.load_model("./src/mood-saved-models/"model + ".h5")

        return loaded_model

This worked and solved the problem with runtime error of get_session not available in tensorflow 2.0 . You can refer to Tensorflow 2.0 upgraded article too



Hope, this will solve your problem too. Thanks for following this article.

收起阅读 »

yolo voc_label 源码分析

yolo

作用读取每个xml文件，把坐标转化为相对坐标，对应文件名保存起来

import xml.etree.ElementTree as ET

import pickle

import os

from os import listdir, getcwd

from os.path import join









sets = [('2012', 'train'), ('2012', 'val'), ('2007', 'train'), ('2007', 'val'), ('2007', 'test')]



# 20类

classes = ["aeroplane", "bicycle", "bird", "boat", "bottle", "bus", "car", "cat", "chair", "cow", "diningtable", "dog",

           "horse", "motorbike", "person", "pottedplant", "sheep", "sofa", "train", "tvmonitor"]



# size w,h

# box x-min,x-max,y-min,y-max

def convert(size, box):

    dw = 1. / size[0]

    dh = 1. / size[1]

    x = (box[0] + box[1]) / 2.0 # 中心点位置

    y = (box[2] + box[3]) / 2.0

    w = box[1] - box[0]

    h = box[3] - box[2]

    x = x * dw

    w = w * dw

    y = y * dh

    h = h * dh # 全部转化为相对坐标

    return (x, y, w, h)





def convert_annotation(year, image_id):

    # 找到2个同样的文件

    in_file = open('VOCdevkit/VOC%s/Annotations/%s.xml' % (year, image_id))

    out_file = open('VOCdevkit/VOC%s/labels/%s.txt' % (year, image_id), 'w')



    tree = ET.parse(in_file)

    root = tree.getroot()

    size = root.find('size')

    w = int(size.find('width').text)

    h = int(size.find('height').text)



    for obj in root.iter('object'):

        difficult = obj.find('difficult').text

        cls = obj.find('name').text

        if cls not in classes or int(difficult) == 1: # difficult ==1 的不要了

            continue

        cls_id = classes.index(cls) # 排在第几位

        xmlbox = obj.find('bndbox')

        b = (float(xmlbox.find('xmin').text), float(xmlbox.find('xmax').text), float(xmlbox.find('ymin').text),

             float(xmlbox.find('ymax').text))

        # 传入的是w,h 与框框的周边

        bb = convert((w, h), b)

        out_file.write(str(cls_id) + " " + " ".join([str(a) for a in bb]) + '\n')





wd = getcwd()



for year, image_set in sets:

    # ('2012', 'train') 循环5次

    # 创建目录 一次性

    if not os.path.exists('VOCdevkit/VOC%s/labels/' % (year)):

        os.makedirs('VOCdevkit/VOC%s/labels/' % (year))



    # 图片的id数据

    image_ids = open('VOCdevkit/VOC%s/ImageSets/Main/%s.txt' % (year, image_set)).read().strip().split()



    # 结果写入这个文件

    list_file = open('%s_%s.txt' % (year, image_set), 'w')



    for image_id in image_ids:

        list_file.write('%s/VOCdevkit/VOC%s/JPEGImages/%s.jpg\n' % (wd, year, image_id)) # 补全路径

        convert_annotation(year, image_id)

    list_file.close()

继续阅读 »

作用读取每个xml文件，把坐标转化为相对坐标，对应文件名保存起来

import xml.etree.ElementTree as ET

import pickle

import os

from os import listdir, getcwd

from os.path import join









sets = [('2012', 'train'), ('2012', 'val'), ('2007', 'train'), ('2007', 'val'), ('2007', 'test')]



# 20类

classes = ["aeroplane", "bicycle", "bird", "boat", "bottle", "bus", "car", "cat", "chair", "cow", "diningtable", "dog",

           "horse", "motorbike", "person", "pottedplant", "sheep", "sofa", "train", "tvmonitor"]



# size w,h

# box x-min,x-max,y-min,y-max

def convert(size, box):

    dw = 1. / size[0]

    dh = 1. / size[1]

    x = (box[0] + box[1]) / 2.0 # 中心点位置

    y = (box[2] + box[3]) / 2.0

    w = box[1] - box[0]

    h = box[3] - box[2]

    x = x * dw

    w = w * dw

    y = y * dh

    h = h * dh # 全部转化为相对坐标

    return (x, y, w, h)





def convert_annotation(year, image_id):

    # 找到2个同样的文件

    in_file = open('VOCdevkit/VOC%s/Annotations/%s.xml' % (year, image_id))

    out_file = open('VOCdevkit/VOC%s/labels/%s.txt' % (year, image_id), 'w')



    tree = ET.parse(in_file)

    root = tree.getroot()

    size = root.find('size')

    w = int(size.find('width').text)

    h = int(size.find('height').text)



    for obj in root.iter('object'):

        difficult = obj.find('difficult').text

        cls = obj.find('name').text

        if cls not in classes or int(difficult) == 1: # difficult ==1 的不要了

            continue

        cls_id = classes.index(cls) # 排在第几位

        xmlbox = obj.find('bndbox')

        b = (float(xmlbox.find('xmin').text), float(xmlbox.find('xmax').text), float(xmlbox.find('ymin').text),

             float(xmlbox.find('ymax').text))

        # 传入的是w,h 与框框的周边

        bb = convert((w, h), b)

        out_file.write(str(cls_id) + " " + " ".join([str(a) for a in bb]) + '\n')





wd = getcwd()



for year, image_set in sets:

    # ('2012', 'train') 循环5次

    # 创建目录 一次性

    if not os.path.exists('VOCdevkit/VOC%s/labels/' % (year)):

        os.makedirs('VOCdevkit/VOC%s/labels/' % (year))



    # 图片的id数据

    image_ids = open('VOCdevkit/VOC%s/ImageSets/Main/%s.txt' % (year, image_set)).read().strip().split()



    # 结果写入这个文件

    list_file = open('%s_%s.txt' % (year, image_set), 'w')



    for image_id in image_ids:

        list_file.write('%s/VOCdevkit/VOC%s/JPEGImages/%s.jpg\n' % (wd, year, image_id)) # 补全路径

        convert_annotation(year, image_id)

    list_file.close()

收起阅读 »

docker run 和 create 区别

docker

Technically, docker run = docker create + docker start .
docker create command creates a writeable container from the image and prepares it for running.
docker run command creates the container (same as docker create ) and starts it.

继续阅读 »

Technically, docker run = docker create + docker start .
docker create command creates a writeable container from the image and prepares it for running.
docker run command creates the container (same as docker create ) and starts it.

收起阅读 »

用docker编译go代码

docker

如果偶尔需要编译go代码，但是又不想要安装一堆乱七八糟的依赖和Go编译器，可以利用docker来实现。应该是解决起来话费时间最小的。
1. 用文本编辑你的go代码，现在以hello world为例：

package main import "fmt" func main() {

/* 这是我的第一个简单的程序 */    

fmt.Println("Hello, World!") 

}

2. 然后直接使用docker执行编译。docker首先会自动去下载go的编译器，顺便把所有的依赖给解决掉

docker run --rm -v "$(pwd)":/usr/src/hello -w /usr/src/hello golang:1.3 go build -v

最后会在本地生成一个编译好的hello静态文件。
上述docker命令的具体含义就是把当前路径挂在到docker容器里头，然后切换到改到改路径下，然后进行编译。

继续阅读 »

如果偶尔需要编译go代码，但是又不想要安装一堆乱七八糟的依赖和Go编译器，可以利用docker来实现。应该是解决起来话费时间最小的。
1. 用文本编辑你的go代码，现在以hello world为例：

package main import "fmt" func main() {

/* 这是我的第一个简单的程序 */    

fmt.Println("Hello, World!") 

}

2. 然后直接使用docker执行编译。docker首先会自动去下载go的编译器，顺便把所有的依赖给解决掉

docker run --rm -v "$(pwd)":/usr/src/hello -w /usr/src/hello golang:1.3 go build -v

最后会在本地生成一个编译好的hello静态文件。
上述docker命令的具体含义就是把当前路径挂在到docker容器里头，然后切换到改到改路径下，然后进行编译。

收起阅读 »

docker实战代码勘误

读后感 docker

作者在官网刊登的勘误记录：

看这本书的作者一定要看，不然坑挺多的。
一路采坑过来的哭着说。

Last updated August 21, 2016

In an effort to offer continued support beyond publication, we have listed many updates to code due to version updates.



[code - omission] Page 18



The command to start the "mailer" is missing a line. Where the book reads:

docker run -d \

  --name mailer \

the proper command should read:

docker run -d \

  --name mailer \

  dockerinaction/ch2_mailer

  

[code - regression] Page 68



Newer versions of Docker have changed the structure of the JSON returned by the docker inspect subcommand. If the following command does not work then use the replacement. Original:

docker inspect --format "{{json .Volumes}}" reader

Replacement:

docker inspect --format "{{json .Mounts}}" reader



[code - regression] Page 69



Newer versions of Docker have changed the structure of the JSON returned by the docker inspect subcommand. If the following command does not work then use the replacement. Original:

docker inspect --format "{{json .Volumes}}" student

Replacement:

docker inspect --format "{{json .Mounts}}" student



[code - regression] Page 74



The alpine image entrypoint has changed since original publication and has been unset. The last command on the page should now read:

docker run --rm \

  --volumes-from tools \

  --entrypoint /bin/sh \

  alpine:latest \

  -c 'ls /operations/*'

[code - regression] Page 75



The docker exec example on the top of page 75 was printed with the wrong tool name. The correct command is:

docker exec important_application /operations/tools/diagnostics



[code - regression] Page 86



It appears that nslookup behavior in the alpine image has changed. To run the example use the busybox:1 image.

docker run --rm \

  --hostname barker \

  busybox:1 \

  nslookup barker

  

[code - regression] Page 87 (top)



It appears that nslookup behavior in the alpine image has changed. To run the example use the busybox:1 image.

docker run --rm \

  --dns 8.8.8.8 \

  busybox:1 \

  nslookup docker.com

  

[code - regression] Page 87 (bottom)



It appears that nslookup behavior in the alpine image has changed. To run the example use the busybox:1 image.

docker run --rm \

  --dns-search docker.com \

  busybox:1 \

  nslookup registry.hub

  

[code - regression] Page 88 (bottom)



It appears that nslookup behavior in the alpine image has changed. To run the example use the busybox:1 image.

docker run --rm \

  --add-host test:10.10.10.255 \

  busybox:1 \

  nslookup test

  

[code - regression] Page 106



There are a few new problems with this example. First, the named repository (dockerfile/mariadb) no longer exists. You can use mariadb:5.5 as replacement. However, the second problem is that containers created from the mariadb image perform certain initialization at startup. That initialization work requires certain capabilities and to be started with the default user. The system should instead drop permissions after the initialization work is complete. Note that the real value of this example is in demonstrating different resource isolation mechanisms. It is not so important that you get it working. You can start the database with the following command:

docker run -d --name ch6_mariadb \

  --memory 256m \

  --cpu-shares 1024 \

  --cap-drop net_raw \

  -e MYSQL_ROOT_PASSWORD=test \

  mariadb:5.5

  

[code - regression] Page 107



Containers created from the wordpress:4.1 image perform certain initialization at startup and expect certain environment variables. That initialization work requires certain capabilities and to be started with the default user. The system should instead drop permissions after the initialization work is complete. Note that the real value of this example is in demonstrating different resource isolation mechanisms. It is not so important that you get it working. You can start wordpress with the following command:

docker run -d -P --name ch6_wordpress \

  --memory 512m \

  --cpu-shares 512 \

  --cap-drop net_raw \

  -e WORDPRESS_DB_PASSWORD=test \

  mariadb:5.5

  

[code - typo] Page 109



The device access example is missing the "run" subcommand. The command listed as:

docker -it --rm \

  --device /dev/video0:/dev/video0 \

  ubuntu:latest ls -al /dev

should have been written as:

docker run -it --rm \

  --device /dev/video0:/dev/video0 \

  ubuntu:latest ls -al /dev

  

[code - typo] Page 110 - 111



Several commands are missing the "run" subcommand. In each case the command begins with

docker -d ...

and should have been written as:

docker run -d ...



[code - regression] Page 115 (bottom)



The busybox and alpine images have been updated to fix the problem described in the paragraph below. The 'su' command does not have the SUID bit set and will not provide any automatic privilege escalation.



[command correction] Page 116



Boot2Docker has been discontinued and rolled into a newer project called Docker Machine. Because a reader is unlikely to have the boot2docker command installed, the command at the top of this page should be changed from:

boot2docker ssh

to the Docker Machine equivalent:

docker-machine ssh default

where default is the name of the machine you created.



[code - regression] Page 119



The ifconfig command has since been removed from ubuntu:latest. Instead of using the ubuntu:latest image for these examples use ubuntu:trusty. The example using ifconfig should look like:

docker run --rm \

  --privileged \

  ubuntu:trusty ifconfig

[Illustration Mistake] Page 136

Image layer ced2 on the left side of the illustration is listed at c3d2 on the right side. These two layers should represent the same item.



[code - typo] Page 140



Containers need not be in a running state in order to export their file system. The first command on page 140 uses the "run" subcommand but the command listed will never be able to start. Replace "run" with "create." The command should appear as follows:

docker create --name export-test \

  dockerinaction/ch7_packed:latest ./echo For Export

  

[code - missing line] Page 146



In the example Dockerfile near the top of the page the line with the RUN directive is missing part of the instruction. That line should read:

RUN apt-get update && apt-get install -y git



[code - evolution] Page 215



The registry:2 configuration file now requires the population of additional fields under "maintenance > uploadpurging." The example should currently look like:

# Filename: s3-config.yml

version: 0.1

log:

    level: debug

    fields:

        service: registry

        environment: development

storage:

    cache:

        layerinfo: inmemory

    s3:

        accesskey: 

        secretkey: 

        region: 

        bucket: 

        encrypt: true

        secure: true

        v4auth: true

        chunksize: 5242880

        rootdirectory: /s3/object/name/prefix

    maintenance:

        uploadpurging:

            enabled: true

            age: 168h

            interval: 24h

            dryrun: false

        readonly:

            enabled: false

http:

    addr: :5000

    secret: asecretforlocaldevelopment

    debug:

        addr: localhost:5001

        

[code - evolution] Page 216



The registry:2 configuration file now requires the population of additional fields under "maintenance > uploadpurging." The example should currently look like:

# Filename: rados-config.yml

version: 0.1

log:

    level: debug

    fields:

        service: registry

        environment: development

storage:

    cache:

        layerinfo: inmemory

    rados:

        poolname: radospool

        username: radosuser

        chunksize: 4194304

    maintenance:

        uploadpurging:

            enabled: false

            age: 168h

            interval: 24h

            dryrun: false

        readonly:

            enabled: false

http:

    addr: :5000

    secret: asecretforlocaldevelopment

    debug:

        addr: localhost:5001

        

[code - evolution] Page 218



The registry:2 configuration file now requires the population of additional fields under "maintenance > uploadpurging." The example should currently look like:

# Filename: redis-config.yml

version: 0.1

log:

    level: debug

    fields:

        service: registry

        environment: development

http:

    addr: :5000

    secret: asecretforlocaldevelopment

    debug:

        addr: localhost:5001

storage:

    cache:

        blobdescriptor: redis

    s3:

        accesskey: 

        secretkey: 

        region: 

        bucket: 

        encrypt: true

        secure: true

        v4auth: true

        chunksize: 5242880

        rootdirectory: /s3/object/name/prefix

    maintenance:

        uploadpurging:

            enabled: true

            age: 168h

            interval: 24h

            dryrun: false

        readonly:

            enabled: false

redis:

    addr: redis-host:6379

    password: asecret

    dialtimeout: 10ms

    readtimeout: 10ms

    writetimeout: 10ms

    pool:

        maxidle: 16

        maxactive: 64

        idletimeout: 300s

[code - typo] Page 220



The name of the file shown should be scalable-config.yml as in previous examples. This example also requires the addition of the newer uploadpurging attributes. The mainenance section of the file should be as follows:

    maintenance:

        uploadpurging:

            enabled: true

            age: 168h

            interval: 24h

            dryrun: false

        readonly:

            enabled: false

[text - typo] Page 240



In the second paragraph the reader is instructed to, "Open ./coffee/api/api.py" this is not the correct location of the file. The correct file location is at, "./coffee/app/api.py."

[text - typo] Page 262



The refere nce to "flock.json" in the first sentence of the third paragraph should be "flock.yml."

[code - typo] Page 270



The git clone command uses the ssh protocol instead of https. The command should read as follows:

git clone https://github.com/dockerinact ... i.git

继续阅读 »

作者在官网刊登的勘误记录：

看这本书的作者一定要看，不然坑挺多的。
一路采坑过来的哭着说。

Last updated August 21, 2016

In an effort to offer continued support beyond publication, we have listed many updates to code due to version updates.



[code - omission] Page 18



The command to start the "mailer" is missing a line. Where the book reads:

docker run -d \

  --name mailer \

the proper command should read:

docker run -d \

  --name mailer \

  dockerinaction/ch2_mailer

  

[code - regression] Page 68



Newer versions of Docker have changed the structure of the JSON returned by the docker inspect subcommand. If the following command does not work then use the replacement. Original:

docker inspect --format "{{json .Volumes}}" reader

Replacement:

docker inspect --format "{{json .Mounts}}" reader



[code - regression] Page 69



Newer versions of Docker have changed the structure of the JSON returned by the docker inspect subcommand. If the following command does not work then use the replacement. Original:

docker inspect --format "{{json .Volumes}}" student

Replacement:

docker inspect --format "{{json .Mounts}}" student



[code - regression] Page 74



The alpine image entrypoint has changed since original publication and has been unset. The last command on the page should now read:

docker run --rm \

  --volumes-from tools \

  --entrypoint /bin/sh \

  alpine:latest \

  -c 'ls /operations/*'

[code - regression] Page 75



The docker exec example on the top of page 75 was printed with the wrong tool name. The correct command is:

docker exec important_application /operations/tools/diagnostics



[code - regression] Page 86



It appears that nslookup behavior in the alpine image has changed. To run the example use the busybox:1 image.

docker run --rm \

  --hostname barker \

  busybox:1 \

  nslookup barker

  

[code - regression] Page 87 (top)



It appears that nslookup behavior in the alpine image has changed. To run the example use the busybox:1 image.

docker run --rm \

  --dns 8.8.8.8 \

  busybox:1 \

  nslookup docker.com

  

[code - regression] Page 87 (bottom)



It appears that nslookup behavior in the alpine image has changed. To run the example use the busybox:1 image.

docker run --rm \

  --dns-search docker.com \

  busybox:1 \

  nslookup registry.hub

  

[code - regression] Page 88 (bottom)



It appears that nslookup behavior in the alpine image has changed. To run the example use the busybox:1 image.

docker run --rm \

  --add-host test:10.10.10.255 \

  busybox:1 \

  nslookup test

  

[code - regression] Page 106



There are a few new problems with this example. First, the named repository (dockerfile/mariadb) no longer exists. You can use mariadb:5.5 as replacement. However, the second problem is that containers created from the mariadb image perform certain initialization at startup. That initialization work requires certain capabilities and to be started with the default user. The system should instead drop permissions after the initialization work is complete. Note that the real value of this example is in demonstrating different resource isolation mechanisms. It is not so important that you get it working. You can start the database with the following command:

docker run -d --name ch6_mariadb \

  --memory 256m \

  --cpu-shares 1024 \

  --cap-drop net_raw \

  -e MYSQL_ROOT_PASSWORD=test \

  mariadb:5.5

  

[code - regression] Page 107



Containers created from the wordpress:4.1 image perform certain initialization at startup and expect certain environment variables. That initialization work requires certain capabilities and to be started with the default user. The system should instead drop permissions after the initialization work is complete. Note that the real value of this example is in demonstrating different resource isolation mechanisms. It is not so important that you get it working. You can start wordpress with the following command:

docker run -d -P --name ch6_wordpress \

  --memory 512m \

  --cpu-shares 512 \

  --cap-drop net_raw \

  -e WORDPRESS_DB_PASSWORD=test \

  mariadb:5.5

  

[code - typo] Page 109



The device access example is missing the "run" subcommand. The command listed as:

docker -it --rm \

  --device /dev/video0:/dev/video0 \

  ubuntu:latest ls -al /dev

should have been written as:

docker run -it --rm \

  --device /dev/video0:/dev/video0 \

  ubuntu:latest ls -al /dev

  

[code - typo] Page 110 - 111



Several commands are missing the "run" subcommand. In each case the command begins with

docker -d ...

and should have been written as:

docker run -d ...



[code - regression] Page 115 (bottom)



The busybox and alpine images have been updated to fix the problem described in the paragraph below. The 'su' command does not have the SUID bit set and will not provide any automatic privilege escalation.



[command correction] Page 116



Boot2Docker has been discontinued and rolled into a newer project called Docker Machine. Because a reader is unlikely to have the boot2docker command installed, the command at the top of this page should be changed from:

boot2docker ssh

to the Docker Machine equivalent:

docker-machine ssh default

where default is the name of the machine you created.



[code - regression] Page 119



The ifconfig command has since been removed from ubuntu:latest. Instead of using the ubuntu:latest image for these examples use ubuntu:trusty. The example using ifconfig should look like:

docker run --rm \

  --privileged \

  ubuntu:trusty ifconfig

[Illustration Mistake] Page 136

Image layer ced2 on the left side of the illustration is listed at c3d2 on the right side. These two layers should represent the same item.



[code - typo] Page 140



Containers need not be in a running state in order to export their file system. The first command on page 140 uses the "run" subcommand but the command listed will never be able to start. Replace "run" with "create." The command should appear as follows:

docker create --name export-test \

  dockerinaction/ch7_packed:latest ./echo For Export

  

[code - missing line] Page 146



In the example Dockerfile near the top of the page the line with the RUN directive is missing part of the instruction. That line should read:

RUN apt-get update && apt-get install -y git



[code - evolution] Page 215



The registry:2 configuration file now requires the population of additional fields under "maintenance > uploadpurging." The example should currently look like:

# Filename: s3-config.yml

version: 0.1

log:

    level: debug

    fields:

        service: registry

        environment: development

storage:

    cache:

        layerinfo: inmemory

    s3:

        accesskey: 

        secretkey: 

        region: 

        bucket: 

        encrypt: true

        secure: true

        v4auth: true

        chunksize: 5242880

        rootdirectory: /s3/object/name/prefix

    maintenance:

        uploadpurging:

            enabled: true

            age: 168h

            interval: 24h

            dryrun: false

        readonly:

            enabled: false

http:

    addr: :5000

    secret: asecretforlocaldevelopment

    debug:

        addr: localhost:5001

        

[code - evolution] Page 216



The registry:2 configuration file now requires the population of additional fields under "maintenance > uploadpurging." The example should currently look like:

# Filename: rados-config.yml

version: 0.1

log:

    level: debug

    fields:

        service: registry

        environment: development

storage:

    cache:

        layerinfo: inmemory

    rados:

        poolname: radospool

        username: radosuser

        chunksize: 4194304

    maintenance:

        uploadpurging:

            enabled: false

            age: 168h

            interval: 24h

            dryrun: false

        readonly:

            enabled: false

http:

    addr: :5000

    secret: asecretforlocaldevelopment

    debug:

        addr: localhost:5001

        

[code - evolution] Page 218



The registry:2 configuration file now requires the population of additional fields under "maintenance > uploadpurging." The example should currently look like:

# Filename: redis-config.yml

version: 0.1

log:

    level: debug

    fields:

        service: registry

        environment: development

http:

    addr: :5000

    secret: asecretforlocaldevelopment

    debug:

        addr: localhost:5001

storage:

    cache:

        blobdescriptor: redis

    s3:

        accesskey: 

        secretkey: 

        region: 

        bucket: 

        encrypt: true

        secure: true

        v4auth: true

        chunksize: 5242880

        rootdirectory: /s3/object/name/prefix

    maintenance:

        uploadpurging:

            enabled: true

            age: 168h

            interval: 24h

            dryrun: false

        readonly:

            enabled: false

redis:

    addr: redis-host:6379

    password: asecret

    dialtimeout: 10ms

    readtimeout: 10ms

    writetimeout: 10ms

    pool:

        maxidle: 16

        maxactive: 64

        idletimeout: 300s

[code - typo] Page 220



The name of the file shown should be scalable-config.yml as in previous examples. This example also requires the addition of the newer uploadpurging attributes. The mainenance section of the file should be as follows:

    maintenance:

        uploadpurging:

            enabled: true

            age: 168h

            interval: 24h

            dryrun: false

        readonly:

            enabled: false

[text - typo] Page 240



In the second paragraph the reader is instructed to, "Open ./coffee/api/api.py" this is not the correct location of the file. The correct file location is at, "./coffee/app/api.py."

[text - typo] Page 262



The refere nce to "flock.json" in the first sentence of the third paragraph should be "flock.yml."

[code - typo] Page 270



The git clone command uses the ssh protocol instead of https. The command should read as follows:

git clone https://github.com/dockerinact ... i.git

收起阅读 »

jieba.posseg TypeError: cannot unpack non-iterable pair object 词性分析报错

NLP

词性标注的例子出现错误 'pair' object is not iterable

例子：

import jieba.posseg as pseg

seg_list = pseg.cut("我爱北京天安门")

for word,flag in seg_list:

    print(word)

    print(flag)

---------------------------------------------------------------------------

TypeError                                 Traceback (most recent call last)

<ipython-input-5-f105f6980f88> in <module>()

      1 import jieba.posseg as pseg

      2 seg_list = pseg.cut("我爱北京天安门")

----> 3 for word,flag in seg_list:

      4     print(word)

      5     print(flag)



TypeError: cannot unpack non-iterable pair object

原因是新版本中seg_list是一个生成器，所以只能 for win seg_list

然后从word中解包出来

print(w.word)

print(w.flag)

这样问题就解决了。

继续阅读 »

词性标注的例子出现错误 'pair' object is not iterable

例子：

import jieba.posseg as pseg

seg_list = pseg.cut("我爱北京天安门")

for word,flag in seg_list:

    print(word)

    print(flag)

---------------------------------------------------------------------------

TypeError                                 Traceback (most recent call last)

<ipython-input-5-f105f6980f88> in <module>()

      1 import jieba.posseg as pseg

      2 seg_list = pseg.cut("我爱北京天安门")

----> 3 for word,flag in seg_list:

      4     print(word)

      5     print(flag)



TypeError: cannot unpack non-iterable pair object

原因是新版本中seg_list是一个生成器，所以只能 for win seg_list

然后从word中解包出来

print(w.word)

print(w.flag)

这样问题就解决了。

收起阅读 »

华为AI云市场的modelarts是要收费的<坑>

闲聊吐槽

点击查看大图

看了介绍说觉得不错，就注册了，然后还要求实名才能使用。想着华为大公司应该不会拿个人数据到处倒卖吧，就按教程下一步下一步注册了。
拿了几张样本上传到服务器，自己打标注，生成模型。然后测试一下效果，感觉产品做得不错。但整个过程没提示说要收费。

然后晚上就收到几条短信说华为的AI云处于欠费状态，晕，原来是秋后算账呀。

点击查看大图

还以为不部署到服务器的话就不需要缴费呢，怎么随便测试了下就扣费了呢。

继续阅读 »

点击查看大图

看了介绍说觉得不错，就注册了，然后还要求实名才能使用。想着华为大公司应该不会拿个人数据到处倒卖吧，就按教程下一步下一步注册了。
拿了几张样本上传到服务器，自己打标注，生成模型。然后测试一下效果，感觉产品做得不错。但整个过程没提示说要收费。

然后晚上就收到几条短信说华为的AI云处于欠费状态，晕，原来是秋后算账呀。

点击查看大图

还以为不部署到服务器的话就不需要缴费呢，怎么随便测试了下就扣费了呢。
收起阅读 »

scrapy在settings中定义变量不能包含小写！

python爬虫

如果变量名包含小写字母，那么你的变量会被过滤掉，在scrapy编码的其他地方都会无法被识别。
比如定义了一个叫 Redis_host = '192.168.1.1'，的值

然后在spider中，如果你调用self.settings.get('Redis_host')
那么返回值是 None。

如果用REDIS_HOST定义，那么就可以正确返回它的值。

如果你一定要用小写，也有其他方法可正常调用。
先导入settings文件
fromt xxxx import setttings # xxx为项目名

host = settings.Redis_host # 直接导入一个文件的形式来调用是可以的

继续阅读 »

如果变量名包含小写字母，那么你的变量会被过滤掉，在scrapy编码的其他地方都会无法被识别。
比如定义了一个叫 Redis_host = '192.168.1.1'，的值

然后在spider中，如果你调用self.settings.get('Redis_host')
那么返回值是 None。

如果用REDIS_HOST定义，那么就可以正确返回它的值。

如果你一定要用小写，也有其他方法可正常调用。
先导入settings文件
fromt xxxx import setttings # xxx为项目名

host = settings.Redis_host # 直接导入一个文件的形式来调用是可以的收起阅读 »

etree.strip_tags的用法

python爬虫

直接从官方文档那里拿过来，发现这个函数功能还挺不错的。
它把参数中的标签从源htmlelement中删除，并且把里面的标签文本给合并进来。

举个例子：

from lxml.html import etree

from lxml.html import fromstring, HtmlElement



test_html = '''<p><span>hello</span><span>world</span></p>'''

test_element = fromstring(test_html)

etree.strip_tags(test_element,'span') # 清除span标签

etree.tostring(test_element)

因为上述操作直接应用于test_element上的，所以test_element的值已经被修改了。

所以现在test_element 的值是
b'<p>helloworld</p>'

原创文章，转载请注明出处
http://30daydo.com/article/553

继续阅读 »

直接从官方文档那里拿过来，发现这个函数功能还挺不错的。
它把参数中的标签从源htmlelement中删除，并且把里面的标签文本给合并进来。

举个例子：

from lxml.html import etree

from lxml.html import fromstring, HtmlElement



test_html = '''<p><span>hello</span><span>world</span></p>'''

test_element = fromstring(test_html)

etree.strip_tags(test_element,'span') # 清除span标签

etree.tostring(test_element)

因为上述操作直接应用于test_element上的，所以test_element的值已经被修改了。

所以现在test_element 的值是
b'<p>helloworld</p>'

原创文章，转载请注明出处
http://30daydo.com/article/553
收起阅读 »

pycharm自带的版本控制软件挺好用的

闲聊

pycharm自带的git，svn版本控制工具已经很好用的了，所以以后可以直接不用sourcetree这种专业的GUI管理软件了

android monitor 系统找不到文件 lib\monitor-Could。

手机爬虫

最新的android studio删除了android monitor
但我觉得这是一个很好用的监控日志工具。

在sdk的tool目录底下启动

monitor.bat

然后就可以看到报错：

系统找不到文件 lib\monitor-Could

解决办法：
使用android-studio的sdk管理工具下载一个android-19的API，最新的api少了部分jar文件。

继续阅读 »

最新的android studio删除了android monitor
但我觉得这是一个很好用的监控日志工具。

在sdk的tool目录底下启动

monitor.bat

然后就可以看到报错：

系统找不到文件 lib\monitor-Could

解决办法：
使用android-studio的sdk管理工具下载一个android-19的API，最新的api少了部分jar文件。
收起阅读 »

996走起

win10 vbs无法隐藏执行批处理问题

redis desktop manager windows客户端

pyqt5 QRect在哪个类

分享一个电影网站 insiji.com

2020-03-27 新事情坚持

忘记的QQ密码

windows下logstash6.5安装配置网上的教程是linux，有坑

薅“疫情公益”羊毛，黑产恶意爬取各大出版社电子书上万册

个人的知识星球

希望能够找到一个可以一起对彪的人

证券etf和券商etf的区别

socketio中client的sio wait用法

dataframe 根据日期重采样计算个数

失落

bandcamp移动开发更简单

mongodb日期条件查找

多线程调用yolo模型会出错（包括使用cv2载入yolo模型）

keras yolo物体检测入门教程

RuntimeError: `get_session` is not available when using TensorFlow 2.0.

yolo voc_label 源码分析

docker run 和 create 区别

用docker编译go代码

docker实战代码勘误

jieba.posseg TypeError: cannot unpack non-iterable pair object 词性分析报错

华为AI云市场的modelarts是要收费的<坑>

scrapy在settings中定义变量不能包含小写！

etree.strip_tags的用法

pycharm自带的版本控制软件挺好用的

android monitor 系统找不到文件 lib\monitor-Could。

热门文章

热门话题