用Python分析了数千个微信昵称后，我们发现了这些规律...

发布时间：2018-09-08 13:17:37 所属栏目：教程来源：XksA

导读：副标题#e# 一、前言这是一篇技术文，但又不是一篇技术文，今天分享的是，当笔者获取了微信小程序英文取名的3500多个微信用户昵称、年龄段后，分析得到下面结果。二、Let's get it 1.基本信息获取访问英文取名的用户基本信息接口，获取英文取名用户微信名

(3)数据分类判断

# 昵称全中文判断 
def is_all_ch(keyword): 
    for c in keyword: 
        # 包含常见中文字符 
        if not ('u4e00' <= c <= 'u9fa5'): 
            return False 
    return True 
 
# 昵称全英文判断 
def is_all_en(keyword): 
    # 不能全部为空格或者首位为空格 
    if all(ord(c) == 32 for c in keyword) or keyword[0] == ' ': 
        return False 
    # 允许空格和英文并存(例如：Xist A) 
    if not all(65 < ord(c) < 128 or ord(c) == 32 for c in keyword): 
        return False 
    return True 
 
# 昵称全数字判断 
def is_all_di(keyword): 
    for uchar in keyword: 
        if not (uchar >= 'u0030' and uchar <= u'u0039'): 
            return False 
    return True 
 
# 昵称包含表情图判断 
def have_img(keyword): 
    # 下面是大部分图片的一个unicode编码集 
    # 详情查看：https://en.wikipedia.org/wiki/Emoji 
    img_re = re.compile(u'[' 
                      u'U0001F300-U0001F64F' 
                      u'U0001F680-U0001F6FF' 
                      u'u2600-u2B55]+', 
                      re.UNICODE) 
    if img_re.findall(keyword) : 
        return True 
    return False 
 
# 中文+数字昵称判断 
def is_ch_di(keyword): 
    for c in keyword: 
        if not ('u4e00' <= c <= 'u9fa5') and not (c >= 'u0030' and c <= u'u0039'): 
            return False 
    return True

（编辑：惠州站长网）

【声明】本站内容均来自网络，其相关言论仅代表作者个人观点，不代表本站立场。若无意侵犯到您的权利，请及时与联系站长删除相关内容!

2/11

首页

尾页