"DNA元基催化与肽计算的第5修订版本V00062191"

Updated 2023-12-29. PDF, 15.15 MB.

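DNA Element-Base (元基) Catalysis and Peptide Computing, 5th Revised Edition V00062191, is a bilingual Chinese-English textbook in slide (PPT) form, without source code, written by Yaoguang Luo (罗瑶光) and Rongwu Luo (罗荣武). The book is divided into multiple chapters. Chapter 1 introduces the Deta natural-language Turing system. It discusses the catalytic word-cutting optimization used by Deta word segmentation, together with segmentation, sorting, and neural network indexing, and it also covers the use of segmentation in linear text search and the concept of dynamic POS.

DNA element-base catalysis and peptide computing is presented as a frontier research area that draws on biochemistry, computer science, and other disciplines. Studying the catalysis of DNA element-bases and peptide computing is meant to give a deeper understanding of how biomolecules can be used in computer simulation, and in turn to advance bioinformatics and computer science. The 5th revised edition gives a broad introduction to and discussion of the subject for learners and researchers in related fields.

In Chapter 1, the Deta natural-language Turing system is a central topic. The catalytic word-cutting optimization of Deta segmentation belongs to natural language processing and matters for text processing and semantic analysis. Segmentation, sorting, and neural network indexing relate to information retrieval and data mining, and help improve the efficiency and accuracy of text processing. The use of segmentation in linear text search and dynamic POS are also active topics in natural language processing, relevant to search-engine result quality and user experience.

The 5th revised edition also introduces the basic concepts and principles of DNA element-base catalysis and peptide computing. DNA element-base catalysis refers to element-bases that play a catalytic role within the DNA molecule and that strongly influence the structure and function of DNA. Peptide computing refers to a method of computing with the structure and properties of peptide chains, with applications in bioinformatics and drug design. Research on both is intended to provide theoretical support and technical tools for biomedical research and applications.

In short, the book covers the Deta natural-language Turing system, DNA element-base catalysis, peptide computing, and related topics. It is aimed at learners and researchers in bioinformatics, computer science, and related fields.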

DNA Element-Base (元基) Catalysis and Peptide Computing, 5th Revised Edition V0006, page 16

constant values, balancing of the computing sets, and discrete conditional differentiation (De Morgan, frequency flows, etc.).

These techniques are now widely used across Deta's catalytic family of technologies (parser, word segmentation, mind reading, NLP computing, etc.).

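Neural Network Index

1 The Deta word segmenter indexes its vocabulary dictionary with a map. Since JDK 8, the hash map implementations (HashMap, ConcurrentHashMap) convert heavily collided buckets into red-black trees, so key lookup inside such a bucket is a binary-tree search; the author considers this to bring lookup speed to its peak. refer page 129, 131

2 The index keeps splitting the large map into finer category maps, such as a word-length map, a word-class map, and a part-of-speech map, which speeds up lookup further. refer page 55

3 The index maps support second-level combined computation and can be cached as an index on distributed servers. The author does not recommend second-level combined computation on a single machine. refer page 92

4 Map keys are identified by the integer (ASCII) code of each char in the string when finding a key. This suits binary-search storage and fast StringBuilder computation, and keeps the low-level kernel uniform. refer page 92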

Neural Network Index Forest

1 Deta Parser builds its word-segmentation index as a map over a human-language semantic dictionary. It uses the JDK 8+ map implementation for the lookup logic because that implementation already integrates binary search trees, balanced tree arrangement, and related techniques.

2 Deta Parser spreads this balanced-tree lookup evenly, in an observer-style arrangement, across several classified Java concurrent maps. These maps cover word length in chars, word class, part-of-speech corpus, and so on. The author does this to speed up neural (Nero) matching when searching for words.

3 Deta Parser supports second-level combined index computation, which suits distributed caches for search systems. The author does not recommend using this technique on a single desktop machine.

4 For the computing logic, Deta Parser functions ultimately use StringBuilder to accelerate the search engine.
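The classified maps in items 2 and 4 can be pictured with a small sketch. This is not the author's code: the class name `ClassifiedIndexSketch`, the method names, and the `-` separated key format are assumptions made for illustration. It splits one dictionary into sub-maps by word length, so a lookup only touches entries of the right length, and it shows one possible reading of "keys identified by the int code of each char", built with StringBuilder.

```java
import java.util.HashMap;
import java.util.Map;

class ClassifiedIndexSketch {
    // One sub-map per word length: length -> (word -> part of speech).
    private final Map<Integer, Map<String, String>> byLength = new HashMap<>();

    void add(String word, String pos) {
        byLength.computeIfAbsent(word.length(), k -> new HashMap<>()).put(word, pos);
    }

    // Look only in the sub-map whose word length matches the candidate.
    String posOf(String candidate) {
        Map<String, String> bucket = byLength.get(candidate.length());
        return bucket == null ? null : bucket.get(candidate);
    }

    // Assumed key format: the int code of each char, joined with '-'.
    static String charCodeKey(String word) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < word.length(); i++) {
            if (i > 0) sb.append('-');
            sb.append((int) word.charAt(i));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        ClassifiedIndexSketch index = new ClassifiedIndexSketch();
        index.add("如果", "conjunction");
        index.add("理想", "adjective");
        System.out.println(index.posOf("如果"));
        System.out.println(charCodeKey("ab"));
    }
}
```

Further category maps (word class, part of speech) could be added the same way, each as its own small map rather than one large one.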

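The value of the neural network index shows in two places: the association index for segmentation and the vocabulary map index. The association index extracts the characters of words as chains. This chained computation takes the huge number of otherwise independent words in the lexicon and links them front to back by length, following the anadiplosis device (顶针) of human literary language (NERO). Its value is that a large text can be extracted segment by segment, in small pieces that carry the necessary links (NLP). It is like squeezing toothpaste: what is squeezed out is used up at once for brushing (POS).

The vocabulary map index cuts the character chains in a reasonable way. This chained cutting combines and matches category maps, built from different attributes of the lexicon, and cuts according to the strict definitions of part of speech and subject-predicate-object collocation in human language. Its value is that these category maps can be designed adaptively and extended in varied ways, which raises segmentation accuracy and flexibility across different scenarios. This is like the toothbrush: the squeezed-out toothpaste is matched with different toothbrushes and brushing methods (NERO + POS) to suit different oral environments. Described by Yaoguang Luo; to be refined later.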

The value of the neural network index shows mainly in two places: (1) the association index for word segmentation and (2) the lexical map index. The value of the association index lies in the chained extraction of words. This chained computation links the large number of otherwise independent words in the thesaurus, following the anadiplosis device ("thimble", 顶针) of human language and literature (Nero). For large documents, the word chain is split into small sections of chars (at most 4 per section). This is similar to squeezing toothpaste and then brushing (POS) with what the DetaParser matching engine has squeezed out.


The value of the lexical map index lies in the reasonable chain segmentation of lexical characters. This segmentation method combines and matches the classified maps in the thesaurus according to their attributes, then separates words according to the rigorous definitions of lexical POS and subject-verb-object (SVO) collocation in human literary language. Because these classifications can be designed adaptively and extended in varied ways, they increase the accuracy and flexibility of word segmentation across different scenarios. This is like the toothbrush: the squeezed-out toothpaste is matched with different toothbrushes and brushing methods (Nero + POS) to suit different oral environments.

Author: Yaoguang.Luo
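A rough way to picture extraction in sections of at most 4 chars is forward maximum matching, a standard dictionary segmentation technique. It is used here as a stand-in and is not claimed to be the author's chained extraction; the class name `ChainExtractSketch` and the toy dictionary are assumptions.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

class ChainExtractSketch {
    static final int MAX_LEN = 4; // the text caps a section at 4 chars

    // Forward maximum matching: at each position take the longest dictionary
    // entry of at most MAX_LEN chars, falling back to a single char.
    static List<String> extract(String text, Set<String> dict) {
        List<String> out = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            int end = Math.min(text.length(), i + MAX_LEN);
            String piece = text.substring(i, i + 1); // single-char fallback
            for (int j = end; j > i + 1; j--) {
                String candidate = text.substring(i, j);
                if (dict.contains(candidate)) {
                    piece = candidate;
                    break;
                }
            }
            out.add(piece);
            i += piece.length();
        }
        return out;
    }

    public static void main(String[] args) {
        // Toy dictionary (assumption), not the Deta lexicon.
        Set<String> dict = new HashSet<>(Arrays.asList("如果", "是非", "非常", "理想"));
        System.out.println(extract("如果是非常理想", dict));
    }
}
```

With this toy dictionary, greedy matching alone takes '是非' before '常', which is one reason a purely length-based cut needs a further POS-based check.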

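Word Segmentation in Linear Text Search

1 Deta segmentation search is built on weight calculations over the map classes. The scores produced by stacking the different weights are sorted for output. refer page 64, Volume 2

2 Weights are computed by sentence role (subject, predicate, object; for example pronoun, noun, verb, adjective) and by POS class (for example verb, noun, adjective, predicate, preposition). refer page 66, Volume 2

3 Weights are coupled with word length and word frequency through bit-level combination to produce the final output (the author states that bit operations are about an order of magnitude faster than multiplication). refer page 68, Volume 2

4 The ratio of weight to word length can be tuned for precision, which sets the accuracy of the search and records personal search preferences. refer page 68, Volume 2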

Deta Parser word segmentation and its applications in linear text documents.

1 Each indexed map carries its own weights. Based on those weights, Deta Parser uses a matching score system to compute results for Chinese word segmentation search.

2 The search weights follow sentence roles such as subject-verb-object (SVO) and part of speech (POS), for instance noun, verb, and adjective.

3 To accelerate computation, the author adds combination factors to the matching logic, such as bit calculation, frequency statistics, and word-length observations. This is similar in spirit to CountDownLatch and CyclicBarrier logic (define first and then prove, or prove first and then conclude).

4 Once all of the above logic is translated into Java, the author sets all global and local variable scales to build the Foolishman Self-Controller components, which keep the algorithms easy and simple.
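The weight stacking described above can be sketched as bit packing. This is an illustration under assumptions, not the author's scoring formula: the 8-bit field widths, the ordering of the fields, and the names `BitScoreSketch`, `Hit`, `score`, and `rank` are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

class BitScoreSketch {
    static final class Hit {
        final String word;
        final int score;
        Hit(String word, int score) {
            this.word = word;
            this.score = score;
        }
    }

    // Pack three small non-negative values into one int so that a single
    // integer comparison orders by POS weight, then length, then frequency.
    // Each field is masked to 8 bits (an assumption for this sketch).
    static int score(int posWeight, int wordLength, int frequency) {
        return ((posWeight & 0xFF) << 16) | ((wordLength & 0xFF) << 8) | (frequency & 0xFF);
    }

    // Sort hits by packed score, highest first, and return the words.
    static List<String> rank(List<Hit> hits) {
        List<Hit> sorted = new ArrayList<>(hits);
        sorted.sort(Comparator.comparingInt((Hit h) -> h.score).reversed());
        List<String> out = new ArrayList<>();
        for (Hit h : sorted) {
            out.add(h.word);
        }
        return out;
    }

    public static void main(String[] args) {
        List<Hit> hits = Arrays.asList(
            new Hit("理想", score(2, 2, 5)),
            new Hit("非常", score(3, 2, 1)),
            new Hit("是", score(3, 1, 9)));
        System.out.println(rank(hits));
    }
}
```

Changing the shift amounts changes which factor dominates, which is one way to read the tunable ratio between weight and word length.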

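Dynamic POS Function Flow-Valve Fine-Grained Traversal and Kernel Matching

1 Dynamic kernels come in two kinds, the prefix kernel and the postfix kernel. They are updated in real time according to the position currently being analyzed. refer page 97

2 The prefix kernel mainly caches the position and part of speech of words. It is used by the fine-grained POS function flow-valve traversal that checks POS collocations. refer page 97

3 The postfix kernel mainly caches the words that are about to follow in the segmentation chain. It is used for POS grammar correction, such as conjunction matching. refer page 97

4 The kernels use StringBuilder as their carrier to accelerate computation. refer page 97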

Dynamic POS Flow-Gate Function Matching and Looped POS Kernel Computing

1 The dynamic kernel has two types, prefix and postfix. It reads word tokens one by one and computes dynamically at the same time.

2 The prefix kernel keeps a POS cache buffer for each current word, holding information such as position and frequency, to accelerate word matching.


and output. Described by Yaoguang Luo

POS function flow gates and their relationships. As an example, the author segments the sentence '如果是非常理想'. First, through the indexed forest dictionary, Deta Parser cuts '如果是非常理想' into three associated char chains: '如果', '是非常', and '理想'. In this list, '如果' and '理想' are fixed words. '是非常' is a three-char token, so it goes through inner matching under the POS function flow-gate theory. The base corpus map of the author's Deta Parser system contains no word '是非常', so matching continues at two chars. The more powerful part of the algorithm is the Chinese grammar matching system: '是非常' is split in two ways, '是非-常' and '是-非常', and the two segmentations are compared. Each word is analyzed together with its prefix and postfix, combined with POS relationships (the prefix token of '是非' is '如果'; the prefix token of '非常' is '是'; the prefix tokens of '常' are '是非' and '非'; and the prefix tokens of '理想' are '常' and '非常'). This POS segmentation theory is fixed and deterministic, meaning it involves no probabilistic events. If DetaParser finds no associated char relationship at this point, it falls back to the next step and reads and cuts the sequence one char at a time. In the end, the sample graph shows that DetaParser outputs '如果-是-非常' because the priority of (conjunction - adj, v - adj, v) is higher than that of (conjunction - noun - adj, v).

Author: Yaoguang Luo
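The deterministic choice between '是非-常' and '是-非常' can be sketched as a priority lookup on POS pairs. The POS tags, the numeric priorities, and the names `PosGateSketch`, `priority`, and `choose` are assumptions for illustration; the source only states that the conjunction-verb reading outranks the conjunction-noun reading.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class PosGateSketch {
    // Hypothetical POS tags for the tokens of the example sentence.
    static final Map<String, String> POS = new HashMap<>();
    // Hypothetical priorities for (prefix POS > current POS) pairs.
    static final Map<String, Integer> PAIR_PRIORITY = new HashMap<>();

    static {
        POS.put("如果", "conj");
        POS.put("是", "v");
        POS.put("是非", "n");
        POS.put("非常", "adj");
        POS.put("常", "adj");
        PAIR_PRIORITY.put("conj>v", 2);
        PAIR_PRIORITY.put("conj>n", 1);
    }

    // Priority of the pair formed by the prefix token and the first token
    // of a candidate split; unknown pairs get priority 0.
    static int priority(String prefix, List<String> candidate) {
        String key = POS.get(prefix) + ">" + POS.get(candidate.get(0));
        return PAIR_PRIORITY.getOrDefault(key, 0);
    }

    // Deterministic choice: the first candidate with the highest priority.
    static List<String> choose(String prefix, List<List<String>> candidates) {
        List<String> best = candidates.get(0);
        for (List<String> candidate : candidates) {
            if (priority(prefix, candidate) > priority(prefix, best)) {
                best = candidate;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<List<String>> candidates = Arrays.asList(
            Arrays.asList("是非", "常"),
            Arrays.asList("是", "非常"));
        System.out.println(choose("如果", candidates));
    }
}
```

Because the table is fixed, the same input always yields the same split, which matches the text's point that no probability is involved.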

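The coding framework for this algorithm's functions had already appeared on the author's GitHub before 18 March 2019:

https://github.com/yaoguangluo/Deta_Parser/commit/25b90c9847d15df85c5c991448f2c271e0ad8106

Note: the CNN keyword in the history at that link is a wording mistake by the author. At the time the author's academic foundation was limited; the author had only taken a computer-vision theory course on convolution and assumed that anything computed with a kernel was called CNN convolution.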
