How to load an aligned word2vec model in Gensim?

I have a time series dataset. therefore, for each time period I trained a word2vec model and realigned the models.

However, when I try to load the aligned word2vec models as follows I get the below mentioned error.

#Load model

model = word2vec.Word2Vec.load('model_1970')

Error:

train_words_pow += wv.vocab[wv.index2word[word_index]].count**power

KeyError: 'ctrx'

Is there a way to resolve this error? :)

I have attached a sample trained word2vec model that gives error for testing purposes

Link: https://drive.google.com/file/d/1IBbUgeAubr2xzNYLKZgPt34xOEsW92bO/view?usp=sharing

EDIT:
Mention below is the log of my program.

2018-11-30 14:23:43,897 : INFO : loading Word2Vec object from model_1970

2018-11-30 14:23:43,961 : INFO : loading wv recursively from model_1970.wv.* with mmap=None

2018-11-30 14:23:43,965 : INFO : loading vectors from model_1970.wv.vectors.npy with mmap=None

2018-11-30 14:23:44,005 : INFO : setting ignored attribute vectors_norm to None

2018-11-30 14:23:44,009 : INFO : loading vocabulary recursively from model_1970.vocabulary.* with mmap=None

2018-11-30 14:23:44,009 : INFO : loading trainables recursively from model_1970.trainables.* with mmap=None

2018-11-30 14:23:44,009 : INFO : loading syn1neg from model_1970.trainables.syn1neg.npy with mmap=None

2018-11-30 14:23:44,053 : INFO : setting ignored attribute cum_table to None

2018-11-30 14:23:44,053 : INFO : loaded model_1970

Reloaded modules: __mp_main__

Traceback (most recent call last):



  File "<ipython-input-3-3b9230dacba9>", line 1, in <module>

    runfile('C:/Users/Emi/Desktop/code/word2vec_distance.py', wdir='C:/Users/Emi/Desktop/code')



  File "C:UsersEmiAnaconda3libsite-packagesspyder_kernelscustomizespydercustomize.py", line 668, in runfile

    execfile(filename, namespace)



  File "C:UsersEmiAnaconda3libsite-packagesspyder_kernelscustomizespydercustomize.py", line 108, in execfile

    exec(compile(f.read(), filename, 'exec'), namespace)



  File "C:/Users/Emi/Desktop/code/word2vec_distance.py", line 26, in <module>

    model_1 = word2vec.Word2Vec.load(word2vec_model_name_1)



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsword2vec.py", line 975, in load

    return super(Word2Vec, cls).load(*args, **kwargs)



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsbase_any2vec.py", line 631, in load

    model.vocabulary.make_cum_table(model.wv)  # rebuild cum_table from vocabulary



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsword2vec.py", line 1383, in make_cum_table

    train_words_pow += wv.vocab[wv.index2word[word_index]].count**power



KeyError: 'cmnx'

edited Nov 30 '18 at 4:00

asked Nov 26 '18 at 1:58

Emi

332215

Would it be possible to make one of the models available to test?

– Pedro Torres
Nov 26 '18 at 2:05

@PedroTorres Thanks a lot for the comment. I am using the following code to align my word2vec models gist.github.com/quadrismegistus/… My real models are very big. I will do a test model and attach it soon :)

– Emi
Nov 26 '18 at 2:14

@PedroTorres updated the question :)

– Emi
Nov 26 '18 at 2:40

Tried it here. It does show the error. It seems like the problem is with the vocabulary dictionary given by wv.vocab. For some reason it does not have all the words

– Pedro Torres
Nov 26 '18 at 2:49

@PedroTorres I did a modification to the model's vocabulary (i.e aligning the model) using the gensim code in gist.github.com/quadrismegistus/… Any idea how to resolve this issue? :)

– Emi
Nov 26 '18 at 5:23

|
show 6 more comments

I have a time series dataset. therefore, for each time period I trained a word2vec model and realigned the models.

However, when I try to load the aligned word2vec models as follows I get the below mentioned error.

#Load model

model = word2vec.Word2Vec.load('model_1970')

Error:

train_words_pow += wv.vocab[wv.index2word[word_index]].count**power

KeyError: 'ctrx'

Is there a way to resolve this error? :)

I have attached a sample trained word2vec model that gives error for testing purposes

Link: https://drive.google.com/file/d/1IBbUgeAubr2xzNYLKZgPt34xOEsW92bO/view?usp=sharing

EDIT:
Mention below is the log of my program.

2018-11-30 14:23:43,897 : INFO : loading Word2Vec object from model_1970

2018-11-30 14:23:43,961 : INFO : loading wv recursively from model_1970.wv.* with mmap=None

2018-11-30 14:23:43,965 : INFO : loading vectors from model_1970.wv.vectors.npy with mmap=None

2018-11-30 14:23:44,005 : INFO : setting ignored attribute vectors_norm to None

2018-11-30 14:23:44,009 : INFO : loading vocabulary recursively from model_1970.vocabulary.* with mmap=None

2018-11-30 14:23:44,009 : INFO : loading trainables recursively from model_1970.trainables.* with mmap=None

2018-11-30 14:23:44,009 : INFO : loading syn1neg from model_1970.trainables.syn1neg.npy with mmap=None

2018-11-30 14:23:44,053 : INFO : setting ignored attribute cum_table to None

2018-11-30 14:23:44,053 : INFO : loaded model_1970

Reloaded modules: __mp_main__

Traceback (most recent call last):



  File "<ipython-input-3-3b9230dacba9>", line 1, in <module>

    runfile('C:/Users/Emi/Desktop/code/word2vec_distance.py', wdir='C:/Users/Emi/Desktop/code')



  File "C:UsersEmiAnaconda3libsite-packagesspyder_kernelscustomizespydercustomize.py", line 668, in runfile

    execfile(filename, namespace)



  File "C:UsersEmiAnaconda3libsite-packagesspyder_kernelscustomizespydercustomize.py", line 108, in execfile

    exec(compile(f.read(), filename, 'exec'), namespace)



  File "C:/Users/Emi/Desktop/code/word2vec_distance.py", line 26, in <module>

    model_1 = word2vec.Word2Vec.load(word2vec_model_name_1)



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsword2vec.py", line 975, in load

    return super(Word2Vec, cls).load(*args, **kwargs)



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsbase_any2vec.py", line 631, in load

    model.vocabulary.make_cum_table(model.wv)  # rebuild cum_table from vocabulary



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsword2vec.py", line 1383, in make_cum_table

    train_words_pow += wv.vocab[wv.index2word[word_index]].count**power



KeyError: 'cmnx'

edited Nov 30 '18 at 4:00

asked Nov 26 '18 at 1:58

Emi

332215

Would it be possible to make one of the models available to test?

– Pedro Torres
Nov 26 '18 at 2:05

@PedroTorres Thanks a lot for the comment. I am using the following code to align my word2vec models gist.github.com/quadrismegistus/… My real models are very big. I will do a test model and attach it soon :)

– Emi
Nov 26 '18 at 2:14

@PedroTorres updated the question :)

– Emi
Nov 26 '18 at 2:40

Tried it here. It does show the error. It seems like the problem is with the vocabulary dictionary given by wv.vocab. For some reason it does not have all the words

– Pedro Torres
Nov 26 '18 at 2:49

@PedroTorres I did a modification to the model's vocabulary (i.e aligning the model) using the gensim code in gist.github.com/quadrismegistus/… Any idea how to resolve this issue? :)

– Emi
Nov 26 '18 at 5:23

|
show 6 more comments

I have a time series dataset. therefore, for each time period I trained a word2vec model and realigned the models.

However, when I try to load the aligned word2vec models as follows I get the below mentioned error.

#Load model

model = word2vec.Word2Vec.load('model_1970')

Error:

train_words_pow += wv.vocab[wv.index2word[word_index]].count**power

KeyError: 'ctrx'

Is there a way to resolve this error? :)

I have attached a sample trained word2vec model that gives error for testing purposes

Link: https://drive.google.com/file/d/1IBbUgeAubr2xzNYLKZgPt34xOEsW92bO/view?usp=sharing

EDIT:
Mention below is the log of my program.

2018-11-30 14:23:43,897 : INFO : loading Word2Vec object from model_1970

2018-11-30 14:23:43,961 : INFO : loading wv recursively from model_1970.wv.* with mmap=None

2018-11-30 14:23:43,965 : INFO : loading vectors from model_1970.wv.vectors.npy with mmap=None

2018-11-30 14:23:44,005 : INFO : setting ignored attribute vectors_norm to None

2018-11-30 14:23:44,009 : INFO : loading vocabulary recursively from model_1970.vocabulary.* with mmap=None

2018-11-30 14:23:44,009 : INFO : loading trainables recursively from model_1970.trainables.* with mmap=None

2018-11-30 14:23:44,009 : INFO : loading syn1neg from model_1970.trainables.syn1neg.npy with mmap=None

2018-11-30 14:23:44,053 : INFO : setting ignored attribute cum_table to None

2018-11-30 14:23:44,053 : INFO : loaded model_1970

Reloaded modules: __mp_main__

Traceback (most recent call last):



  File "<ipython-input-3-3b9230dacba9>", line 1, in <module>

    runfile('C:/Users/Emi/Desktop/code/word2vec_distance.py', wdir='C:/Users/Emi/Desktop/code')



  File "C:UsersEmiAnaconda3libsite-packagesspyder_kernelscustomizespydercustomize.py", line 668, in runfile

    execfile(filename, namespace)



  File "C:UsersEmiAnaconda3libsite-packagesspyder_kernelscustomizespydercustomize.py", line 108, in execfile

    exec(compile(f.read(), filename, 'exec'), namespace)



  File "C:/Users/Emi/Desktop/code/word2vec_distance.py", line 26, in <module>

    model_1 = word2vec.Word2Vec.load(word2vec_model_name_1)



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsword2vec.py", line 975, in load

    return super(Word2Vec, cls).load(*args, **kwargs)



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsbase_any2vec.py", line 631, in load

    model.vocabulary.make_cum_table(model.wv)  # rebuild cum_table from vocabulary



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsword2vec.py", line 1383, in make_cum_table

    train_words_pow += wv.vocab[wv.index2word[word_index]].count**power



KeyError: 'cmnx'

edited Nov 30 '18 at 4:00

asked Nov 26 '18 at 1:58

Emi

332215

I have a time series dataset. therefore, for each time period I trained a word2vec model and realigned the models.

However, when I try to load the aligned word2vec models as follows I get the below mentioned error.

#Load model

model = word2vec.Word2Vec.load('model_1970')

Error:

train_words_pow += wv.vocab[wv.index2word[word_index]].count**power

KeyError: 'ctrx'

Is there a way to resolve this error? :)

I have attached a sample trained word2vec model that gives error for testing purposes

Link: https://drive.google.com/file/d/1IBbUgeAubr2xzNYLKZgPt34xOEsW92bO/view?usp=sharing

EDIT:
Mention below is the log of my program.

2018-11-30 14:23:43,897 : INFO : loading Word2Vec object from model_1970

2018-11-30 14:23:43,961 : INFO : loading wv recursively from model_1970.wv.* with mmap=None

2018-11-30 14:23:43,965 : INFO : loading vectors from model_1970.wv.vectors.npy with mmap=None

2018-11-30 14:23:44,005 : INFO : setting ignored attribute vectors_norm to None

2018-11-30 14:23:44,009 : INFO : loading vocabulary recursively from model_1970.vocabulary.* with mmap=None

2018-11-30 14:23:44,009 : INFO : loading trainables recursively from model_1970.trainables.* with mmap=None

2018-11-30 14:23:44,009 : INFO : loading syn1neg from model_1970.trainables.syn1neg.npy with mmap=None

2018-11-30 14:23:44,053 : INFO : setting ignored attribute cum_table to None

2018-11-30 14:23:44,053 : INFO : loaded model_1970

Reloaded modules: __mp_main__

Traceback (most recent call last):



  File "<ipython-input-3-3b9230dacba9>", line 1, in <module>

    runfile('C:/Users/Emi/Desktop/code/word2vec_distance.py', wdir='C:/Users/Emi/Desktop/code')



  File "C:UsersEmiAnaconda3libsite-packagesspyder_kernelscustomizespydercustomize.py", line 668, in runfile

    execfile(filename, namespace)



  File "C:UsersEmiAnaconda3libsite-packagesspyder_kernelscustomizespydercustomize.py", line 108, in execfile

    exec(compile(f.read(), filename, 'exec'), namespace)



  File "C:/Users/Emi/Desktop/code/word2vec_distance.py", line 26, in <module>

    model_1 = word2vec.Word2Vec.load(word2vec_model_name_1)



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsword2vec.py", line 975, in load

    return super(Word2Vec, cls).load(*args, **kwargs)



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsbase_any2vec.py", line 631, in load

    model.vocabulary.make_cum_table(model.wv)  # rebuild cum_table from vocabulary



  File "C:UsersEmiAnaconda3libsite-packagesgensimmodelsword2vec.py", line 1383, in make_cum_table

    train_words_pow += wv.vocab[wv.index2word[word_index]].count**power



KeyError: 'cmnx'

python nlp gensim word2vec

edited Nov 30 '18 at 4:00

asked Nov 26 '18 at 1:58

Emi

332215

edited Nov 30 '18 at 4:00

asked Nov 26 '18 at 1:58

Emi

332215

edited Nov 30 '18 at 4:00

asked Nov 26 '18 at 1:58

Emi

332215

asked Nov 26 '18 at 1:58

Emi

332215

asked Nov 26 '18 at 1:58

Emi

332215

Would it be possible to make one of the models available to test?

– Pedro Torres
Nov 26 '18 at 2:05

@PedroTorres Thanks a lot for the comment. I am using the following code to align my word2vec models gist.github.com/quadrismegistus/… My real models are very big. I will do a test model and attach it soon :)

– Emi
Nov 26 '18 at 2:14

@PedroTorres updated the question :)

– Emi
Nov 26 '18 at 2:40

Tried it here. It does show the error. It seems like the problem is with the vocabulary dictionary given by wv.vocab. For some reason it does not have all the words

– Pedro Torres
Nov 26 '18 at 2:49

@PedroTorres I did a modification to the model's vocabulary (i.e aligning the model) using the gensim code in gist.github.com/quadrismegistus/… Any idea how to resolve this issue? :)

– Emi
Nov 26 '18 at 5:23

|
show 6 more comments

Would it be possible to make one of the models available to test?

– Pedro Torres
Nov 26 '18 at 2:05

@PedroTorres Thanks a lot for the comment. I am using the following code to align my word2vec models gist.github.com/quadrismegistus/… My real models are very big. I will do a test model and attach it soon :)

– Emi
Nov 26 '18 at 2:14

@PedroTorres updated the question :)

– Emi
Nov 26 '18 at 2:40

Tried it here. It does show the error. It seems like the problem is with the vocabulary dictionary given by wv.vocab. For some reason it does not have all the words

– Pedro Torres
Nov 26 '18 at 2:49

@PedroTorres I did a modification to the model's vocabulary (i.e aligning the model) using the gensim code in gist.github.com/quadrismegistus/… Any idea how to resolve this issue? :)

– Emi
Nov 26 '18 at 5:23

Would it be possible to make one of the models available to test?

– Pedro Torres
Nov 26 '18 at 2:05

@PedroTorres Thanks a lot for the comment. I am using the following code to align my word2vec models gist.github.com/quadrismegistus/… My real models are very big. I will do a test model and attach it soon :)

– Emi
Nov 26 '18 at 2:14

@PedroTorres updated the question :)

– Emi
Nov 26 '18 at 2:40

Tried it here. It does show the error. It seems like the problem is with the vocabulary dictionary given by wv.vocab. For some reason it does not have all the words

– Pedro Torres
Nov 26 '18 at 2:49

@PedroTorres I did a modification to the model's vocabulary (i.e aligning the model) using the gensim code in gist.github.com/quadrismegistus/… Any idea how to resolve this issue? :)

– Emi
Nov 26 '18 at 5:23

|
show 6 more comments

0

active

oldest

votes

Your Answer

StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53473868%2fhow-to-load-an-aligned-word2vec-model-in-gensim%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

0

active

oldest

votes

0

active

oldest

votes

draft saved

draft discarded

Thanks for contributing an answer to Stack Overflow!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Ytukyg