Working with text

Basilica provides two functions for working with text:

  • embed_sentence
  • embed_sentences

The embed_sentence function takes a single character vector (a string) and returns a vector of features. The embed_sentences function takes a list of character vectors and returns a matrix with one feature vector per row.

embed_sentence

sentences <- c(
  "This is a sentence!",
  "This is a similar sentence!",
  "I don't think this sentence is very similar at all..."
)

library('basilica')
conn <- connect("SLOW_DEMO_KEY")

# Embed each sentence separately; each call returns one feature vector.
embeddings <- list()
embeddings[[1]] <- embed_sentence(sentences[[1]], conn=conn)
embeddings[[2]] <- embed_sentence(sentences[[2]], conn=conn)
embeddings[[3]] <- embed_sentence(sentences[[3]], conn=conn)

print(cor(embeddings[[1]], embeddings[[2]]))
print(cor(embeddings[[1]], embeddings[[3]]))

embed_sentences

sentences <- c(
  "This is a sentence!",
  "This is a similar sentence!",
  "I don't think this sentence is very similar at all..."
)

library('basilica')
conn <- connect("SLOW_DEMO_KEY")

# Embed all sentences in one call; row i of the result is the feature vector for sentence i.
embeddings <- embed_sentences(sentences, conn=conn)

print(cor(embeddings[1,], embeddings[2,]))
print(cor(embeddings[1,], embeddings[3,]))
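
The row-by-row comparisons above can be extended to every pair of sentences at once. The following is a minimal sketch, not part of Basilica itself: it uses a small made-up matrix in place of real embeddings (the values and the 4-column width are assumptions for illustration only) and relies on base R's cor, which correlates columns, so the matrix is transposed first.

```r
# Stand-in for the matrix returned by embed_sentences: one row per sentence.
# These numbers are invented for illustration; they are not Basilica output,
# and real embeddings have many more columns.
embeddings <- rbind(
  c(0.9, 0.1, 0.4, 0.7),
  c(0.8, 0.2, 0.5, 0.6),
  c(0.1, 0.9, 0.7, 0.2)
)

# cor() correlates the columns of a matrix, so transpose it
# to get one correlation per pair of rows (sentences).
similarity <- cor(t(embeddings))

# similarity[i, j] equals cor(embeddings[i,], embeddings[j,]).
print(round(similarity, 3))
```

With a real embeddings matrix, similarity[1, 2] and similarity[1, 3] should match the two values printed in the example above.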