using regex with spacy