I built a regexp to strip away everything other than website names from a buffer: "\\\\(?:\\\\(?:\\n\\\\|.\\\\)*?\\\\